Exponential structure of income inequality: evidence from 67 countries

Tao, Yong; Wu, Xiangjun; Zhou, Tao; Yan, Weibo; Huang, Yanyuxiang; Yu, Han; Mondal, Benedict; Yakovenko, Victor M.

doi:10.1007/s11403-017-0211-6

Exponential structure of income inequality: evidence from 67 countries

Regular Article
Published: 27 December 2017

Volume 14, pages 345–376, (2019)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Journal of Economic Interaction and Coordination Aims and scope Submit manuscript

Exponential structure of income inequality: evidence from 67 countries

Download PDF

Yong Tao ORCID: orcid.org/0000-0003-2160-9467¹,
Xiangjun Wu³,
Tao Zhou⁴,
Weibo Yan⁵,
Yanyuxiang Huang²,
Han Yu²,
Benedict Mondal⁶ &
…
Victor M. Yakovenko⁷

2672 Accesses
39 Citations
20 Altmetric
1 Mention
Explore all metrics

Abstract

Economic competition between humans leads to income inequality, but, so far, there has been little understanding of underlying quantitative mechanisms governing such a collective behavior. We analyze datasets of household income from 67 countries, ranging from Europe to Latin America, North America and Asia. For all of the countries, we find a surprisingly uniform rule: income distribution for the great majority of populations (low and middle income classes) follows an exponential law. To explain this empirical observation, we propose a theoretical model within the standard framework of modern economics and show that free competition and Rawls’ fairness are the underlying mechanisms producing the exponential pattern. The free parameters of the exponential distribution in our model have an explicit economic interpretation and direct relevance to policy measures intended to alleviate income inequality.

Household income distribution in the USA

Article 28 March 2016

The fall in income inequality during COVID-19 in four European countries

Article Open access 08 August 2021

Declining inequality in Latin America? Robustness checks for Peru

Article 01 March 2022

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Economic inequality is a universal phenomenon in human societies. Although there are broad patterns of economic inequality between countries, their sources are poorly understood and hotly debated (Kuznets 1955; Acemoglu and Robinson 2009; Autor 2014; Piketty and Saez 2014; Ravallion 2014; Nishi et al. 2015). To explain the origin of economic inequality, researchers put forward different mechanisms, such as institutional structures (Acemoglu and Robinson 2009; Piketty and Saez 2014), technological progress (Acemoglu and Robinson 2009; Autor 2014), economic growth (Ravallion 2014), psychological factor (Nishi et al. 2015), and so on. In fact, because economic inequality involves many different aspects (e.g. income, wealth, social status, and so on) in human societies, seeking a universal pattern of economic inequality seems an impossible task. Nevertheless, some researchers tried to find universal patterns in income inequality. An influential economist Vilfredo Pareto proposed that income distribution in a society is well described by a power law (Pareto 1897). Although many studies have confirmed that the high-income class of populations follows a power law (Mandelbrot 1960; Kakwani 1980), there is increasing evidence showing that it does not apply to the majority of population with lower income. Using income data for USA, Yakovenko and Rosser (2009) have shown that the US society has a well-defined two-class structure (Dragulescu and Yakovenko 2001a, b; Silva and Yakovenko 2005; Yakovenko and Rosser 2009; Banerjee and Yakovenko 2010): the great majority of population (low and middle income class) follows an exponential law, while the remaining part (high income class) follows a power law. Dragulescu and Yakovenko proposed a thermal equilibrium theory based on statistical mechanics to explain the exponential pattern of income distribution (Dragulescu and Yakovenko 2000), which has won more and more support from recent empirical studies (Nirei and Souma 2007; Derzsy et al. 2012; Jagielski and Kutner 2013; Shaikh et al. 2014; Shaikh 2016; Oancea et al. 2016). However, it should be noted that the exponential law does not fit the super-low income data, which are usually fitted by log-normal or gamma distributions (Banerjee et al. 2006; Chakrabarti et al. 2013). Moreover, although the exponential law is quite successful in describing the low and middle income data, the mechanism of thermal equilibrium is questioned by mainstream economists (Cho 2014). These economists argue that the thermal theory of income distribution lacks solid economic foundation (Cho 2014), and so is unhelpful in making policy recommendations. In response to this criticism, we show in this paper that the exponential law of income distribution can be derived from the principles of free competition and Rawls’ fairness (Rawls 1999), thus giving it a solid economic foundation (Tao 2015, 2016). Because we introduce a rigorous economic treatment, the scope of applicability of the exponential distribution is determined, and we can explain why it fails to fit super-low and high income data.

Furthermore, our results can be formulated as a powerful complement for the existing literatures. Understanding of the social impact and quantitative characterization of income inequality is a subject of great social and political importance. For the quantitative characterization of inequality, while there are plenty of case-by-case studies (Piketty and Saez 2003; Piketty 2003; Banerjee et al. 2006; Piketty and Qian 2009; Clementi et al. 2010, 2012; Jagielski and Kutner 2013; Shaikh et al. 2014; Saez and Zucman 2016; Oancea et al. 2016), most of them do not recognize the underlying universal quantitative structure of income inequality, i.e., do not see the forest for the trees. Here we present overwhelming empirical evidence, derived from the datasets for 67 countries, that the low and middle part of income distribution follows a universal exponential law. More importantly, relative to other existing distributions, the fitting parameters in our distribution have an explicit economic interpretation and direct relevance to policy measures intended to alleviate income inequality. For the social impact of inequality, there are two strands of literatures. One line focuses on how the market structure and institution influences income inequality (Katz and Autor 1999; Autor et al. 2008; Heathcote et al. 2010; Moretti 2013). The other line investigates the mechanism of redistribution reducing income inequality (Piketty and Saez 2003; Piketty 2003; Atkinson et al. 2011; Golosov et al. 2013; Jones 2015). In this paper, we make an attempt to combine these two lines. On one side of the empirical investigation, we show that free economy exhibits a universal two-class structure: The great majority of population (low and middle income class) follows an exponential law, while the remaining part (high income class) follows a power law. On the other side of theoretical research, we show that the exponential income structure is a result of combining free competition and Rawls’ fairness, while the power income structure is due to the rule “the rich get richer” (i.e. the Matthew effect). To reduce the degree of inequality, we propose that the redistribution policy should be based on the principle of levying a tax on high-income class to pay the unemployment compensation, in line with Piketty’s policy propositions.

2 Exponential income distribution

In fact, mathematical apparatus of modern economics has been strongly influenced by physics. Following Newton’s paradigm of classical mechanics, the famous economist Leon Walras developed a set of equations that describe economic equilibrium (Walras 2003). These equations opened the paradigm of “neoclassical economics” and later were perfected by Arrow and Debreu (1954). Now these equations are called the “Arrow–Debreu’s general equilibrium model” (ADGEM), which is the well-known standard model of modern economics (Mas-Collel et al. 1995). Using such a model, one can illustrate why the equilibrium allocation of social resources, in which every social member obtains maximum satisfaction, exists in an “ideal institutional environment” that ensures reasonable private property rights and judicial justice. Following a mainstream economic approach, we use ADGEM in this paper to study the equilibrium income allocation among social members. Thus, we can observe that how macro-level pattern of income inequality arises from micro-level competitive interactions of individuals embedded within an ideal institutional environment.^{Footnote 1}

Without loss of generality, we consider an “N-person non-cooperative game”, where are N consumers (or agents), each of whom operates a firm, so there are N firms. Following the basic assumptions of neoclassical economics, each consumer should be selfish and have infinite desire; therefore, all of these firms will pursue maximum profit, and all of these consumers will exchange with each other to obtain maximum satisfaction. Furthermore, if a consumer is employed in a firm that he does not operate, he will obtain the ownership share of that firm. Because consumer i operates firm i, his income consists of the firm’s operational revenue and the returns on holding the shares from other firms, where $i=1,2,\ldots ,N$. All of these settings are explained in detail in “Appendix A”. In accordance with the basic settings of ADGEM, all the firms should be sufficiently competitive so that monopoly cannot arise; therefore, by the rule of gaining income above, each firm actually looks like a self-employed household or a small trader. This means that we can use household income data to test validity of our upcoming model. As the Pareto optimal solution to ADGEM that captures all of these settings above, Tao proved that (Tao 2015, 2016), in the long-run competition, each consumer’s equilibrium income should be completely random, and obeys the following constraint:

$$\begin{aligned} \left\{ {{\begin{array}{l} I_i \ge 0 \quad for \quad i=1,2,\ldots ,N \\ \mathop \sum \nolimits _{i=1}^N I_i =Y \\ \end{array} }} \right. \end{aligned}$$

(1)

where $I_i $ denotes the equilibrium income of the $i\hbox {th}$ consumer and Y denotes GDP (Gross Domestic Product).

Here we use $A=\left\{ {I_1 ,I_2 ,\ldots ,I_N } \right\} $ to specify an “equilibrium income allocation” (EIA) among N consumers. Due to the randomness of Pareto optimal solution (1), there is a large number of EIAs. To eliminate uncertainty of optimal allocations, the proposal of traditional economists is to seek the best one by using a social preference function (Mas-Collel et al. 1995). Unfortunately, Arrow’s Impossibility Theorem has denied the existence of social preference (Arrow 1963). This is the well-known “dilemma of social choice”. However, Tao proposes that such a dilemma can be avoided by using the paradigm of natural selection (Tao 2016). To be specific, regarding each EIA as a random event and income distribution as a set of EIAs, we make a conjecture that, among all possible income distributions, the one endowed with the largest probability, i.e. the likeliest, ought to be selected, so it is survival of the likeliest (Whitfield 2007; Harte et al. 2008; Tao 2010, 2016). If our conjecture is right, we should expect the household income data to exhibit such an income distribution.

The focus of this paper is on income distribution in a democratic economy. To find the probability of each income distribution occurring under the democratic environment, we apply Rawls’ justice principle of fair equality of opportunity (Rawls 1999) to ADGEM. Since ADGEM is an ideally just procedure, fair equality of opportunity indicates that each EIA should occur with an equal probability (Tao 2016). Rawls’ fairness in a democratic economy means that the door of opportunity is open to all social members (Rawls 1999). Rawls’ fairness principle is illustrated for an example of “2-person allocation” in “Appendix B”. When Rawls’ fairness principle is applied to “N-person allocation” subject to constraints (1) where N and Y are large enough, we find that the exponential income distribution occurs with the highest probability (detailed derivation is given in “Appendix C”):

$$\begin{aligned} \left\{ {{\begin{array}{l} f\left( x \right) =\frac{1}{\theta }e^{\frac{- ({x-\mu })}{\theta }} \\ x\ge \mu \\ \end{array} }} \right. \end{aligned}$$

(2)

or equivalently

$$\begin{aligned} \left\{ {{\begin{array}{l} P\left( {t\ge x} \right) =e^{\frac{-({x-\mu })}{\theta }} \\ x\ge \mu \\ \end{array} }} \right. . \end{aligned}$$

(3)

Here x denotes income level, $f\left( x \right) $ is the probability density of income x, and $P\left( {t\ge x} \right) $ is the cumulative probability distribution, i.e. the fraction of population with the income higher than x.

The free parameters $\mu $ and $\theta $ denote marginal labor-capital return and marginal technology return (Tao 2010, 2016), respectively (see “Appendix D”). The constraint $x\ge \mu $ is considered as the Rational Agent Hypothesis (Tao 2010) in neoclassical economics, which states that firms (or agents) enter the market if and only if they can gain the marginal labor-capital return at least to pay for the cost; otherwise they will make a loss. Such a hypothesis explains why the exponential distribution fails to fit the super-low income data at x lower than $\mu $. This is one limitation to applicability of the exponential income distribution (2). On the other hand, by the settings of ADGEM, each firm is sufficiently competitive, and hence looks like a self-employed household; therefore, the exponential income distribution (2) does not fit super-rich people (high income class) who should operate large firms (or monopolistic firms).^{Footnote 2} Thus, income distribution of super-rich people obeys a power law (Axtell 2001) due to the rule “the rich get richer” (Tao 2015) (i.e. the Matthew effect) rather than Rawls’ equal opportunities. Consequently, when we fit income data using the exponential distribution (2), we should drop super-low and high income data. Finally, we point out that other scholars (Foley 1994; Chakrabarti and Chakrabarti 2009; Venkatasubramanian et al. 2015) have also applied the concepts of Rawls’ fairness, utility and maximum entropy to derive income distribution; however, our derivation has the advantage of being based on ADGEM and specifying the range of applicability of the theoretical distribution.

3 Empirical test for 67 countries

We can estimate the values of $\mu $ and $\theta $ by fitting empirical income data to the cumulative probability distribution given in Eq. (3). The datasets employed in this paper come from many sources at the country level and consist of income data for a large sample of percentiles. Using data for a wide span of years, we obtain a dataset of 67 countries around the world, especially European and Latin American countries. The sources of data are fully described in the “Appendix F”. Because our model is based on the ADGEM, which describes an ideal market economy, we expect the exponential distribution to be applicable for the well-developed market economies. To this end, we primarily focus on OECD countries, for which it is also easier to find detailed and reliable income distribution data. Outside OECD, it is often difficult to get detailed-enough, reliable data in the appropriate format. So, the 67 countries analyzed in this paper are those for which we managed to find the data from the sources listed in “Appendix F”. Further effort would be desirable to expand the list of countries in future work. From the household income data, which is classified into macro income quantile data, obtained for each country, we compute the cumulative distribution of income $P\left( {t\ge x} \right) $, which is the ratio of the number of social members whose income is larger than x to the total population.

Following our theoretical construction, the empirical analysis is conducted in two steps. First, we take logs to the values of the cumulative distribution equation and run a step-by-step ordinary least square (OLS) regression to the sample. Since we investigate the relationship between cumulative distribution of income and income level, according to the scope of applicability of exponential income distribution, we need to drop the high-income samples, as they follow a power law (Axtell 2001; Tao 2015). Resorting to the goodness of fit criteria, we select the samples based on the largest adjusted $R^{2}$ values criteria. To be specific, we first take logs to Eq. (3),

$$\begin{aligned} ln\,\left[ {P\left( {t\ge x} \right) } \right] =y=\beta x+\alpha +\varepsilon , \end{aligned}$$

(4)

where $\beta =-1/\theta $, $\alpha =\mu /\theta $, then we regress y on x using the OLS method. In the second step, based on the regression results obtained from the first step, we compute the value of marginal labor-capital return $\mu $, which equals to the inverse of the ratio of the intercept to the slope coefficient, that is $\mu =-\alpha /\beta $. Furthermore, by Rational Agent Hypothesis, we drop the super-low income samples whose values are less than $\mu $, and again we run an OLS regression to the “new” sample.

To illustrate our testing process, we first apply the aforementioned empirical strategy to United Kingdom. In the years of 1999–2000 to 2013–2014, following the maximal adjusted $R^{2}$ rule, we drop the super high income data first. According to our theoretical formulation, the high income people do not conform to the assumptions of ADGEM. In fact, the number of these people is relatively small, but their total income is quite large. When the top-income samples are removed, based on the regression parameters of Eq. (3), we get the value of $\mu $, then we further drop the samples whose values are less than $\mu $. Once again, we run an OLS regression on the purged data to fit the data to our exponential distribution. For comparison, we also fit the data on the full sample; see the two panels in Fig. 1 for details. Likewise, the same empirical testing procedure is applied to other countries around the world. The results of fitting are shown by Figs. 2 and 3. Figure 2 shows 34 mostly European countries for which Eurostat data are available, and Fig. 3 shows 32 countries and HongKong SAR from other areas. One can observe visually that agreement between theory and empirical data is very good. Furthermore, the goodness of fit parameters for the exponential income distribution (3) to 67 countries are reported in Tables S1-S3 (see Supplementary Material). We show that the adjusted $R^{2}$ of almost all these countries approach 0.99.

Here we point out that the method of removing the high income class using the maximized $R^{2}$ can be regarded as a filtering procedure. By our model, the exponential function fits the middle range of income distribution, so it is necessary to filter out the data at the high and low ends of distribution to reveal the exponential pattern. The filtering is always inevitable in any data analysis performed to extract signal from noisy or mixed data, so it is not an absolute question of data integrity, but rather a practical one of whether the filtering procedure is reasonable or not. We believe that our procedure above is reasonably reliable and convincing, because it converges after removal of a quite small fraction of the data. Later, we will observe that the estimated value of $\mu $ produced by the filtering procedure for maximized $R^{2}$ indeed agrees with empirical data. Despite this, we still do not verify that the estimate of $\mu $ produced by the filtering procedure is consistent. In fact, because we only collect the sample data of household income, we must prove that the estimated value of $\mu $ sufficiently approaches the true value when the number of sample is large enough; otherwise, we cannot guarantee that the estimate of $\mu $ proposed by us is consistent. In next section, we will show that the estimate of $\mu $ produced by filtering procedure is consistent.

4 Consistent estimate of $\mu $

In Sect. 3, we have shown that the exponential distribution (3) remarkably fits the low and middle parts of household income data from 67 countries. The only problem is that we don’t know if the fitting procedure produces the consistent estimate of $\mu $. For the full data (i.e., population), the Eq. (4) can be written as:

$$\begin{aligned} y_j =\beta ^{*}x_j +\alpha ^{*}+\varepsilon _j , \end{aligned}$$

(5)

where $\beta ^{*}=-\frac{1}{\theta ^{*}}$, $\alpha ^{*}=\frac{\mu ^{*}}{\theta ^{*}}$, and $\varepsilon _j \sim N\left( {0,\sigma ^{2}} \right) $ for $j=1,2,\ldots ,\infty $. Here $\left\{ {x_j } \right\} _{j=1}^\infty $ and $\left\{ {y_j } \right\} _{j=1}^\infty $ denote the full data. $\beta ^{*}$ and $\alpha ^{*}$ are obtained by regressing $\left\{ {y_j } \right\} _{j=1}^\infty $ on $\left\{ {x_j } \right\} _{j=1}^\infty $.

It must be noted that, due to the constraint $x\ge \mu $, the Eq. (3) differs slightly from the Eq. (5). Therefore, we cannot ensure if $\mu ^{*}=\mu $. In fact, the Eq. (3) implies that $\left\{ {x_j } \right\} _{j=1}^\infty $ should be a strictly monotonic increasing sequence with $x_j \ge 0$ for $j=1,2,\ldots ,\infty $. More importantly, it indicates that there exists a positive integer $g^{*}$ to guarantee $x_k \ge \mu $ for $k=g^{*},g^{*}+1,\ldots ,\infty $. This means that, for the full data, the Eq. (3) should be written as:

$$\begin{aligned} \left\{ {{\begin{array}{l} y_k =\beta x_k +\alpha +\varepsilon _k \\ x_k \ge \mu \\ \end{array} }} \right. , \end{aligned}$$

(6)

where $\beta =-\frac{1}{\theta }$, $\alpha =\frac{\mu }{\theta }$, and $\varepsilon _k \sim N\left( {0,\sigma ^{2}} \right) $ for $k=g^{*},g^{*}+1,\ldots ,\infty $. Here $\beta $ and $\alpha $ are obtained by regressing $\left\{ {y_j } \right\} _{j=g^{*}}^\infty $ on $\left\{ {x_j } \right\} _{j=g^{*}}^\infty $.

By Lemma 4 in “Appendix E”, we have proved that if $g^{*}<\infty $, then one has $\beta =\beta ^{*}$, $\alpha =\alpha ^{*}$. Therefore, the Eq. (6) can be rewritten in the form:

$$\begin{aligned} \left\{ {{\begin{array}{l} y_k =\beta ^{*}x_k +\alpha ^{*}+\varepsilon _k \\ x_k \ge \mu ^{*} \\ \end{array} }} \right. , \end{aligned}$$

(7)

where $k=g^{*},g^{*}+1,\ldots ,\infty $ and $g^{*}<\infty $.

Obviously, our purpose is to find $\mu $. The Eq. (7) reminds us that if one can collect the full data $\left\{ {x_j } \right\} _{j=1}^\infty $ and $\left\{ {y_j } \right\} _{j=1}^\infty $, then $\mu $ can be obtained by computing $\mu ^{*}$. Unfortunately, nobody can collect full data, so it’s impossible to obtain the Eq. (7). However, based on sample data $\left\{ {x_l } \right\} _{l=1}^n $ and $\left\{ {y_l } \right\} _{l=1}^n $, we can consider the following statistical estimate equation:

$$\begin{aligned} \left\{ \begin{array}{l} \hat{y} _i =\hat{\beta } _g x_i +\hat{\alpha } _g \\ x_i \ge \hat{\mu } _g \\ \end{array} \right. , \end{aligned}$$

(8)

where $i=g,g+1,\ldots ,n$ and n denotes sample size. It’s worth emphasizing that $g=g\left( n \right) $ is undetermined.

Here

$$\begin{aligned} \hat{\beta }_g= & {} \frac{\mathop \sum \nolimits _{i=g}^n \left( {x_i -\bar{x} _g } \right) \left( {y_i -\bar{y} _g } \right) }{\mathop \sum \nolimits _{i=g}^n \left( {x_i -\bar{x} _g } \right) ^{2}}, \end{aligned}$$

(9)

$$\begin{aligned} \hat{\alpha } _g= & {} \bar{y} _g -\hat{\beta } _g \bar{x} _g , \end{aligned}$$

(10)

$$\begin{aligned} \hat{\mu } _g= & {} -\frac{\hat{\alpha } _g }{\hat{\beta } _g }, \end{aligned}$$

(11)

$$\begin{aligned} \bar{x} _g= & {} \frac{1}{n-g+1}\mathop \sum \nolimits _{i=g}^n x_i , \end{aligned}$$

(12)

$$\begin{aligned} \bar{y} _g= & {} \frac{1}{n-g+1}\mathop \sum \nolimits _{i=g}^n y_i . \end{aligned}$$

(13)

Due to the absence of full data, we cannot obtain $\mu $. However, we hope $\hat{\mu } _g \rightarrow \mu $ if $n\rightarrow \infty $. In “Appendix E”, we have proved the following proposition:

Proposition 3

For a strictly monotonic increasing sequence $\left\{ {x_j } \right\} _{j=1}^n $, if there exists an integer $g=g\left( n\right) $ to guarantee:

(A).
$x_{i-1}<\mu <x_i $ or $x_i =\mu $, where $i=g<n$ and $lim_{n\rightarrow \infty } \frac{g}{n}=0$;
(B).
$\frac{\bar{y} _g }{\hat{\beta } _g }>\delta >0$ for any n;

then one has:
$$\begin{aligned} lim_{n\rightarrow \infty } \hat{\mu } _g =lim_{n\rightarrow \infty } \left( {\bar{x} _g -\frac{\bar{y} _g }{\hat{\beta } _g }} \right) =\mu , \end{aligned}$$
(14)
where g is uniquely determined by n and $g<\infty $. This means:
$$\begin{aligned} lim_{n\rightarrow \infty } g=g^{*}. \end{aligned}$$
(15)

Proof

See “Appendix E”. $\square $

Proposition 3 indicates that $\hat{\mu } _g $ is a consistent estimate if (A) and (B) hold. That is to say, if the sample size n is large enough, we expect that $\hat{\mu } _g $ is extremely close to $\mu $. Because nobody can obtain $\mu $, our purpose can be changed to find a value close to $\mu $. Obviously, Proposition 3 implies that the estimate $\hat{\mu } _g $ will provide such a value. Next we show that (B) can be related to the correlation coefficient between $\left\{ {x_i } \right\} _{i=g}^n $ and $\left\{ {y_i } \right\} _{i=g}^n $.

Lemma 5

If $y_i <0$ for $i=1,\ldots ,n$, and if $r_g <0$ for any n, then one has $\frac{\bar{y} _g }{\hat{\beta } _g }>0$ for any n, where $r_g =\frac{\mathop \sum \nolimits _{i=g}^n ( {x_i -\bar{x} _g } )( {y_i -\bar{y} _g } )}{\sqrt{\mathop \sum \nolimits _{i=g}^n ( {x_i -\bar{x} _g } )^{2}\cdot \mathop \sum \nolimits _{i=g}^n ( {y_i -\bar{y} _g } )^{2}}}$ denotes the correlation coefficient between $\{ {x_i } \}_{i=g}^n $ and $\{ {y_i } \}_{i=g}^n $.

Proof

By Eq. (9) we have :

$$\begin{aligned} r_g =\hat{\beta }_g \cdot \sqrt{\frac{\mathop \sum \nolimits _{i=g}^n \left( {x_i -\bar{x} _g } \right) ^{2}}{\mathop \sum \nolimits _{i=g}^n \left( {y_i -\bar{y} _g } \right) ^{2}}}. \end{aligned}$$

(16)

Thus, if $r_g <0$ for any n, one concludes $\hat{\beta } _g <0$ for any n, where we have used the Assumptions (b) and (c) in “Appendix E”. Since $y_i <0$ leads to $\bar{y} _g <0$, we conclude^{Footnote 3}$\frac{\bar{y} _g }{\hat{\beta } _g }>0$ for any n. $\square $

By using Lemma 5, Proposition 3 leads to the following corollary.

Corollary 1

For a strictly monotonic increasing sequence $\left\{ {x_j } \right\} _{j=1}^n $, if $y_j <0$ for $j=1,\ldots ,n$, and if there exists an integer $g=g\left( n \right) $ to guarantee:

(C).
$x_{i-1}<\mu <x_i $ or $x_i =\mu $, where $i=g<n$ and $lim_{n\rightarrow \infty } \frac{g}{n}=0$;
(D).
$r_g<\gamma <0$ for any n; then one has:
$$\begin{aligned} lim_{n\rightarrow \infty } \hat{\mu } _g =\mu , \end{aligned}$$
where g is uniquely determined by n and $g<\infty $. This means:
$$\begin{aligned} lim_{n\rightarrow \infty } g=g^{*}. \end{aligned}$$

Proof

Using Proposition 3 and Lemma 5 we complete this proof. $\square $

Obviously, Eq. (3) implies $y_i <0$ for $i=g,g+1,\ldots ,n$. Therefore, we can employ the Corollary 1 to seek $\hat{\mu } _g $. The step is as below:

First, we seek the minimal l to satisfy $r_l <0$ for $\left\{ {x_i} \right\} _{i=l}^n $ and $\left\{ {y_i } \right\} _{i=l}^n $. Second, we regress $\left\{ {x_i } \right\} _{i=l}^n $ and $\left\{ {y_i } \right\} _{i=l}^n $ to obtain h and $\hat{\mu } _h $. Third, we test $r_h $: If $r_h <0$ holds, we conclude that the regress result $\hat{\mu } _h =\hat{\mu } _g $ is a valid estimate value; if $r_h \ge 0$, we use $\left\{ {x_i } \right\} _{i=h}^n $ and $\left\{ {y_i } \right\} _{i=h}^n $ to repeat the steps 1-3. The computing process should end at the finite steps; otherwise, $\left\{ {x_i } \right\} _{i=1}^n $ and $\left\{ {y_i } \right\} _{i=1}^n $ do not fit Eq. (3).

Table 1 Correlation coefficients and estimate ${\hat{{\mu }}}_{{g}}$ for the United Kingdom

Full size table

It’s easy to check that the filtering procedure in Sect. 3 is in accordance with the three steps above, provided that $r_g <0$ holds. For simplicity, we only list the correlation coefficients $r_g $ for the United Kingdom in Table 1, which are all negative. The readers can test the other countries, which also exhibit the negative correlation coefficients (see Figs. 2, 3). Therefore, we believe that the estimate values for $\mu $ computed in Tables S1-S3 are convincing. It’s worth mentioning that the assumption $\varepsilon _j \sim N\left( {0,\sigma ^{2}} \right) $ in Eq. (6) holds if and only if the high-income samples can be adequately removed. This is because high-income samples, which obey the power law, will lead to systematic errors so that $\varepsilon _j \sim N\left( {0,\sigma ^{2}} \right) $ breaks down. In Sect. 3, we remove the high-income samples (systematic errors) based on the rule of maximized $R^{2}$ to get the estimate value $\mu _R $. However, the rule of maximized $R^{2}$ is not the unique method. In fact, Fig. 1 implies that, for the United Kingdom, we may remove only three quantile in high-income samples to get $\hat{\mu } _g $. Remarkably, the Proposition 3 implies that $\hat{\mu } _g $ should be close to $\mu _R $ if the sample size is large enough. In terms of our data, the United Kingdom data has the most quantile, and so yields the largest sample size (approximately equals 100). Therefore, it’s better to compare the estimate values $\hat{\mu } _g$ and $\mu _R$ in terms of the United Kingdom data. We have listed the results in Table 1, where the readers can check that the differences only yield the order of 0.01.

5 Discussion

The empirical results above imply that the exponential income law universally holds in most countries all over the world. Because we have investigated 67 countries from different areas, the validity of exponential income law appears to be robust. Compared to log-normal and gamma distributions, which have two or more fitting parameters, the exponential law essentially has only one fitting parameter $1/\theta $, and produces a more parsimonious fit of the data. More importantly, our exponential law (3) is compatible with the standard model of modern economics (namely ADGEM); therefore, the fitting parameters $\mu $ and $\theta $ have explicit economic meaning. In fact, $\mu $ denotes the marginal labor-capital return, and it is proportional to the minimum wage (Tao 2017). Concretely, we can obtain (Tao 2017):

$$\begin{aligned} \mu =\sigma \cdot \omega -\sigma \cdot r\cdot MRTS_{LK} , \end{aligned}$$

(17)

where $\sigma $ denotes the marginal employment level, $\omega $ denote the minimum wage, r denotes the interest rate, and $MRTS_{LK} $ denotes the marginal rate of technical substitution of labor and capital. The brief derivation for Eq. (17) can be found in “Appendix D”.

The marginal employment level $\sigma $ stands for the increasing number of employment once a firm enters markets (Tao 2017); therefore, it’s easy to understand $\sigma \ge 0$. Thus, Eq. (17) implies that the marginal labor-capital return $\mu $ is theoretically proportional to the minimum wage $\omega $. Obviously, the minimum wage $\omega $, like unemployment compensation, can be regarded as a critical income level at which labors would like to enter or exit markets. Therefore, we might as well identify $\omega $ by the unemployment compensation.

To test the relationship between $\mu $ and $\omega $, we collected the unemployment compensation data for 26 European countries in the years of 2011 to 2014. Using the computed values of $\mu $ for European countries from Table S2, we can directly test if there is a positive relationship between marginal labor-capital return and unemployment compensation by the OLS regression. The empirical results are shown in Fig. 4 and Table S4 (see Supplementary Material). From these results we find that the marginal labor-capital return $\mu $ (i.e., MLCR in Fig. 4), is strongly positive correlated with the unemployment compensation (i.e., UC in Fig. 4), with the Pearson correlation coefficients being separately 0.864, 0.904, 0.899 and 0.880 (from 2011 to 2014). Remarkably, the confidence levels of correlation coefficients are surprisingly high, since p value $<0.001$ for all four years, as shown in Table S4. It is worth mentioning that the Eq. (17) implies that $\mu $ is inversely proportional to r if^{Footnote 4}$MRTS_{LK} >0$ (Tao 2017). Recently, Tao (2017) has collected the real data of the interest rate r to do the cross-section regression between $\mu $, $\omega $ and r. Tao’s empirical results show that the marginal labor-capital return $\mu $ is indeed inversely proportional to the interest rate r (Tao 2017).

Due to the robust results of our study, some significant policy recommendations can be made: by moderately increasing the level of unemployment compensation, the income inequalities originated from low and middle income classes may be reduced, because the Gini coefficient of the exponential distribution is equal to $G=1/\left[ {2\left( {1+\mu /\theta } \right) } \right] $, see detailed derivation in Tao et al. (2017). To keep efficiency and fairness in competitive markets, we propose that the source of paying unemployment compensation should come from levying a tax on high income class. This is because, unlike high income class, the low and middle income class evolves to a competitive equilibrium combining efficiency and Rawls’ fairness. The traditional tax policy which artificially changes the income structure of low and middle income class may harm market efficiency and fairness.

6 Conclusion

We have shown that the standard Arrow–Debreu’s general equilibrium model combined with Rawls’ fairness principle naturally produces the exponential distribution of income, which agrees well with the empirical data for 67 countries around the world. These results provide a solid justification for the exponential income distribution within the mainstream economic framework. Furthermore, our findings may have broader socio-economic implications, because the exponential income law is, effectively, a result of natural selection of the likeliest (Whitfield 2007; Tao 2016), i.e. the most probable, distribution. The Arrow–Debreu’s general equilibrium model describes an ideal institutional environment (similar to ecological environment), which permits different income structures. Relative to other structures, the exponential income distribution occurs with the highest probability, and so it represents survival of the likeliest structure, also named as “Spontaneous Order” (Tao 2016). These results are relevant for evolutionary economics (Mackmurdo 1940; Nelson and Winter 1982; Potts 2001; Hodgson 2004; Dopfer 2004; Foster and Metcalfe 2012), which is concerned with the direction of social evolution. The exponential distribution (2) is obtained by maximization of entropy $ln\varOmega $ (see “Appendix D”), which indicates the direction of evolution. According to neoclassical economics, the entropy in our model is interpreted as technological progress (Tao 2016), as discussed in “Appendix D”, so the higher technological progress is the likeliest direction of social evolution: among all possible social systems, those whose technological level happens to be the highest will be “selected” as survivors. In other words, those social systems that possess the lower technological level will be more likely eliminated in the process of social evolution. Our insights seem to be in accordance with the existing historical facts.

Notes

The emergence of income inequality can be traced back to the pioneering work of Angle (1986, 1992, 1993, 1996, and 2006).
In the neoclassical economics, monopolistic power implies that the behaviors among firms are highly heterogeneous. Interestingly, Lux and Marchesi (1999) also showed that heterogeneous behaviors among economic agents may lead to a power law in financial markets.
Here we have considered ${\textit{lim}}_{i \rightarrow \infty } \, y_i \ne 0$.
When labor L and capital K substitute with each other, we have ${\textit{MRTS}}_{{\textit{LK}}} > 0$.
Full sample means $\{x_1, \ldots , x_n\}$, where n denotes the sample size.

References

Acemoglu D, Robinson J (2009) Foundation of societal inequality. Science 326(5953):678–679
Article Google Scholar
Angle J (1986) The surplus theory of social stratification and the size distribution of personal wealth. Soc Forces 65:293–326
Article Google Scholar
Angle J (1992) The inequality process and the distribution of income to blacks and whites. J Math Sociol 17:77–98
Article Google Scholar
Angle J (1993) Deriving the size distribution of personal wealth from “the rich get richer, the poor get poorer”. J Math Sociol 18:27–46
Article Google Scholar
Angle J (1996) How the gamma law of income distribution appears invariant under aggregation. J Math Sociol 31:325–358
Article Google Scholar
Angle J (2006) The inequality process as a wealth maximizing process. Physica A 367:388–414
Article Google Scholar
Arrow KJ (1963) Social choice and individual values. Wiley, New York
Google Scholar
Arrow KJ, Debreu G (1954) Existence of an equilibrium for a competitive economy. Econometrica 22(3):265–290
Article Google Scholar
Atkinson AB, Piketty T, Saez E (2011) Top incomes in the long run of history. J Econ Lit 49(1):3–71
Article Google Scholar
Autor DH (2014) Skills, education, and the rise of earnings inequality among the “other 99 percent”. Science 344(6186):843–851
Article Google Scholar
Autor DH, Katz LF, Kearney MS (2008) Trends in U.S. wage inequality: revising the revisionists. Rev Econ Stat 90(2):300–323
Article Google Scholar
Axtell RL (2001) Zipf distribution of U.S. firm sizes. Science 293(5536):1818–1820
Article Google Scholar
Banerjee A, Yakovenko VM (2010) Universal patterns of inequality. New J Phys 12:075032
Article Google Scholar
Banerjee A, Yakovenko VM, Di Matteo T (2006) A study of the personal income distribution in Australia. Physica A 370(1):54–59
Article Google Scholar
Chakrabarti AS, Chakrabarti BK (2009) Microeconomics of the ideal gas like market models. Physica A 388(19):4151–4158
Article Google Scholar
Chakrabarti BK, Chakraborti A, Chakravarty SR, Chatterjee A (2013) Econophysics of income and wealth distributions. Cambridge University Press, Cambridge
Book Google Scholar
Cho A (2014) Physicists say it’s simple. Science 344(6186):828–828
Article Google Scholar
Clementi F, Gallegati M, Kaniadakis G (2010) A model of personal income distribution with application to Italian data. Empir Econ 39:559–591
Article Google Scholar
Clementi F, Gallegati M, Kaniadakis G (2012) A new model of income distribution: the $\kappa $-generalized distribution. J Econ 105:63–91
Article Google Scholar
Derzsy N, Néda Z, Santos MA (2012) Income distribution patterns from a complete social security database. Physica A 391(22):5611–5619
Article Google Scholar
Dopfer K (2004) The economic agent as rule maker and rule user: Homo Sapiens Oeconomicus. J Evol Econ 14:177–195
Article Google Scholar
Dragulescu A, Yakovenko VM (2000) Statistical mechanics of money. Eur Phys J B 17(4):723–729
Article Google Scholar
Dragulescu A, Yakovenko VM (2001a) Evidence for the exponential distribution of income in the USA. Eur Phys J B 20(4):585–589
Article Google Scholar
Dragulescu A, Yakovenko VM (2001b) Exponential and power-law probability distributions of wealth and income in the United Kingdom and the United States. Physica A 299(1–2):213–221
Article Google Scholar
Foley DK (1994) A statistical equilibrium theory of markets. J Econ Theory 62(2):321–345
Article Google Scholar
Foster J, Metcalfe JS (2012) Economic emergence: an evolutionary economic perspective. J Econ Behav Organ 82(2–3):420–432
Article Google Scholar
Golosov M, Maziero P, Menzio G (2013) Taxation and redistribution of residual income inequality. J Polit Econ 121(6):1160–1204
Article Google Scholar
Harte J, Zillio T, Conlisk E, Smith AB (2008) Maximum entropy and the state-variable approach to macroecology. Ecology 89(10):2700–2711
Article Google Scholar
Heathcote J, Storesletten K, Violante GL (2010) The macroeconomic implications of rising wage inequality in the United States. J Polit Econ 118(4):681–722
Article Google Scholar
Hodgson GM (2004) The evolution of institutional economics: agency, structure and Darwinism in American Institutionalism. Routledge, London
Book Google Scholar
Jagielski M, Kutner R (2013) Modelling of income distribution in the European Union with the Kokker–Planck equation. Physica A 392(9):2130–2138
Article Google Scholar
Jones CI (2015) Pareto and Piketty: the macroeconomics of top income and wealth inequality. J Econ Perspect 29(1):29–46
Article Google Scholar
Kakwani N (1980) Income inequality and poverty. Oxford University Press, Oxford
Google Scholar
Katz L, Autor D (1999) Changes in the wage structure and earnings inequality. In: Ashenfelter O, Card D (eds) Handbook of labor economics, vol 3A. North-Holland, Amsterdam
Google Scholar
Kuznets S (1955) Economic growth and income inequality. Am Econ Rev 45(1):1–28
Google Scholar
Lai TL, Robbins H, Wei CZ (1979) Strong consistency of least squares estimates in multiple regression II. J Multivar Anal 9(3):343–361
Article Google Scholar
Lambert PJ (1993) The distribution and redistribution of income: a mathematical analysis, 2nd edn. Manchester University Press, Manchester
Google Scholar
Lux T, Marchesi M (1999) Scaling and criticality in a stochastic multi-agent model of a financial market. Nature 397:498–500
Article Google Scholar
Mackmurdo AH (1940) The social organism. Nature 145(3666):187–187
Article Google Scholar
Mandelbrot B (1960) The Pareto–Levy law and the distribution of income. Int Econ Rev 1(2):79–106
Article Google Scholar
Mas-Collel A, Whinston MD, Green JR (1995) Microeconomic theory. Oxford University Press, Oxford
Google Scholar
Moretti E (2013) Real wage inequality. Am Econ J Appl Econ 5(1):65–103
Article Google Scholar
Nelson RR, Winter SG (1982) An evolutionary theory of economic change. The Belknap Press of Harvard University Press, Cambridge
Google Scholar
Nirei M, Souma W (2007) A two factor model of income distribution dynamics. Rev Income Wealth 53(3):440–459
Article Google Scholar
Nishi A, Shirado H, Rand DG, Christakis NA (2015) Inequality and visibility of wealth in experimental social networks. Nature 526(7573):426–429
Article Google Scholar
Oancea B, Andrei T, Pirjol D (2016) Income inequality in Romania: the exponential-Pareto distribution. Physica A. https://doi.org/10.1016/j.physa.2016.11.094.
Pareto V (1897) Cours d’ Economie Politique. L’ Universite de Lausanne, Lausanne
Google Scholar
Piketty T (2003) Income inequality in France, 1901–1998. J Polit Econ 111:1004–1042
Article Google Scholar
Piketty T, Qian N (2009) Income inequality and progressive income taxation in China and India, 1986–2015. Am Econ J Appl Econ 1(2):53–63
Article Google Scholar
Piketty T, Saez E (2003) Income inequality in the United States, 1913–1998. Q J Econ 118:1–39
Article Google Scholar
Piketty T, Saez E (2014) Inequality in the long run. Science 344(6186):838–843
Article Google Scholar
Potts J (2001) Knowledge and markets. J Evol Econ 11:413–431
Article Google Scholar
Ravallion M (2014) Income inequality in the developing world. Science 344(6186):851–855
Article Google Scholar
Rawls J (1999) A theory of justice (revised edition). Harvard University Press, Cambridge
Google Scholar
Rudin W (1976) Principles of mathematical analysis, 3rd edn. McGraw-Hill, Inc, New York
Google Scholar
Saez E, Zucman G (2016) Wealth inequality in the United States since 1913: evidence from capitalized income tax data. Q J Econ 131:519–578
Article Google Scholar
Shaikh A (2016) Income distribution, econophysics and piketty. Rev Polit Econ. https://doi.org/10.1080/09538259.2016.1205295.
Shaikh A, Papanikolaou N, Wiener N (2014) Race, gender and the econophysics of income distribution in the USA. Physica A 415:54–60
Article Google Scholar
Silva AC, Yakovenko VM (2005) Temporal evolution of the “thermal” and “superthermal” income classes in the USA during 1983–2001. Europhys Lett 69(2):304–310
Article Google Scholar
Tao Y (2010) Competitive market for multiple firms and economic crisis. Phys Rev E 82(3):036118
Article Google Scholar
Tao Y (2015) Universal laws of human society’s income distribution. Physica A 435:89–94
Article Google Scholar
Tao Y (2016) Spontaneous economic order. J Evol Econ 26(3):467–500
Article Google Scholar
Tao Y (2017) An index measuring the deviation of a real economy from the general equilibrium: evidence from the OECD Countries. Available at SSRN: https://doi.org/10.2139/ssrn.2792556
Tao Y, Wu X, Li C (2017) Rawls’ fairness, income distribution and alarming level of Gini coefficient. Economics discussion papers, no 2017-67. Kiel Institute for the World Economy. http://www.economics-ejournal.org/economics/discussionpapers/2017-67
Venkatasubramanian V, Luo Y, Sethuraman J (2015) How much inequality in income is fair? A microeconomic game theoretic perspective. Physica A 435:120–138
Article Google Scholar
Walras L (2003) Elements of pure economics or the theory of social wealth. Routledge, London
Google Scholar
Whitfield J (2007) Survival of the likeliest? PLoS Biol 5(5):e142
Article Google Scholar
Yakovenko VM, Rosser JB Jr (2009) Statistical mechanics of money, wealth, and income. Rev Mod Phys 81(4):1703–1717
Article Google Scholar

Download references

Acknowledgements

The authors would like to thank two anonymous referees and the editorial board for valuable comments and suggestions. All errors remain ours. Victor Yakovenko was supported by grant “Statistical Physics Approach to Income and Wealth Distribution” from the Institute for New Economic Thinking (INET), and Yong Tao by the Fundamental Research Funds for the Central Universities of China (Grant No. SWU1409444).

Author information

Authors and Affiliations

College of Economics and Management, Southwest University, Chongqing, 400715, China
Yong Tao
College of Hanhong, Southwest University, Chongqing, 400715, China
Yanyuxiang Huang & Han Yu
College of Economics, Hangzhou Dianzi University, Hangzhou, 310018, China
Xiangjun Wu
Big Data Research Center, University of Electronic Science and Technology of China, Chengdu, 611731, China
Tao Zhou
School of Economics and Finance, Xi’an Jiaotong University, Xi’an, 710049, China
Weibo Yan
Department of Physics, University of Maryland, College Park, MD, 20742-4111, USA
Benedict Mondal
Department of Physics, CMTC and JQI, University of Maryland, College Park, MD, 20742-4111, USA
Victor M. Yakovenko

Authors

Yong Tao
View author publications
You can also search for this author in PubMed Google Scholar
Xiangjun Wu
View author publications
You can also search for this author in PubMed Google Scholar
Tao Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Weibo Yan
View author publications
You can also search for this author in PubMed Google Scholar
Yanyuxiang Huang
View author publications
You can also search for this author in PubMed Google Scholar
Han Yu
View author publications
You can also search for this author in PubMed Google Scholar
Benedict Mondal
View author publications
You can also search for this author in PubMed Google Scholar
Victor M. Yakovenko
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Yong Tao or Victor M. Yakovenko.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (docx 424 KB)

Appendices

Appendix

N-person non-cooperative game

Arrow–Debreu’s General Equilibrium Model (ADGEM) is based on the well-known two criteria of neoclassical economics: utility maximization and profit maximization. If there are N consumers, each of whom operates a firm, the ADGEM describing their optimal behavior uses the following principles (Tao 2015, 2016):

(a)
Profit maximization: For each firm $i=1,\ldots ,N$, $y_i^*\in Y_i $ maximizes profits such that $p\cdot y_i \le p\cdot y_i^*$ for all $y_i \in Y_i $.
(b)
Utility maximization: For each consumer $i=1,\ldots ,N$, $x_i^*\in X_i $ is the solution of maximizing the preference $\mathop \succ \limits _{\sim i} $ under the budget set: $\big \{ x_i \in X_i :p\cdot x_i \le p\cdot \omega _i +\mathop \sum \nolimits _{j=1}^N \theta _{ij} p\cdot y_j^*\big \}$.
(c)
Market clearing: $\mathop \sum \nolimits _{i=1}^N x_i^*=\mathop \sum \nolimits _{i=1}^N \omega _i +\mathop \sum \nolimits _{i=1}^N y_i^*$.

Here $x_i $ and $X_i $ represent consumption vector and consumption set of the $i \mathrm{th}$ consumer, respectively; $y_i $ and $Y_i $ represent production vector and production set of the $i\mathrm{th}$ firm, respectively (Mas-Collel et al. 1995); $\theta _{ij} $ represents an ownership share of each firm $j=1,\ldots ,N$ paid to the $i\hbox {th}$ consumer. The allocation $\left( {x_1^*,\ldots ,x_N^*;y_1^*,\ldots ,y_N^*} \right) $ and a price vector $p=\left( {p_1 ,\ldots ,p_L } \right) $ constitute a Pareto optimal solution to ADGEM (a)–(c).

Rawls’ fairness of “2-person allocation”

For illustration, let us consider a simple “2-person society” in which the GDP is denoted by $2 and each person can earn a possible equilibrium income with $0, $1 or $2. For the “2-person society”, the Eq. (1) can be expressed in the form:

$$\begin{aligned} \left\{ {{\begin{array}{l} I_i =0,1,2\quad for \quad i=1,2 \\ \mathop \sum \nolimits _{i=1}^2 I_i =2 \\ \end{array} }} \right. . \end{aligned}$$

(B.1)

By Eq. (B.1), the “2-person society” will have three equilibrium income allocation (EIA): $A_1 =\left\{ {0,2} \right\} $, $A_2 =\left\{ {2,0} \right\} $ and $A_3 =\left\{ {1,1} \right\} $. They have been shown as below:

By Rawls’ principle of fair equality of opportunity, each EIA should occur with an equal probability (Tao 2015, 2016); therefore, each person’s expected income equals $1. The detailed calculation is as below:

$$\begin{aligned} \hbox {Probability}\left( A_{1} \right)= & {} 1/3,\hbox {Probability}\left( {A_2 } \right) =1/3,\hbox {Probability}\left( {A_3 } \right) =1/3 \nonumber \\ \hbox {Woman's Expected Income}= & {} 0\times \left( {1/3} \right) + 2\times \left( {1/3} \right) +1\times \left( {1/3} \right) =1 \end{aligned}$$

(B.2)

$$\begin{aligned} \hbox {Man's Expected Income}= & {} 2\times \left( {1/3} \right) +0\times \left( {1/3} \right) +1\times \left( {1/3} \right) =1 \end{aligned}$$

(B.3)

This means that each person owns the equal opportunity of earning money. If we denote the equal income distribution by a and the unequal income distribution by b, we do have $a=\left\{ A_{3} \right\} $ and $b=\left\{ {A_1 ,A_2 } \right\} $. By Rawls’ principle of fair equality of opportunity, a will occur with probability 1 / 3 and b will occur with probability 2 / 3. Following the rule of “survival of the likeliest”, b will be a result of natural selection.

Density function of income distribution

For “N-person allocation”, Tao has shown that, by applying Rawls’ fairness into Eq. (1) where N and Y are large enough, one will get the exponential income distribution which occurs with the highest probability (Tao 2015, 2016):

$$\begin{aligned} a_k= & {} g_k e^{-\frac{( {\varepsilon _k -\mu } )}{\theta }}, \nonumber \\&\varepsilon _1<\varepsilon _2<\cdots <\varepsilon _n. \end{aligned}$$

(C.1)

Here $\mu $ and $\theta $ denote marginal labor-capital return and marginal technology return, respectively (Tao 2016); readers can find the origin of these two parameters in “Appendix D”.

The formula (C.1) indicates that there are $a_k $ consumers each of which obtains $\varepsilon _k $ units of revenue, and k runs from 1 to n. Because income distribution (C.1) will occur with the highest probability, Tao call it the “spontaneous economic order” (Tao 2016).

The formula (C.1) can be rewritten in the form of continuous function. To see this, let us first observe:

$$\begin{aligned} \mathop \sum \limits _{k=1}^n a_k =N, \end{aligned}$$

(C.2)

which leads to:

$$\begin{aligned} \mathop \sum \limits _{k=1}^n \frac{a_k }{N}=1. \end{aligned}$$

(C.3)

Here $\frac{a_k}{N}$ denotes the proportion of populations each of whom earns $\varepsilon _k $ units of income. Now we write $\frac{a_k }{N}$ in the form of continuous function: $f\left( x \right) $. To this end, let us order:

$$\begin{aligned} f\left( x \right) =w\cdot e^{\frac{- ( {x-\mu })}{\theta }}, \end{aligned}$$

(C.4)

where x, which replaces $\varepsilon _k $, denotes a continuous income level, and by Rational Agent Hypothesis one has (Tao 2010) $x\ge \mu $.

Here w is an undetermined constant, which will be determined by the sum formula (C.3). Let us replace $\frac{a_k }{N}$ by (C.4), and transform sum operation of formula (C.3) into integral operation:

$$\begin{aligned} \mathop \int \nolimits _\mu ^{+\infty } w\cdot e^{\frac{-({x-\mu })}{\theta }}dx=1, \end{aligned}$$

(C.5)

which leads to $w=\frac{1}{\theta }$.

Finally, we obtain the density function of income distribution:

$$\begin{aligned} f\left( x \right) =\frac{1}{\theta }e^{\frac{- ({x-\mu })}{\theta }}. \end{aligned}$$

(C.6)

Technological progress and entropy

Because the firm consists of labor and capital, the Cobb–Douglas aggregate production function (or GDP) of neoclassical economics can be written in the form (Tao 2010, 2016):

$$\begin{aligned} Y=Y\left( {N\left( {L,K} \right) ,H} \right) , \end{aligned}$$

(D.1)

where L and K denote labor and capital, whereas $N\hbox { and }H$ denote the number of firms and technological progress.

The complete differential of (D.1) yields [see also Eq. (9) in Banerjee and Yakovenko (2010)]:

$$\begin{aligned} \hbox {d}Y\left( {N\left( {L,K} \right) ,H} \right) =\mu dN\left( {L,K} \right) +\theta dH, \end{aligned}$$

(D.2)

where $\mu =\partial Y/\partial N$ and $\theta =\partial Y/\partial H$ denote the marginal labor-capital return and the marginal technology return (Tao 2016), respectively.

Here Tao identifies the entropy $ln\varOmega $ with the technological progress H (Tao 2010, 2016):

$$\begin{aligned} H=ln{\varOmega }, \end{aligned}$$

(D.3)

where ${\varOmega }$ denotes the number of equilibrium income allocations that a given income distribution contains [Furthermore, ${\varOmega }$ also measures the choice freedom of social members (Tao 2016)]. For example, for the 2-person society described by “Appendix B”, we have ${\varOmega } \left( a \right) =1$ and ${\varOmega } \left( b \right) =2$. By maximizing (Tao 2010, 2015, 2016) ${\varOmega }$ one can obtain the exponential income distribution (2). Consequently, the technological progress H can be regarded as the entropy of socio-economical systems.

Furthermore, the complete differential of Eq. (D.1) can be rewritten in the form:

$$\begin{aligned} dY=\omega dL+rdK+\theta dH, \end{aligned}$$

(D.4)

where $\omega =\partial Y/\partial L$ and $r=\partial Y/\partial K$ denote marginal labor return and marginal capital return (Tao 2017), respectively. On the one hand, we might as well assume that capital markets exhibit perfect competition, so r also denotes the interest rate. On the other hand, by the principle of diminishing marginal return in neoclassical economics, $\omega $ denotes the minimum wage. Comparing Eqs. (D.2) and (D.4), we can obtain (Tao 2017):

$$\begin{aligned} \mu =\omega \cdot \sigma -r\cdot \sigma \cdot MRTS_{LK} , \end{aligned}$$

(D.5)

where $\sigma =dL/dN$ and $MRTS_{LK} =-dK/dL$. Here $\sigma $ denotes the marginal employment level and $MRTS_{LK} $ denotes the marginal rate of technical substitution of labor and capital (Tao 2017).

Main propositions

To obtain the consistent estimate of $\mu $, we do the estimate analysis in terms of two cases: full sample and truncation sample. In this paper, $lim_{n\rightarrow \infty } a_n =a$ means $lim_{n\rightarrow \infty } P\left( {a_n =a} \right) =1$, where $P\left( \xi \right) $ denotes the probability of $\xi $ occurring.

1.1 Full sample

Let us first drop the constraint $x\ge \mu $. For the full data (i.e., population), the Eq. (4) can be written in the form:

$$\begin{aligned} y_j= & {} \beta ^{*}x_j +\alpha ^{*}+\varepsilon _j , \end{aligned}$$

(E.1)

$$\begin{aligned} \mu ^{*}= & {} -\frac{\alpha ^{*}}{\beta ^{*}}, \end{aligned}$$

(E.2)

where $\beta ^{*}=-\frac{1}{\theta ^{*}}$, $\alpha ^{*}=\frac{\mu ^{*}}{\theta ^{*}}$, and $\varepsilon _j \sim N\left( {0,\sigma ^{2}} \right) $ for $j=1,2,\ldots ,\infty $. Here $\left\{ {x_j } \right\} _{j=1}^\infty $ and $\left\{ {y_j } \right\} _{j=1}^\infty $ denote the full data. $\beta ^{*}$ and $\alpha ^{*}$ are obtained by regressing $\left\{ {y_j } \right\} _{j=1}^\infty $ on $\left\{ {x_j } \right\} _{j=1}^\infty $.

For the full sample^{Footnote 5}, the sample estimates of Eqs. (E.1) and (E.2) yield:

$$\begin{aligned} \hat{y} _i= & {} \hat{\beta } x_i +\hat{\alpha } , \end{aligned}$$

(E.3)

$$\begin{aligned} \hat{\mu }= & {} -\frac{\hat{\alpha } }{\hat{\beta } }, \end{aligned}$$

(E.4)

where $i=1,2,\ldots ,n$.

Due to the absence of the constraint $x\ge \mu $, equation (E.1) differs slightly from Eq. (3); therefore, we don’t ensure if $\mu ^{*}=\mu $. In section E2, we will discuss the estimate of $\mu $ when $x\ge \mu $ holds. In this section, we mainly investigate the consistency of the estimate (E.4).

Taking the least squares estimation on Eq. (E.3) we have:

$$\begin{aligned} \hat{\beta }= & {} \frac{\mathop \sum \nolimits _{i=1}^n \left( {x_i -\bar{x} } \right) \left( {y_i -\bar{y} } \right) }{\mathop \sum \nolimits _{i=1}^n \left( {x_i -\bar{x} } \right) ^{2}}, \end{aligned}$$

(E.5)

$$\begin{aligned} \hat{\alpha }= & {} \bar{y} -\hat{\beta } \bar{x}, \end{aligned}$$

(E.6)

where $\bar{x} =\frac{1}{n}\mathop \sum \nolimits _{i=1}^n x_i $ and $\bar{y} =\frac{1}{n}\mathop \sum \nolimits _{i=1}^n y_i $.

Since the exponential distribution (3) is only suitable for the low and middle parts of the income data, we should drop the high income data. Moreover, due to the economic meanings of $x_i $ in the Eq. (3), $\left\{ {x_i } \right\} _{i=1}^n $ should be a monotonic increasing sequence. Thus, we can make the following assumptions.

Assumptions

(a).
$\left| {x_i } \right| <\infty $ and $\left| {y_i } \right| <\infty $ for $i=1,2,\ldots ,n$.
(b).
$\left\{ {x_i } \right\} _{i=1}^n $ is a strictly monotonic increasing sequence with $x_i \ge 0$ for $i=1,2,\ldots ,n$.
(c).
$\varepsilon _j $ are i.i.d. $N\left( {0,\sigma ^{2}} \right) $.

Next we verify that $\hat{\beta } $ and $\hat{\alpha } $ are consistent estimates.

Theorem 1

Assume that $\varepsilon _j $ are i.i.d. $N\left( {0,\sigma ^{2}} \right) $. If there is $lim_{n\rightarrow \infty } \left( {{\varvec{X}}^{T}{\varvec{X}}} \right) ^{-1}={\varvec{0}}$, then one has:

$$\begin{aligned} lim_{n\rightarrow \infty } \hat{\beta }= & {} \beta ^{*}, \end{aligned}$$

(E.7)

$$\begin{aligned} lim_{n\rightarrow \infty } \hat{\alpha }= & {} \alpha ^{*}, \end{aligned}$$

(E.8)

where ${{\varvec{X}}}=\left( {{\begin{array}{lll} {x_1}&{} \cdots &{} {x_n } \\ 1&{} \cdots &{} 1 \\ \end{array} }} \right) ^{T}$.

Proof

See Lai et al. (1979). $\square $

To verify Eqs. (E.7) and (E.8), we can only prove the following proposition.

Proposition 1

$lim_{n\rightarrow \infty } \left( {{{\varvec{X}}}^{T}{{\varvec{X}}}} \right) ^{-1}=\mathbf{0}$.

Proof

It’s easy to compute:

$$\begin{aligned} \left( {{{\varvec{X}}}^{T}{{\varvec{X}}}} \right) ^{-1}=\frac{1}{n\mathop \sum \nolimits _{i=1}^n x_i^2 -\left( {\mathop \sum \nolimits _{i=1}^n x_i } \right) ^{2}}\left( {{\begin{array}{cc} n &{} {-\mathop \sum \nolimits _{i=1}^n x_i } \\ {-\mathop \sum \nolimits _{i=1}^n x_i }&{} {\mathop \sum \nolimits _{i=1}^n x_i^2 } \\ \end{array} }} \right) , \end{aligned}$$

so proving $lim_{n\rightarrow \infty } \left( {{{\varvec{X}}}^{T}{{\varvec{X}}}} \right) ^{-1}=\mathbf{0}$ is equivalent to verifying:

$$\begin{aligned} lim_{n\rightarrow \infty } \frac{1}{n\mathop \sum \nolimits _{i=1}^n x_i^2 -\left( {\mathop \sum \nolimits _{i=1}^n x_i } \right) ^{2}}\left( {{ \begin{array}{cc} n&{} {-\mathop \sum \nolimits _{i=1}^n x_i } \\ {-\mathop \sum \nolimits _{i=1}^n x_i }&{} {\mathop \sum \nolimits _{i=1}^n x_i^2 } \\ \end{array} }} \right) =\left( {{\begin{array}{cc} 0&{} 0 \\ 0&{} 0 \\ \end{array} }} \right) . \end{aligned}$$

(E.9)

Obviously, proving Eq. (E.9) is equivalent to verifying the following three equations:

$$\begin{aligned}&lim_{n\rightarrow \infty } \frac{n}{n\mathop \sum \nolimits _{i=1}^n x_i^2 -\left( {\mathop \sum \nolimits _{i=1}^n x_i} \right) ^{2}}=0, \end{aligned}$$

(E.10)

$$\begin{aligned}&\quad lim_{n\rightarrow \infty } \frac{\mathop \sum \nolimits _{i=1}^n x_i }{n\mathop \sum \nolimits _{i=1}^n x_i^2 -\left( {\mathop \sum \nolimits _{i=1}^n x_i} \right) ^{2}}=0, \end{aligned}$$

(E.11)

$$\begin{aligned}&\quad lim_{n\rightarrow \infty } \frac{\mathop \sum \nolimits _{i=1}^n x_i^2 }{n\mathop \sum \nolimits _{i=1}^n x_i^2 -\left( {\mathop \sum \nolimits _{i=1}^n x_i} \right) ^{2}}=0. \end{aligned}$$

(E.12)

One can compute:

$$\begin{aligned} n\mathop \sum \nolimits _{i=1}^n x_i^2 -\left( {\mathop \sum \nolimits _{i=1}^n x_i } \right) ^{2}=n^{2}\left[ {\frac{1}{n}\mathop \sum \nolimits _{i=1}^n x_i^2 -\left( {\bar{x} } \right) ^{2}} \right] . \end{aligned}$$

(E.13)

Furthermore, we have the following result:

$$\begin{aligned}&\frac{1}{n}\mathop \sum \nolimits _{i=1}^n x_i^2 -\left( {\bar{x} } \right) ^{2}=\frac{1}{n}\mathop \sum \nolimits _{i=1}^n x_i^2 -2\left( {\bar{x} } \right) ^{2} \nonumber \\&\quad + \left( {\bar{x} } \right) ^{2}=\frac{1}{n}\mathop \sum \nolimits _{i=1}^n \left( {x_i -\bar{x} } \right) ^{2}. \end{aligned}$$

(E.14)

By Assumption (b) we must have $\mathop \sum \nolimits _{i=1}^n \left( {x_i -\bar{x} } \right) ^{2}\ne 0$; otherwise, $x_i =\bar{x} $ for $i=1,2,\ldots ,n$, contradicting the strict monotonicity. On the other hand, by the strict monotonicity, there should be at most one number $x_l $ leading to $x_l =\bar{x} $. Thus, if we order $\mathop {\min }\limits _{i\ne l} \left| {x_i -\bar{x} } \right| =A$, then we have $\mathop \sum \nolimits _{i=1}^n \left( {x_i -\bar{x} } \right) ^{2}\ge 0+\left( {n-1} \right) \cdot A^{2}$.

Consequently, by Eq. (E.14) we can obtain:

$$\begin{aligned} \left| {\frac{1}{n}\mathop \sum \nolimits _{i=1}^n x_i^2 -\left( {\bar{x} } \right) ^{2}} \right| =\left| {\frac{1}{n}\mathop \sum \nolimits _{i=1}^n \left( {x_i -\bar{x} } \right) ^{2}} \right| \ge \frac{n-1}{n}\cdot A^{2}. \end{aligned}$$

(E.15)

Using Eqs. (E.13) and (E.15) one has

$$\begin{aligned}&\left| {\frac{1}{n\mathop \sum \nolimits _{i=1}^n x_i^2 -\left( {\mathop \sum \nolimits _{i=1}^n x_i } \right) ^{2}}} \right| \nonumber \\&\quad =\left| {\frac{1}{n^{2}\left[ {\frac{1}{n}\mathop \sum \nolimits _{i=1}^n x_i^2 -\left( {\bar{x} } \right) ^{2}} \right] }} \right| \nonumber \\&\quad \le \frac{1}{n^{2}\cdot \frac{n-1}{n}\cdot A^{2}}=\frac{1}{n\cdot \left( {n-1} \right) \cdot A^{2}}. \end{aligned}$$

(E.16)

On the other hand, by Assumption (a), we can order ${\textit{max}}_i \left| {x_i } \right| =B$; therefore, we have:

$$\begin{aligned} \left| {\mathop \sum \nolimits _{i=1}^n x_i } \right| =\mathop \sum \nolimits _{i=1}^n \left| {x_i } \right| \le n\cdot B, \end{aligned}$$

(E.17)

$$\begin{aligned} \left| {\mathop \sum \nolimits _{i=1}^n x_i^2 } \right| =\mathop \sum \nolimits _{i=1}^n x_i^2 \le n\cdot B^{2}. \end{aligned}$$

(E.18)

Using Eqs. (E.16)–(E.18), we can obtain:

$$\begin{aligned}&\left| {\frac{n}{n\mathop \sum \nolimits _{i=1}^n x_i^2 -\left( {\mathop \sum \nolimits _{i=1}^n x_i } \right) ^{2}}} \right| \nonumber \\&\quad \le \frac{n}{n\cdot \left( {n-1} \right) \cdot A^{2}}=\frac{1}{\left( {n-1} \right) \cdot A^{2}}. \end{aligned}$$

(E.19)

$$\begin{aligned}&\quad \left| {\frac{\mathop \sum \nolimits _{i=1}^n x_i }{n\mathop \sum \nolimits _{i=1}^n x_i^2 -\left( {\mathop \sum \nolimits _{i=1}^n x_i} \right) ^{2}}} \right| \le \frac{n\cdot B}{n\cdot \left( {n-1} \right) \cdot A^{2}}=\frac{B}{\left( {n-1} \right) \cdot A^{2}}. \end{aligned}$$

(E.20)

$$\begin{aligned}&\quad \left| {\frac{\mathop \sum \nolimits _{i=1}^n x_i^2 }{n\mathop \sum \nolimits _{i=1}^n x_i^2 -\left( {\mathop \sum \nolimits _{i=1}^n x_i } \right) ^{2}}} \right| \le \frac{n\cdot B^{2}}{n\cdot \left( {n-1} \right) \cdot A^{2}}=\frac{B^{2}}{\left( {n-1} \right) \cdot A^{2}}. \end{aligned}$$

(E.21)

Imposing $n\rightarrow \infty $ on Eqs. (E.19)–(E.21) one can obtain Eqs. (E.10)–(E.12). $\square $

By using the Theorem 1, it’s easy to compute:

$$\begin{aligned} lim_{n\rightarrow \infty } \hat{\mu } =-\frac{\alpha ^{*}}{\beta ^{*}}=\mu ^{*}. \end{aligned}$$

(E.22)

Equation (E.22) indicates that if there is no the constraint $x\ge \mu $, then the estimate $\hat{\mu } $ is consistent. However, the existence of the constraint $x\ge \mu $ may lead to the inconsistency of estimate $\hat{\mu } $.

1.2 Truncation sample

Now let us recover the constraint $x\ge \mu $. Since the constraint $x\ge \mu $ holds, we attempt to construct a truncation estimate of $\mu $. To this end, we might as well assume that $\mu $ has existed. Thus, the truncation of the full data $x_j $ can be written as:

$$\begin{aligned} x_j \ge \mu , \end{aligned}$$

(E.23)

where $j=g^{*},g^{*}+1,\ldots ,\infty $.

Using the truncation data (E.23), Eq. (4) can be written as:

$$\begin{aligned} y_k= & {} \beta x_k +\alpha +\varepsilon _k , \end{aligned}$$

(E.24)

$$\begin{aligned} x_k\ge & {} \mu , \end{aligned}$$

(E.25)

where $\beta =-\frac{1}{\theta }$, $\alpha =\frac{\mu }{\theta }$, and $\varepsilon _k \sim N\left( {0,\sigma ^{2}} \right) $ for $k=g^{*},g^{*}+1,\ldots ,\infty $. Here $\beta $ and $\alpha $ are obtained by regressing $\left\{ {y_j } \right\} _{j=g^{*}}^\infty $ on $\left\{ {x_j } \right\} _{j=g^{*}}^\infty $.

Thus, the sample estimates of Eqs. (E.24) and (E.25) yield:

$$\begin{aligned} \hat{y} _i= & {} \hat{\beta } _g x_i +\hat{\alpha } _g , \end{aligned}$$

(E.26)

$$\begin{aligned} x_i\ge & {} \hat{\mu } _g , \end{aligned}$$

(E.27)

where $i=g,g+1,\ldots ,n$ and $g=g\left( n \right) $. Here $\left\{ {x_i } \right\} _{i=g}^n $ and $\left\{ {y_i } \right\} _{i=g}^n $ denote truncation sample. It’s worth emphasizing that $g^{*}$ and $g=g\left( n \right) $ are undetermined.

Taking the least squares estimation on Eq. (E.26) we have:

$$\begin{aligned} \hat{\beta } _g= & {} \frac{\mathop \sum \nolimits _{i=g}^n \left( {x_i -\bar{x} _g } \right) \left( {y_i -\bar{y} _g } \right) }{\mathop \sum \nolimits _{i=g}^n \left( {x_i -\bar{x} _g } \right) ^{2}}, \end{aligned}$$

(E.28)

$$\begin{aligned} \hat{\alpha } _g= & {} \bar{y} _g -\hat{\beta } _g \bar{x} _g , \end{aligned}$$

(E.29)

where $\bar{x} _g =\frac{1}{n-g+1}\mathop \sum \nolimits _{i=g}^n x_i$ and $\bar{y} _g =\frac{1}{n-g+1}\mathop \sum \nolimits _{i=g}^n y_i$.

The main purpose of this section is to derive the estimate $\hat{\mu } _g $. Assume $g^{*}<\infty $, thus we will have the following theorem and proposition:

Theorem 2

Assume that $\varepsilon _j $ are i.i.d. $N\left( {0,\sigma ^{2}} \right) $. If there is $lim_{n\rightarrow \infty } \left( {{{\varvec{X}}}_{g^{*}}^T {{\varvec{X}}}_{g^{*}}} \right) ^{-1}=\mathbf{0}$, then one has:

$$\begin{aligned} lim_{n\rightarrow \infty } \hat{\beta } _{g^{*}}= & {} \beta , \end{aligned}$$

(E.30)

$$\begin{aligned} lim_{n\rightarrow \infty }\hat{\alpha } _{g^{*}}= & {} \alpha , \end{aligned}$$

(E.31)

where ${{\varvec{X}}}_{g^{*}} =\left( {{\begin{array}{lll} {x_{g^{*}} }&{} \cdots &{} {x_n } \\ 1&{} \cdots &{} 1 \\ \end{array}}} \right) ^{T}$.

Proof

Same as the Theorem 1. $\square $

Proposition 2

$lim_{n\rightarrow \infty } \left( {{{\varvec{X}}}_{g^{*}}^T {{\varvec{X}}}_{g^{*}} } \right) ^{-1}=\mathbf{0}$.

Proof

Same as the Proposition 1. $\square $

Consistent with the form of Eq. (E.4), $\hat{\mu } _g $ can be defined as:

$$\begin{aligned} \hat{\mu } _g =-\frac{\hat{\alpha } _g }{\hat{\beta } _g }. \end{aligned}$$

(E.32)

Now we start to derive the consistent condition of guaranteeing the validity of estimate (E.32).

Substituting Eqs. (E.29) into (E.32) one has:

$$\begin{aligned} \hat{\mu } _g =\bar{x} _g -\frac{\bar{y} _g }{\hat{\beta } _g }, \end{aligned}$$

(E.33)

which guarantees that the constraint of Eq. (E.26) has been imposed on the estimate (E.32).

On the other hand, Eq. (E.27) indicates:

$$\begin{aligned} \bar{x} _g >\hat{\mu } _g +\delta , \end{aligned}$$

(E.34)

where we have used the Assumption (b) and $\delta >0$.

Inserting Eqs. (E.33) into (E.34) yield:

$$\begin{aligned} \frac{\bar{y} _g }{\hat{\beta } _g }>\delta >0, \end{aligned}$$

(E.35)

which guarantees that the constraint of Eq. (E.27) has been imposed on the estimate (E.32).

Thus, we can obtain the core proposition of this Appendix as below:

Proposition 3

For a strictly monotonic increasing sequence $\left\{ {x_j } \right\} _{j=1}^n $, if there exists an integer $g=g\left( n \right) $ to guarantee:

(A).
$x_{i-1}<\mu <x_i $ or $x_i =\mu $, where $i=g<n$ and $lim_{n\rightarrow \infty } \frac{g}{n}=0$;
(B).
$\frac{\bar{y} _g }{\hat{\beta } _g }>\delta >0$ for any n;

then one has:
$$\begin{aligned} lim_{n\rightarrow \infty } \hat{\mu } _g =lim_{n\rightarrow \infty } \left( {\bar{x} _g -\frac{\bar{y}_g }{\hat{\beta } _g }} \right) =\mu , \end{aligned}$$
(E.36)
where g is uniquely determined by n and $g<\infty $. This means:
$$\begin{aligned} lim_{n\rightarrow \infty } g=g^{*}. \end{aligned}$$
(E.37)

To verify the Proposition 3, we need to prove the following four lemmas:

Lemma 1

If $\left\{ {\xi _i } \right\} _{i=1}^n $ is a monotonic sequence and if $\left| {\xi _i } \right| <\infty $ for any i, then one has:

$$\begin{aligned} lim_{n\rightarrow \infty } \xi _n =\xi , \end{aligned}$$

(E.38)

where $\left| \xi \right| <\infty $.

Proof

See the theorem 3.14 in Rudin (1976). $\square $

Lemma 2

For the sequence $\left\{ {\xi _i } \right\} _{i=1}^n $, if $lim_{n\rightarrow \infty } \xi _n =\xi $, then one has:

$$\begin{aligned} lim_{n\rightarrow \infty } \frac{1}{n}\mathop \sum \nolimits _{i=1}^n \xi _i =\xi . \end{aligned}$$

(E.39)

Proof

Since $lim_{n\rightarrow \infty } \xi _n =\xi $, by the definition of limit, for every $\upepsilon >0$ there always exists a positive integer N so that when $k>N$, one has:

$$\begin{aligned} \left| {\xi _k -\xi } \right| <\frac{\epsilon }{2}. \end{aligned}$$

(E.40)

To verify Eq. (E.39), we only need to prove:

$$\begin{aligned} lim_{n\rightarrow \infty } \left( {\frac{1}{n}\mathop \sum \nolimits _{i=1}^n \xi _i -\xi } \right) =0; \end{aligned}$$

(E.41)

that is, for every $\upepsilon >0$ there always exists a positive integer $N_0$ so that when $n>N_0 $, one has:

$$\begin{aligned} \left| {\frac{1}{n}\mathop \sum \nolimits _{i=1}^n \xi _i -\xi } \right| <\epsilon . \end{aligned}$$

(E.42)

It’s easy to compute:

$$\begin{aligned}&\left| {\frac{1}{n}\mathop \sum \nolimits _{i=1}^n \xi _i -\xi } \right| \nonumber \\&\quad =\left| {\frac{1}{n}\left[ {\mathop \sum \nolimits _{i=1}^N \left( {\xi _i -\xi } \right) +\mathop \sum \nolimits _{j=N+1}^n \left( {\xi _j -\xi } \right) } \right] } \right| \nonumber \\&\quad \le \frac{1}{n}\left| {\mathop \sum \nolimits _{i=1}^N \left( {\xi _i -\xi } \right) } \right| +\frac{1}{n}\left| {\mathop \sum \nolimits _{j=N+1}^n \left( {\xi _j -\xi } \right) } \right| . \end{aligned}$$

(E.43)

Because $lim_{n\rightarrow \infty } \xi _n =\xi $, it’s easy to verify that $\left| {\xi _i } \right| <\infty $ and $\left| \xi \right| <\infty $. Thus, one has ${\textit{max}}_i \left| {\xi _i -\xi } \right| <\infty $. Consequently, thanks to $j>N$, Eq. (E.43) can be written in the form:

$$\begin{aligned}&\left| {\frac{1}{n}\mathop \sum \nolimits _{i=1}^n \xi _i -\xi } \right| \nonumber \\&\quad \le \frac{N}{n} {\textit{max}}_i \left| {\xi _i -\xi } \right| +\frac{n-N}{n}\frac{\epsilon }{2} \nonumber \\&\quad <\frac{N}{n} {\textit{max}}_i \left| {\xi _i -\xi } \right| +\frac{\epsilon }{2}. \end{aligned}$$

(E.44)

where we have used Eq. (E.40).

It’s easy to compute $lim_{n\rightarrow \infty } \frac{N}{n} {\textit{max}}_i \left| {\xi _i -\xi } \right| =0$. This means that for every $\upepsilon >0$ there always exists a positive integer $N_1$ so that when $k>N_1 $, one has:

$$\begin{aligned} \frac{N}{k} {\textit{max}}_i \left| {\xi _i -\xi } \right| <\frac{\epsilon }{2}. \end{aligned}$$

(E.45)

Let us order $N_0 =max\left\{ {N,N_1 } \right\} $; thus, substituting Eqs. (E.45) into (E.44) we conclude that for every $\upepsilon >0$ when $n>N_0 $, there always holds:

$$\begin{aligned} \left| {\frac{1}{n}\mathop \sum \nolimits _{i=1}^n \xi _i -\xi } \right| <\epsilon . \end{aligned}$$

$\square $

Lemma 3

If $lim_{n\rightarrow \infty } \frac{g}{n}=0$, one has:

$$\begin{aligned}&lim_{n\rightarrow \infty } \bar{x} _g =lim_{n\rightarrow \infty } \bar{x} =x, \\&\quad lim_{n\rightarrow \infty } \bar{y} _g =lim_{n\rightarrow \infty } \bar{y} =y, \end{aligned}$$

where $x=lim_{n\rightarrow \infty } x_n $ and $y=\beta ^{*}x+\alpha ^{*}$.

Proof

We first verify $lim_{n\rightarrow \infty } \bar{x} _g =lim_{n\rightarrow \infty } \bar{x} $. It’s easy to check:

$$\begin{aligned} \bar{x}= & {} \frac{1}{n}\mathop \sum \nolimits _{i=1}^n x_i =\frac{1}{n}\mathop \sum \nolimits _{i=1}^{g-1} x_i \nonumber \\&+\,\frac{n-g+1}{n}\frac{1}{n-g+1}\mathop \sum \nolimits _{j=g}^n x_j \nonumber \\= & {} \frac{1}{n}\mathop \sum \nolimits _{i=1}^{g-1} x_i +\frac{n-g+1}{n}\bar{x} _g. \end{aligned}$$

(E.46)

Since $lim_{n\rightarrow \infty } \frac{g}{n}=0$, imposing $n\rightarrow \infty $ on Eq. (E.46) one obtains:

$$\begin{aligned} lim_{n\rightarrow \infty } \bar{x} _g =lim_{n\rightarrow \infty } \bar{x} , \end{aligned}$$

where we have used $\left| {x_i } \right| <\infty $.

Since Assumptions (a) and (b) hold, by using Lemma 1 one has: $lim_{n\rightarrow \infty } x_n =x$. This means that by using Lemma 2 one obtains $lim_{n\rightarrow \infty } \bar{x} =x$. Therefore, we verify $lim_{n\rightarrow \infty } \bar{x} _g =lim_{n\rightarrow \infty } \bar{x} =x$.

Now we start to verify $lim_{n\rightarrow \infty } \bar{y} _g =lim_{n\rightarrow \infty } \bar{y} =y$.

Based on the same technique from Eq. (E.46), by Assumption (a) we can verify $lim_{n\rightarrow \infty } \bar{y} _g =lim_{n\rightarrow \infty } \bar{y} $. By using Eq. (E.1), one has:

$$\begin{aligned} \bar{y} =\beta ^{*}\bar{x} +\alpha ^{*}+\bar{\varepsilon }, \end{aligned}$$

(E.47)

where $\bar{\varepsilon } =\frac{1}{n}\mathop \sum \nolimits _{i=1}^n \varepsilon _i$.

By Assumption (c) $\varepsilon _j $ are i.i.d. $N\left( {0,\sigma ^{2}} \right) $, so by using the law of large numbers, it’s easy to obtain:

$$\begin{aligned} lim_{n\rightarrow \infty } \bar{\varepsilon } =E\left( {\varepsilon _i \hbox {|}x_1 ,\ldots ,x_n } \right) =0. \end{aligned}$$

Therefore, substituting the above equation into Eq. (E.47) leads to:

$$\begin{aligned} lim_{n\rightarrow \infty } \bar{y} =\beta ^{*}lim_{n\rightarrow \infty } \bar{x} +\alpha ^{*}+lim_{n\rightarrow \infty } \bar{\varepsilon } =\beta ^{*}x+\alpha ^{*}. \end{aligned}$$

$\square $

Lemma 4

If $lim_{n\rightarrow \infty } \frac{g}{n}=0$, one has:

$$\begin{aligned} lim_{n\rightarrow \infty } \hat{\beta }_g= & {} lim_{n\rightarrow \infty } \hat{\beta } =\beta =\beta ^{*}. \end{aligned}$$

(E.48)

$$\begin{aligned} lim_{n\rightarrow \infty } \hat{\alpha } _g= & {} lim_{n\rightarrow \infty } \hat{\alpha } =\alpha =\alpha ^{*}. \end{aligned}$$

(E.49)

Proof

Here we only verify Eq. (E.48). By the same technique, one can verify Eq. (E.49).

It’s easy to check:

$$\begin{aligned} \hat{\beta }= & {} \frac{\mathop \sum \nolimits _{i=1}^n \left( {x_i -\bar{x} } \right) \left( {y_i -\bar{y} } \right) }{\mathop \sum \nolimits _{i=1}^n \left( {x_i -\bar{x} } \right) ^{2}} \nonumber \\= & {} \frac{\frac{1}{n}\mathop \sum \nolimits _{i=1}^{g-1} \left( {x_i -\bar{x} } \right) \left( {y_i -\bar{y} } \right) +\frac{1}{n}\mathop \sum \nolimits _{j=g}^n \left( {x_j -\bar{x} } \right) \left( {y_j -\bar{y} } \right) }{\frac{1}{n}\mathop \sum \nolimits _{i=1}^{g-1} \left( {x_i -\bar{x} } \right) ^{2}+\frac{1}{n}\mathop \sum \nolimits _{j=g}^n \left( {x_j -\bar{x} } \right) ^{2}}. \end{aligned}$$

(E.50)

Imposing $n\rightarrow \infty $ on Eq. (E.50) one obtains:

$$\begin{aligned} lim_{n\rightarrow \infty } \hat{\beta }= & {} \frac{lim_{n\rightarrow \infty } \frac{1}{n}\mathop \sum \nolimits _{j=g}^n \left( {x_j -\bar{x} } \right) \left( {y_j -\bar{y} } \right) }{lim_{n\rightarrow \infty } \frac{1}{n}\mathop \sum \nolimits _{j=g}^n \left( {x_j -\bar{x} } \right) ^{2}}. \nonumber \\= & {} \frac{lim_{n\rightarrow \infty } \frac{1}{n}\mathop \sum \nolimits _{j=g}^n \left( {x_j -lim_{n\rightarrow \infty } \bar{x} } \right) \left( {y_j -lim_{n\rightarrow \infty } \bar{y} } \right) }{lim_{n\rightarrow \infty } \frac{1}{n}\mathop \sum \nolimits _{j=g}^n \left( {x_j -lim_{n\rightarrow \infty } \bar{x} } \right) ^{2}}, \end{aligned}$$

(E.51)

where we have used $\left| {x_i } \right| <\infty $, $\left| {y_i } \right| <\infty $ and $lim_{n\rightarrow \infty } \frac{g}{n}=0$.

Using Lemma 3, Eq. (E.51) can be rewritten as:

$$\begin{aligned} lim_{n\rightarrow \infty } \hat{\beta }= & {} \frac{lim_{n\rightarrow \infty } \frac{1}{n}\mathop \sum \nolimits _{j=g}^n \left( {x_j -lim_{n\rightarrow \infty } \bar{x} _g } \right) \left( {y_j -lim_{n\rightarrow \infty } \bar{y} _g } \right) }{lim_{n\rightarrow \infty } \frac{1}{n}\mathop \sum \nolimits _{j=g}^n \left( {x_j -lim_{n\rightarrow \infty } \bar{x} _g } \right) ^{2}}. \nonumber \\= & {} lim_{n\rightarrow \infty } \frac{\mathop \sum \nolimits _{j=g}^n \left( {x_j -\bar{x} _g } \right) \left( {y_j -\bar{y} _g } \right) }{\mathop \sum \nolimits _{j=g}^n \left( {x_j -\bar{x} _g } \right) ^{2}} \nonumber \\= & {} lim_{n\rightarrow \infty } \, \hat{\beta }_g. \end{aligned}$$

(E.52)

Because $g^{*}<\infty $, by the same technique for deriving Eq. (E.52), we can obtain: $lim_{n\rightarrow \infty } \hat{\beta } =lim_{n\rightarrow \infty } \hat{\beta } _{g^{*}} $.

On the other hand, by Theorem 1 one has $lim_{n\rightarrow \infty } \hat{\beta } =\beta ^{*}$ and by Theorem 2 one has $lim_{n\rightarrow \infty } \hat{\beta } _{g^{*}} =\beta $. Therefore, we conclude that Eq. (E.48) holds. $\square $

Now we start to verify the Proposition 3.

Proof of Proposition 3

Imposing $n\rightarrow \infty $ on Eq. (E.33) one obtains:

$$\begin{aligned} lim_{n\rightarrow \infty } \hat{\mu } _g =lim_{n\rightarrow \infty } \bar{x} _g -\frac{lim_{n\rightarrow \infty } \bar{y} _g }{lim_{n\rightarrow \infty } \hat{\beta } _g }. \end{aligned}$$

(E.53)

Using Lemmas 1–4, Eq. (E.53) equals:

$$\begin{aligned} lim_{n\rightarrow \infty } \hat{\mu } _g =x-\frac{y}{\beta }. \end{aligned}$$

(E.54)

We have known

$$\begin{aligned} \mu =-\frac{\alpha }{\beta }. \end{aligned}$$

(E.55)

Imposing $n\rightarrow \infty $ on Eq. (E.29) one obtains:

$$\begin{aligned} \alpha =y-\beta x. \end{aligned}$$

(E.56)

Substituting Eqs. (E.55) and (E.56) into Eq. (E.54) yields:

$$\begin{aligned} lim_{n\rightarrow \infty } \hat{\mu } _g =x-\frac{y}{\beta }=\mu . \end{aligned}$$

(E.57)

Because $\frac{\bar{y} _g }{\hat{\beta }_g }>\delta >0$ for any n, by Lemmas 3 and 4 we have:

$$\begin{aligned} lim_{n\rightarrow \infty } \frac{\bar{y} _g }{\hat{\beta }_g }=\frac{y}{\beta }\ge \delta >0. \end{aligned}$$

(E.58)

Thus, by Eq. (E.57) we must conclude:

$$\begin{aligned} \mu<x<\infty , \end{aligned}$$

(E.59)

where we have used $\left| {x_i } \right| <\infty $.

Since $x_{g-1}<\mu <x_g $ or $x_g =\mu $, by Assumption (b) we have:

$$\begin{aligned} 0\le \mu<x<\infty . \end{aligned}$$

Now we further verify that, for a given n, there is no another $ g^{\prime } \ne g $ to guarantee that $ x_{{g^{\prime } - 1}}< \mu < x_{{g^{\prime }}}$ or $ x_{{g^{\prime }}} = \mu $. We discuss this point in terms of two cases. First, if $ x_{{g^{\prime } - 1}}< \mu < x_{{g^{\prime }}} $ holds, we have to conclude $ x_{{g - 1}}< \mu < x_{{g^{\prime }}} $ and $ x_{{g^{\prime } - 1}}< \mu < x_{g}$. For this case, we might as well assume $ g^{\prime } > g$, which by Assumption (b) leads to $ x_{{g^{\prime }}} > x_{g}$. This means $ x_{g} \le x_{{g - 1}}$, contradicting Assumption (b). Likewise, we can refute $ g^{\prime } < g$. Second, if $x_{{g^{\prime }}} = \mu $ and $ g^{\prime } \ne g$, then by Assumption (b) the contradiction occurs. In summary, we must conclude $ g^{\prime } = g$.

Finally, we verify $g<\infty $. If $g=\infty $, by $x_{g-1}<\mu <x_g $ or $x_g =\mu $, we have to conclude $lim_{l\rightarrow \infty } x_l =\mu =x$, which contradicts $\mu<x<\infty $.

Based on the results above, we should have $lim_{n\rightarrow \infty } g=g^{*}$. To see this, we might as well assume that $lim_{n\rightarrow \infty } g>g^{*}$. Then, by $x_{g-1}<\mu =x_{g^{*}} <x_g $, one has $lim_{n\rightarrow \infty } g-1=g^{*}$, which contradicts $x_{lim_{n\rightarrow \infty } g-1} <x_{g^{*}} $, where we have used $g<\infty $. $\square $

Description of data sources

Source	Countries	Link
Socio-Economic Database of Latin America and the Caribbean	ARG, BLZ, BOL, BRA, CHL, COL, CRI, DOM, ECU, SLV, HTI, HND, MEX, PRY, PER, URY, VEN	http://sedlac.econo.unlp.edu.ar/eng/statistics.php
Australian Bureau of Statistics	AUS	http://www.ausstats.abs.gov.au/ausstats/subscriber.nsf/0/B0530ECF7A48B909CA257BC80016E4D3/$File/65230_2011-12.pdf
Eurostat	AUT, BEL, BGR, HRV, CYP, CZE, DNK, EST, FIN, FRA, DEU, GRC, HUN, ISL, IRL, ITA, LVA, LTU, LUX, MKD, MLT, NLD, NOR, POL, PRT, ROU, SRB, SVK, SVN, ESP, SWE, CHE, TUR, GBR	http://appsso.eurostat.ec.europa.eu/nui/show.do?dataset=ilc_di01&lang=en
Statistics Canada	CAN	http://www.statcan.gc.ca/tables-tableaux/sum-som/l01/cst01/famil105a-eng.htm
Hong Kong	HKG	http://www.census2011.gov.hk/pdf/household-income.pdf
Nepal Rastra Bank	NPL	http://www.nrb.org.np/red/publications/study_reports/Study_ReportsHousehold%20Budget%20Survey%202008%20(Report).pdf
Russian Federal State Statistics Service	RUS	http://www.arcticstat.org/Table.aspx/Region/Russian_Federation/Indicator/Personal!Household_Income/2008-08-21-05/10874
Singapore Department of Statistics	SGP	http://www.singstat.gov.sg/docs/default-source/default-document-library/publications/publications_and_papers/household_income_and_expenditure/pp-s22.pdf
Korean Statistical Information Service	KOR	http://kosis.kr/statHtml/statHtml.do?orgId=101&tblId=DT_1L6E001&conn_path=I2&language=en
National Statistical Office of Thailand	THA	http://web.nso.go.th/en/survey/house_seco/data/Whole%20Kingdom_13_FullReport.pdf

Source	Countries	Link
United Kingdom National Statistics	GBR	https://www.gov.uk/government/uploads/system/uploads/attachment_data/file/503472/SPI_National_Statistics_T3_1_to_T3_11.pdf
United States Census Bureau	USA	http://www2.census.gov/programs-surveys/demo/tables/p60/252/table3.pdf
Eurostat	BGR, CZE, DNK, HUN, ISL, LTU, LVA, NOR, POL, SWE	http://ec.europa.eu/eurostat/web/exchange-rates/data/database
OECD Statistics	AUT, BEL, BGR, CZE, CHE, DEU, DNK, ESP, EST, FIN, FRA, GRC, HUN, IRL, ISL, ITA, LTU, LUX, LVA, NLD, NOR, POL, PRT, SVK, SVN, SWE	http://stats.oecd.org/Index.aspx?DataSetCode=FIXINCLSA#
Federated States of Micronesia	FSM	http://prism.spc.int/images/documents/HEIS/2005_FSM_HIES_Report-Final.pdf
Department of Census and Statistics Ministry of Finance and Planning Sri Lanka	LKA	http://www.statistics.gov.lk/HIES/HIES2012PrelimineryReport.pdf
Bangladesh Bureau of Statistics—Ministry of Planning	BGD	http://catalog.ihsn.org/index.php/catalog/2257
Liberia Institute for Statistics and Geo-Information Services—Government of Liberia	LBR	http://microdata.worldbank.org/index.php/catalog/2563
Central Agency for Public Mobilization and Statistics (CAPMAS)—Arab Republic of Egypt	EGY	http://www.ilo.org/surveydata/index.php/catalog/1261
Namibia Statistics Agency	NAM	http://www.ilo.org/surveydata/index.php/catalog/320
China Institute for Income Distribution	CHN	http://www.ciidbnu.org/

Rights and permissions

Reprints and permissions

About this article

Cite this article

Tao, Y., Wu, X., Zhou, T. et al. Exponential structure of income inequality: evidence from 67 countries. J Econ Interact Coord 14, 345–376 (2019). https://doi.org/10.1007/s11403-017-0211-6

Download citation

Received: 23 November 2016
Accepted: 11 December 2017
Published: 27 December 2017
Issue Date: 01 June 2019
DOI: https://doi.org/10.1007/s11403-017-0211-6

Keywords

JEL Classification

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Exponential structure of income inequality: evidence from 67 countries

Abstract

Similar content being viewed by others

Household income distribution in the USA

The fall in income inequality during COVID-19 in four European countries

Declining inequality in Latin America? Robustness checks for Peru

1 Introduction

2 Exponential income distribution

3 Empirical test for 67 countries

4 Consistent estimate of \(\mu \)

Proposition 3

Proof

Lemma 5

Proof

Corollary 1

Proof

5 Discussion

6 Conclusion

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding authors

Electronic supplementary material

Supplementary material 1 (docx 424 KB)

Appendices

Appendix

N-person non-cooperative game

Rawls’ fairness of “2-person allocation”

Density function of income distribution

Technological progress and entropy

Main propositions

1.1 Full sample

Theorem 1

Proof

Proposition 1

Proof

1.2 Truncation sample

Theorem 2

Proof

Proposition 2

Proof

Proposition 3

Lemma 1

Proof

Lemma 2

Proof

Lemma 3

Proof

Lemma 4

Proof

Proof of Proposition 3

Description of data sources

Rights and permissions

About this article

Cite this article

Share this article

Keywords

JEL Classification

Search

Navigation