1 Introduction

The Chinese economy has experienced phenomenal growth since 1978 when its transition to a market economy began. Over the period of our analysis from 1978 to 2011, real GDP per capita (at constant 2005 prices) grew from 1,582 to 27,309 Chinese yuan (CNY), which amounts to an average annual growth rate of 9.15 %. Initially, the economic growth also reduced income inequality; however income inequality rose substantially from the mid-1980s to the mid-1990s. The second half of the 1990s saw some reduction in inequality, but it did not last. Inequality continued to rise, reaching a peak in the mid-2000s before showing some signs of improvement (see Fig. 1). The broad pattern of income inequality gives rise to three questions: (1) is rising income inequality an inevitable “side effect” of economic development in its early stages? (2) Can we expect inequality to fall as the economy develops further? Or in other words, is there a Kuznets inverted-U relationship between income inequality and economic development in China? (3) What are some of the contributing factors to income inequality that we might be able to influence through government policy?

Fig. 1
figure 1

Real GDP per capita (at constant 2005 prices) and inequality 1978–2011. Data source China Compendium of Statistics 1949–2009, and 2010–2012 issues of China Statistical Yearbook

This paper uses both national time series and provincial panel data to investigate whether there was a Kuznets relationship between economic development and income inequality in China during the post-reform period of 1978–2011. It also studies other factors that may contribute to the observed inequality pattern. The main findings of our analysis are: (1) there was a Kuznets relationship between economic development and income inequality; (2) a driving force behind the non-linearity of the Kuznets process was urbanization; (3) low productivity in agriculture relative to that of the economy as a whole (i.e., dualism) was a significant contributing factor to income inequality; (4) the effect of education on inequality was not robust, but it appears that the expansion of secondary education and that of higher education had different effects on inequality; and (5) inflation exacerbates inequality. Our paper contributes to the existing literature in that (1) it lends empirical support to Kuznets’ original insight that urbanization is a main driver of the inverted-U relationship between inequality and economic development; (2) it establishes a direct empirical link between inequality and dualism in China; (3) it points to the possibility that education expansion at different levels may have different effects on inequality; and (4) it provides some evidence suggesting that inflation hurts the poor more than it hurts the rich.

The rest of the paper is organized as follows. Section 2 reviews related literature and explains how this paper contributes to it. Section 3 presents the empirical model and describes the data used in this study. Section 4 analyzes the estimation results. Section 5 concludes with some policy implications.

2 Literature Review

There is a vast literature on income inequality in China. We only survey two areas of studies that are closely related to this paper: the Kuznets hypothesis, and the pattern and some identified determinants of income inequality in China.

2.1 The Kuznets Hypothesis

Based on the statistical regularities he observed from historical economic data of England, Germany and the United States, Kuznets (1955) suggests that there is an inverted-U relationship between inequality and development: with inequality “widening in the early phases of economic growth when the transition from the pre-industrial to the industrial civilization was most rapid; becoming stabilized for a while; and then narrowing in the later phases” (p. 18). This is the well-known Kuznets hypothesis.

In his original work, Kuznets (1955) emphasized two drivers behind his hypothesis: the concentration of savings, and urbanization. Since upper-income earners generally save more, over time they would hold an increasing proportion of society’s total assets and as a result take an increasingly larger share of the total income. However, several factors have counteracted this savings concentration, for example, income redistribution policies, the increasing importance of service income, and the dynamism of a growing economy that offers more opportunities to all individuals.

On the role of urbanization, Kuznets (1955) argues that income tends to be more unevenly distributed in urban areas, and that the income gap between urban and rural residents does not necessarily narrow with economic development.Footnote 1 Given these tendencies, urbanization raises the share of the more unequal of the two component distributions, which increases overall inequality. In later stages of development, the widening of overall income inequality associated urbanization is more than offset by the narrowing of inequality within the urban sector as new migrants better adapt to urban life and obtain greater political power to support their claims for a larger income share. Thus the income inequality path takes the shape of an inverted-U.

While the features of the urbanization process as described by Kuznets (1955) would explain an inverted-U relationship between inequality and development, other researchers have shown that the simple fact that urbanization enables some initially poorer rural individuals to earn a higher income in urban areas could explain the Kuznets hypothesis. Using a simple two-sector model, Robinson (1976) demonstrates that even if the mean income and the income distribution for the urban and the rural sector remain unchanged, the overall inequality (as measured by the log variance of income) is a quadratic function of the urban population share. In other words, in a two-sector economy, overall inequality first rises and then falls as the share of urban population increases. Knight (1976) and Fields (1979) have obtained similar results with different measures of inequality. Knight (1976) explains the logic behind the inverted-U curve in the context of urbanization as follows. If everyone is initially in the rural sector and has the same low income, the Gini coefficient (G) is zero. If one person moves to the urban sector and receives a higher income without changing anyone else’s income, G goes up slightly. As more people move to the higher income sector, G continues to rise. When the number of people remaining in the lower-income rural sector falls to a certain level, G starts to fall. Therefore the process of urbanization would be accompanied by an initial increase and a subsequent decline of overall measured inequality.

A number of early cross-country empirical studies have confirmed the Kuznets relationship between income inequality and development (see for instance, Ahluwalia 1976; Lecaillon et al. 1984). However these studies have been criticized on both methodological and data comparability grounds (Saith 1983; Adelman and Robinson 1989; Anand and Kanbur 1993). It is argued that inter-temporal national studies rather than cross-country analyses are required to test the Kuznets hypothesis (Saith 1983). As an empirical investigation of the relationship between development and inequality in China over the post-reform period of 1978–2011, our paper provides a useful test of the Kuznets hypothesis. To our knowledge, few studies have specifically tested the Kuznets hypothesis in the Chinese context. One exception we find is Zhang et al. (2012) who, in the process of examining the effects of financial development on urban–rural inequality in China over the period 1978–2006, also identified an inverted-U relationship between urban–rural income gap and per capita real GDP. Different from Zhang et al. (2012), we focus on urbanization as the driver behind the Kuznets relationship in line with Kuznets’ original conjecture and subsequent theoretical work discussed above. Moreover, we consider a longer time period from 1978 to 2011 and use both national time series and provincial panel data.

2.2 Pattern and Determinants of Income Inequality in China

On the pattern of inequality in China, a number of distinctive features have been highlighted in the literature. First, the time path of inequality was non-linear. Kanbur and Zhang (2005) identify three peaks in inequality over the 50 years between 1950 and 2000: during the Great Famine (1961), at the end of the Cultural Revolution (1978) and in the current period of global integration (2000). Zhang and Zou (2012) also emphasize the rise and fall of inequality during different policy eras from central planning, cultural revolution to the reform era beginning in 1978. Second, rural–urban income disparity accounts for a large share of overall inequality in China (Hussain et al. 1994; Lin et al. 2004; Kanbur and Zhang 2005; Lu and Chen 2006; and Xie and Yang 2014). Third, poverty and inequality decreased substantially during the years of rural reform from 1978 to 1985 (Lu and Chen 2006; Ravallion and Chen 2007; and Knight 2014). However, the urban–rural income gap started to widen again in the mid-1980s (Zhang and Zou 2012). By 2002, per capita income of urban households were more than 3 times higher than that of rural households. The urban–rural income ratio started to fall in mid-2000s, but has remained above 3 (Li, et al. 2013).

There is an extensive literature on the determinants of income inequality in China. We only discuss some general findings and focus on a few determinants related to this paper. The first general conclusion to note is that inequality in China is not a result of the poor getting poorer, but rather of the rich getting richer much faster (Li et al. 2013). Based on the Rural Household Surveys and Urban Household Surveys, Ravallion and Chen (2007) demonstrate that over the period 1980–2001 when inequality rose, the incidence of extreme poverty fall drastically. Poverty rate has continued to fall in recent years. Using a PPP $1.25 a day definition of a poverty line, it is estimated that poverty rate fell from 19 % in 2002 to 8 % in 2007 (Li et al. 2013).

Secondly, it is well-recognized that understanding urban–rural income differences is crucial to understanding overall inequality in China. The urban–rural income gap can be attributed to a number of factors. Yang (2002) argues that the origin of China’s urban–rural divide can be traced back to the development strategy of the central planning era (1950s–1970s) which extracted agricultural surplus to support heavy-industries and urban development. To implement this strategy, the government artificially lowered the prices of agricultural commodities through unified procurement, set up people’s communes for collective agricultural production, and restricted rural to urban migration via the household registration (hukou) system. By the late 1970s, capital was mostly deployed in urban areas whereas rural labor was tied to collective agriculture. As a result, urban workers had much higher productivity and income than rural workers. Since 1978, a series of rural reforms were introduced, including the household responsibility system and greater labor mobility to non-agricultural activities. These reforms significantly increased farmers’ earnings and resulted in a reduction in urban–rural disparity between 1978 and 1985 (Yang 2002).

Unfortunately, the progress towards greater equality did not last. From mid-1980s, urban–rural inequality rose again. Many studies have looked into the reasons that possibly explain the reversal of the inequality trend. One main reason identified is the remaining barriers to labor mobility from rural to urban areas (Yang 1999, 2002; Lin and Chen 2011). Lin et al. (2004) observe that although the government gradually relaxed migration rules in the 1990s, most cities still denied permanent residency (“hukou”) to migrants. While mid-sized cities started granting permanent residency to migrants on selective bases in early 2000s, the hukou system remains a critical barrier to labor mobility (Zhang and Zou 2012). In particular, many rural migrants are unable to relocate to urban areas permanently as they face employment discrimination and high housing costs, and have difficulty accessing public services including education and healthcare (Li et al. 2013). As a result of the migration barrier, more labor remains in agriculture than is desirable and agricultural productivity suffers.

At the same time agricultural productivity growth was suppressed, labor productivity growth in urban industries has grown rapidly thanks to increasing capital intensity on account of massive investments over time. Some of the investments came directly from the government. Others were facilitated by government policies. For example, the government has kept the prices of fuel, electricity, water and land low and has not strictly enforced environment regulations (Kuijs and Wang 2006). These policies artificially inflated industrial profits and stimulated investment in urban industries. In addition, industrial firms, in particular large firms and state-owned-enterprises, were given preferential access to bank credit (Gordon and Li 2003; Allen et al. 2005).

As labor productivity growth in agricultural was hindered, and that in urban industries was promoted, the productivity gap between agriculture and the rest of the economy grew. This productivity gap is referred to in the literature as dualism. Nielsen (1994) and Bourguignon and Morrisson (1998) find dualism to be an important explanatory factor of income inequality in cross-country studies. However there seems to be a lack of empirical evidence on how dualism affects inequality in the Chinese context. A contribution of this paper is that it establishes a direct empirical link between dualism and inequality in China.

Apart from dualism, a considerable number of other determinants of inequality in China have been identified in the literature, including education (Zhang et al. 2012), financial development (Jalil and Feridun 2011), fiscal decentralization (Huang and Chen 2012), government transfers (Li et al. 2013), inflation (Ravallion and Chen 2007), openness (Jalil 2012) and taxation (Tao et al. 2004). We focus on education and inflation here. There are mixed views about the effect of education expansion on inequality in China. On the one hand, some researchers attribute urban–rural income differences to differences in education opportunities and outcomes, and argue that education expansion would improve human capital in rural areas and reduce income inequality (Sicular et al. 2007; Gustafsson et al. 2008). On the other hand, Zhang et al. (2012) find education expansion to be correlated to a worsening of rural–urban inequality in Eastern China probably due to brain drain in rural areas. In the existing literature, education expansion is typically measured by secondary school enrollment. In this paper, we measure education separately by higher education enrollment and secondary school enrollment, and examine whether they had different effects on inequality.

As for the effect of inflation on inequality, the results are mixed in different country studies (see for instance, Easterly and Fischer 2001; Bulir 2001; Clarke et al. 2006). In the Chinese context, the effect of inflation on inequality does not seen to have received much research attention although Ravallion and Chen (2007) have found that inflation hurts the poor. In this paper, we discuss the possible mechanisms through which inflation may improve or exacerbate inequality, and provide new evidence which suggests that on balance, inflation has the effect of worsening inequality in China.

3 Empirical Models

3.1 Model Specifications

We consider four factors that determine income inequality (TT) in China: urbanization (URBAN), dualism (DUAL), inflation (INF), and education (EDU)

$$TT = f(URBAN,DUAL,INF,EDU)$$
(1)

where EDU may be either higher education (HEDU) or secondary education (SEDU).

Based on Eq. (1), we can specify two empirical models, one with higher education and another with secondary education:

$$\begin{aligned} \ln (TT) = & \alpha_{0} + \alpha_{1} \ln (URBAN) + \alpha_{2} (\ln (URBAN))^{2} + \alpha_{3} \ln (DUAL) + \alpha_{4} \ln (INF) \\ & \quad + \alpha_{5} ln(HEDU)_{ - 5} + u \\ \end{aligned}$$
(2a)
$$\begin{aligned} \ln (TT) = & \beta_{0} + \beta_{1} \ln (URBAN) + \beta_{2} (\ln (URBAN))^{2} + \beta_{3} \ln (DUAL) + \beta_{4} \ln (INF) \\ & \quad + \beta_{5} ln(SEDU)_{ - 5} + \varepsilon \\ \end{aligned}$$
(2b)

Equations (2a) and (2b) specify a non-linear relationship between urbanization and inequality in line with the Kuznets hypothesis. As noted in the last section, a driving force behind the non-linear relationship between income inequality and development is the urbanization process. That is, as an economy develops, a larger share of the population moves to urban areas and earn a higher income. This movement leads to an initial increase and a subsequent fall in inequality (Kuznets 1955; Knight 1976). If the Kuznets relationship applies to the Chinese experience, we would find the coefficients of \(\ln (URBAN)\) (i.e., \(\alpha_{1}\) and \(\beta_{1}\)) to be positive and the those of \((\ln (URBAN))^{2}\) (i.e., \(\alpha_{2}\) and \(\beta_{2}\)) to be negative.

The second determinant of income inequality in our model is dualism (DUAL). As noted earlier, dualism is a measure of productivity difference between agriculture and the rest of the economy. Standard neoclassical economic theory postulates that if marginal productivity of a factor is higher in one sector than another, factors of production would be attracted to the sector with higher marginal productivity. Factor movement would continue until marginal productivities in all sectors are equalized, which means factor income should also tend to equalize. In real economies, however, such factor movements may be significantly constrained so that dualism results which in turn produces income disparity across sectors. In China, labor movements are restricted by the system of household registration (“hukou”), and capital allocation is also biased in favor of the urban sector, both leading to dualism. Dualism affects inequality because productivity differences correspond to different income-generating abilities. The higher the degree of dualism, that is, the more productivity in agriculture lags behind that in other sectors, the lower income rural residents are likely to earn relative to urban residents. Thus we expect the coefficients of \(\ln (DUAL)\) (i.e., \(\alpha_{1}\) and \(\beta_{1}\)) to be positive.

The third determinant of inequality in our model is inflation (INF). The study of re-distributional effect of inflation can be traced back to Cantillon (1755), who links inflation to an increase in money supply. He contends that where there is an increase in money supply, the new money enters the economy at a specific point, which means some people receive the new money first. The first receivers of new money spend it, so the money reaches their suppliers who in turn pass it on through their own purchases. In this way, the new money permeates the economy via multiple sequential transactions. The early recipients of the new money benefit at the expense of the late receivers because their income increase before prices increase for all the goods they buy; whereas the late recipients experience higher prices before their income levels rise. Since higher income earners tend to be politically more powerful and have better access to finance, they are more likely to receive the new money first and benefit from inflation (Bai and Cheng 2014). That is, inflation driven by a monetary expansion would redistribute wealth from the poor to the rich, thereby exacerbating inequality.

On the other hand, Lewis (1954) argues that in a dual economy with “unlimited supplies of labor”, credit creation can facilitate the employment of more labor to speed up capital formation. The expansion of credit will lead to inflation in the short run, but prices will fall after output expands with the increased capital investment. Before more output is produced however, the existing quantity of output is redistributed to the newly employed workers at the expense of the rest of the community and the income share of capital owners rises as more capital is accumulated. The increased employment tends to reduce income inequality but the higher share of capital income tends to raise it, so the net effect depends on the relative magnitudes of the two forces.

While the theories do not give a clear prediction about inflation’s net effects on inequality, we suspect that inflation driven by credit expansion in China had more of the effect of enriching the privileged class than creating job opportunities benefiting the poor. Thus we hypothesize that inflation had a net effect of widening inequality in China, that is, we expect \(\alpha_{4}\) and \(\beta_{4}\) to be positive.

The fourth determinant of inequality in our model is education. It is generally believed that in the long run, education is an important income equalizer for at least two reasons. First, low income families can more easily acquire human capital through education than accumulate physical or financial capital through savings or inheritance. Secondly, unlike physical capital accumulation that is prone to concentration, the expansion of human capital involves dispersion of knowledge and skills across the wider population (Ahluwalia 1976). However, in the short run, education expansion may be associated with higher inequality. For instance, if people from high income families have better education opportunities, overall inequality may increase during the course of education expansion (Nielsen 1994). Also, the income gap between the educated and the uneducated may increase as skill-biased technological change in recent decades has raised the return to education (Acemoglu 2002). Moreover, in the Chinese context, as migrants to urban areas tend to be more highly-educated, the brain drain in rural areas hinders rural sector productivity growth, thereby aggravating urban–rural income inequality.

We measure education separately by higher education enrollment (lagged by 5 years) and secondary school enrollment (lagged by 5 years), and examine whether they had different effects on inequality. To the extent that higher education is one path for talented young people in rural areas to find highly-paid employment in cities, the expansion of higher education may result in brain drain in rural areas, thereby widening rural–urban inequality. Secondary education expansion on the other hand may have a different effect. As an important way of accumulating human capital, secondary education improves the labor productivity and income earning abilities of all those receiving the education. The expansion of secondary education is likely to benefit the rural region more because the rural region started from a lower secondary school enrollment rate, and would receive a relatively greater improvement in education opportunities. Thus, we hypothesize that higher education expansion would have an inequality-widening effect, whereas secondary education expansion would have an inequality-narrowing effect. That is, we expect \(\alpha_{5}\) to be positive and \(\beta_{5}\) to be negative.

We performed the Chow Test for structural change and identified year 1992 as a breakpoint in the national time series. However year 1992 is not identified as breakpoint in the provincial panel data. Therefore we have included a time dummy T1992 (with T1992 = 0 for years 1978–1992; and = 1 for years 1993–2011) in our national time series estimation. Two events in 1992 help explain why there might be a structural break. First, following Deng Xiaoping’s southern tour in 1992, the Chinese central government endorsed the notion of “socialist market economy” and sped up the pace of economic reforms. Second, China’s adoption of the United Nations System of National Accounts 1993 marked a major step towards an international standard of national accounting.

3.2 Data and Variable Definitions

We use both national time series and provincial panel data for the period 1978–2011 to estimate Eqs. (2a) and (2b). The provincial panel data contain information for 31 province-level divisions of administrative areas (which includes 22 provinces, 5 autonomous regions and 4 directly-administered municipalities). The time series and panel data for 1978–2008 are from China Compendium of Statistics 19492009. The time series data for 2009–2011 are from 2010 to 2012 issues of China Statistical Yearbook. The panel data for 2009–2011 are from 2010 to 2012 issues of China Statistical Yearbook for Regional Economy.

The definitions of all variables in our model together with their corresponding data sources are presented in Table 1. The national time series data are in “Appendix 2”. The summary statistics of the panel data is provided in Table 2. We provide further details below.

Table 1 Definitions of variables
Table 2 Summary statistics

The Theil index (TT) is our measure of income inequality. We have computed TT from provincial data on rural and urban incomes and populations (see the “Appendix 1” for calculation details). It is important to note that in the national time series, TT can be understood as capturing three components: inequality between rural and urban residents, inequality within rural residents across different provinces and inequality within urban residents across different provinces. In contrast, TT in the provincial panel data capture only inequality between rural and urban residents.

URBAN is the degree of urbanization measured by the share of urban population in total population. The degree of urbanization has increased substantially over our data period. In 1978, about 17.9 % of the population resided in urban areas. By 2011, the figure had risen to 51.3 %.

DUAL is measured by the inverse of agricultural labor productivity relative to labor productivity for the economy as a whole, so that a larger value of DUAL indicates a lower relative productivity in agriculture. Since the primary sector in China contains mainly agriculture, it is often treated as being “equivalent to” agriculture in the literature (Fan et al. 2011). We thus use primary sector productivity as a proxy for agricultural productivity. DUAL fell from 2.5 in 1978 to 1.99 in 1984, then started to rise, reaching a peak of 3.8 in 2003. In 2011, DUAL remained at a high level of 3.5.

INF is measured by the consumer price index series with preceding year equal to 100. HEDU is higher education enrollment per 10,000 population (lagged by 5 years). Higher education enrollment increased substantially from 3.52 in 1973 to 132.28 in 2006. SEDU is secondary education enrollment per 100 population (lagged by 5 years). Secondary enrollment increased from 3.86 in 1973 to 7.82 in 2006.

4 Estimation Results

We conduct our time series estimation of Eqs. (2a) and (2b) using the Autoregressive Distributed Lag model (ARDL) advocated by Pesaran (1997) and Pesaran and Smith (1998). This approach has been widely used in time series analyses, including studies of inequality (see, for instance, Jalil 2012). The ARDL procedure consists of three steps. The first step involves selecting the appropriate lag orders of the ARDL model using either the Akaike Information Criterion (AIC) or the Schwartz Bayesian Criterion (SC). A variable Addition Test (ARDL case) is conducted to see whether there exists a long-run relationship among the variables. If the null hypothesis of no co-integration is rejected, one proceeds to the second step of estimating the long-run relationship using the selected ARDL model. In the third step, an error correction model is estimated, providing information on the speed of adjustment back to the long-run equilibrium following a shock.

We use the (province) fixed effects model for our panel data estimation. The model is estimated with first-differenced variables.

Before the models are estimated, we first test whether the variables under consideration are stationary. The test results for the time series and panel data are reported in Tables 3 and 4, respectively. The results suggest that all variables in first differences are stationary, which means that our estimation methods can be applied. For the time series, we also test the existence of a long term relationship among the variables (which is the second step of the ARDL method as described earlier). The test (reported in Table 5) indicates that a long run relationship exists for each model.

Table 3 Unit root tests: national time series 1978–2011
Table 4 Unit root tests: provincial panel data 1978–2011
Table 5 Variable addition tests for the existence of a long relationship

The results from the estimating our empirical model [Eq. (2a) and (2b)] are presented in Table 6. There are four estimation results, two (estimations 1 and 2) based on national time series data and two (estimations 3 and 4) based on provincial panel data.

Table 6 Estimation results

4.1 National Time Series Analysis

The model specifications for the two national time series estimations are the same except that in estimations 1, education is measured by higher education enrollment [ln(HEDU)] whereas in estimation 2, it is measured by secondary education enrollment [ln(SEDU)]. Since the estimated coefficient of ln(SEDU) is insignificant but that of ln(HEDU) is, we consider higher education to be a more suitable measure of education, and therefore we focus on the results of estimation 1 for the national time series data analysis.

In estimation 1, the coefficient of ln(URBAN) is positive and significant; and that of LNURBAN 2 is negative and significant. This is consistent with the theoretical prediction that urbanization is an important driver behind the Kuznets process. It indicates that the Chinese development experience confirms the Kuznets hypothesis that there is an inverted-U relationship between income inequality and development. Specifically, the coefficients of ln(URBAN) and [ln(URBAN)]2 combined imply that the Kuznets turning point is at where the rate of urbanization is about 25 (=\(e^{25.8252/(2 \times 4.0138)}\)) percent. This occurred between year 1986 and 1987 (see “Appendix 2”). It is clear from Fig. 1 that inequality continued to rise in 1987, and did not peak until 2003. This suggests while increasing urbanization above 25 % had the effect of reducing inequality, this effect was more than offset by other factors which continued to push up inequality. These other factors included dualism [ln(DUAL)], inflation [ln(INF)] and higher education [ln(HEDU)].

The estimated coefficient of ln(DUAL) is positive and significant. Noting that a high value of ln(DUAL) means low agricultural productivity relative to productivity of the economy as a whole, the positive coefficient of ln(DUAL) confirms our conjecture that low productivity in agriculture is likely to be associated with high overall income inequality. The importance of dualism in explaining inequality in China is consistent with the fact that a substantial proportion of overall inequality in China is attributable to urban–rural inequality (Lin and Chen 2011). This result is also in line with the findings of Nielsen (1994) and Bourguignon and Morrisson (1998). Estimation 1 indicates that a 1 % increase in dualism leads to a 2.8 % increase in inequality.

The estimated coefficient of ln(INF) is also positive and significant. This lends some support to our conjecture that the inflation in China benefited the rich and privileged (in the form of easier access to credit) more than it benefited the poor (in the form of short term employment opportunities). Therefore the net effect of inflation on inequality was positive. It is estimated that a 1 % increase in inflation was associated with a 1.2 % increase in inequality.

In addition, the coefficient of ln(HEDU) is positive and significant, suggesting that higher education expansion was associated with an increase in inequality. This is probably due to unequal education opportunities and brain drain from poor regions of the country. A 1 % increase in higher education enrolled was associated with a 0.6 % increase in inequality. Finally, the time dummy variable Y1992 is significantly negative (at 10 % level) suggesting a lower level of inequality after 1992.

4.2 Provincial Panel Data Analysis

Similar to the national time series estimations, we use two different measures of education: higher education enrollment (estimation 3) and secondary education enrollment (estimation 4). As shown in Table 5, the estimated coefficient of secondary education [ln(SEDU)] is negative and significant at 1 % level, in particular, an increase in secondary enrollment by 1 % was associated with a fall in inequality by 0.7 %. This is consistent with our conjecture that the expansion of secondary education benefits poorer rural areas more because in richer urban areas secondary education was already widespread during our data period. However, contrary to the result in the national time series analysis above, an expansion of higher education also had statistically significant (at 10 % level) negative effect on inequality. This suggests that possibly the brain drain effect encouraged by higher education participation operated more at the inter-provincial level—the brightest students from poor provinces went to better universities in richer provinces and worked in richer provinces after graduation. Since provincial data do not capture inter-provincial income differences, the expansion of higher education mainly had the effect of enhancing human capital, which in turn narrowed income inequality although the quantitative impact was small: a 1 % increase in higher education expansion was associated with a 0.15 % decrease in inequality at the provincial level.

Now turn to our main question of whether there was a Kuznets inverted-U relationship. In both estimations 3 and 4, the estimated coefficient of ln(URBAN) is positive and significant at 1 % level, and that of [ln(URBAN)]2 is negative and significant at 1 % level, confirming the Kuznets hypothesis. Estimation 3 suggests a Kuznets turning point at urbanization level of 31 (=\(e^{ 5. 4 3 6 5/(2 \times 0. 7 9 2 3)}\)) percent whereas estimation 4 suggests a Kuznets turning point at urbanization level of 30 (=\(e^{ 5. 0 7 4 4/(2 \times 0. 7 4 5 9)}\) percent. On average this level of urbanization was achieved in 1985 (see Table 2). In general, richer provinces and provinces with a heavy industrial bias reached the turning point earlier than poorer ones (see Table 7). For example, Beijing and Helongjian reached the turning point before the beginning of our data period of 1978, whereas Guizhou did not reach it until 2010.

Table 7 Kuznets turning points

Consistent with the national time series analysis, both dualism [ln(DUAL)] and inflation [ln(INF)] are found to have had the effect of exacerbating inequality. A 1 % increase in dualism was associated with an increase in inequality by about 1.0 % (estimation 3) to 1.1 % (estimation 4). A 1 % increase in inflation was associated with an increase in inequality by 0.9 % (estimation 4) to 0.95 % (estimation 3).

5 Conclusion

In this paper, we have studied the pattern and determinants of overall income inequality in the post-reform Chinese economy of 1978–2011 using both national time series and provincial panel data.

We have identified in both datasets a Kuznets inverted-U relationship between income inequality and economic development and have shown that urbanization was an important driver of the Kuznets process. Further we estimate that the Kuznets turning point for the national time series is at the urbanization rate of about 25 %, and that for the panel data is at the urbanization rate of about 30 %. For both datasets, the turning point occurred in the mid-1980s. Since inequality continued to rise after the mid-1980s, we argue that while increased urbanization after the mid-1980s had the effect of narrowing inequality, this effect was more than offset by other factors. In particular, we found that dualism and inflation were significant contributing factors to income inequality.

A couple of implications following from the results of our paper are worth noting. First, since measured inequality rises with the increasing relative size of the higher-income urban population in the initial stages of development even if the relative average income between rural and urban residents remain constant (Knight 1976; and Fields 1979), measured inequality by itself does not give us sufficient information about the well-being of different social groups. To have a clear understanding of the welfare implications of inequality, it is important to also look at more detailed information instead of focusing on a single aggregate statistic. For instance, it will be informative to look at how population sizes change for groups of different income levels over time.

Secondly, the importance of dualism in explaining inequality (after controlling for urbanization) suggests that improving agricultural productivity not only enhances efficiency but also is likely to be an effective way of reducing inequality. From the beginning of the reforms in 1978 to the mid-1980s, agricultural productivity increased significantly with the implementation of the household responsibility system and with the rapid growth of township and village enterprises (TVEs) absorbing underemployed agricultural labor. During the same time, inequality fell substantially (see Fig. 1). The increased inequality in subsequent years may be partly attributable to urban-biased policies such as tightened state control of the financial sector severely hindering rural sector development (Huang 2012). To address growing public concern over inequality, policies should be directed to facilitate improvement in the rural sector. For instance, the rural sector’s access to banking finance should be improved; the hukou system of household registration should be further relaxed to allow freer movement of labor between urban and rural areas; and the urban-bias in public investment spending should be corrected.