Introduction

Many policymakers, governments, and economists view sustained development and the worldwide impact of China as both alarming and impressive. China raised almost half of its deprived population above the poverty line in a very short period (Shen et al. 2018 and Li et al. 2021a, b). Apart from several regional disparities, economic, and social issues, this change occurs at the cost of severe environmental issues. OECD (2018) highlighted that although leading worldwide rankings in various groups, including exports and imports, FDI, China is also at front-runner position for production and consumption of energy, carbon emissions, and greenhouse gas emissions. Nonetheless, China is very much committed to making efforts for attaining the Sustainable Development Goals (SDGs). In the United Nations General Assembly conference of 2015, the member states vowed to eradicate poverty and hunger, provision of quality-based and equal education opportunities, reduction in carbon emissions, nitrogen gas, and certain other detrimental pollutants (Tiwari et al. 2020 and Yan et al. 2021). Modifications in economic and social factors such as knowledge, innovation, income, and FDI can enhance the environmental quality during the path of economic development (Aslam et al. 2021; Jafri et al. 2021; Jan et al. 2021; Ullah et al. 2021). However, literature reported that economic factors are more responsible as compared to social factors for CO2 emissions (Hamid et al.2020; Jinru et al. 2021; Mujtaba, et al. 2021; Mujtaba and Jena 2021).

In the view of researchers, education may contribute significantly to solving various ecological, economic, and social issues (Yin et al. 2021). As a social paradigm, education works as an influential instrument for altering lives, as it helps in improving innovation capabilities, skills, and knowledge of the economies (Bangay 2016). Meanwhile, education helps in enhancing the quality of the environment by creating consciousness about pollutants and environmental issues and helps in circulating compulsory wisdom mandatory for the enactment of green standards of living, green production, eco-innovation, and carbon decline policies for attainment of viable future, particularly for developing economies (Khattak et al. 2020). The policymakers and researchers argue that higher education and economic growth are closely related due to several implications for technological development and knowledge. But the empirical literature is insufficient on the nexus between environmental pollution and higher education, revealing that economists and environmentalists have paid diminutive consideration to this imperative issue (Uddin 2014). The available literature on this issue represents two strands of research. The first strand infers that education plays a significant positive role in reducing CO2 emissions, and the second strand of literature demonstrates on-campus accomplishments for higher CO2 emissions.

First strand of literature claims that education increases CO2 emissions through numerous educational activities. For example, the study done by Caird et al. (2015) investigated the education-carbon footprint nexus for 15 higher educational institutions in the UK. The findings of the study reveal that on-campus educational activities result in increasing carbon emissions; however, online educational activities result in reducing CO2 emissions. The study highlighted some factors that are linked with on-campus educational activities, such as consumption of energy construction, and certain other campus-related operation. For Japan, Parra et al. (2018) reported that teaching activities, studies, and on-campus miscellaneous work contributed significantly to raising CO2 emissions. Versteijlen et al.’s (2017) study for the Netherland highlighted that frequent traveling by students and staff are the main source of high carbon emissions and online educational setup tends to reduce global and regional CO2 footprint. The literature highlighted some other determinants of carbon emissions that are directly linked with on-campus educational activities such as air transportation (staff and students sometimes travel through long distances), food, housing activities, consumption, and mobility (Sippel et al. 2018).

The second strand of literature argues that education exerts a significant negative impact on carbon emissions through various transmission channels. Li and Zhou (2019) examined that how demographic structure and higher education affect carbon emissions at the national and provincial levels in China. The study reported a negative relationship between higher education and CO2 emissions in the case of East China. Uddin (2014) concluded that an increase in educational expenditures augments green economic growth and mitigates carbon emissions through increased awareness. A bulk of literature infers that improvement in the education sector mitigates carbon emissions (for example, Sarwar et al. 2019; Subramaniam and Masron 2020). Conversely, the study done by Balaguer and Cantavella (2018) investigated the linkage between environmental quality and education in the context of the environmental Kuznets curve (EKC) framework. The study reported harmful impacts of higher education on environmental quality in the early stages, but afterward, when the education level achieves a certain threshold, it starts improving the quality of the environment. Educational attainment can be used to mitigate CO2 emissions via environmental awareness, energy efficiency, green technology, and green growth (Çakar et al. 2021; Mankiw et al. 1992; Yang et al. 2017). Human capital is used to adopt new technologies and pollution-free sources in the industrial sector (Dasgupta et al. 2000), which in turn improves the environment. Educational attainments can significantly alleviate environmental costs (Manderson and Kneller 2012). In the end, a detailed summary of the empirical literature is reported in Table 1.

Table 1 Summary of literature

China has taken several initiatives in its higher education field in terms of higher education institutes, quality of education, and student enrollment. An extraordinary increase has been observed in both the number of student enrollments and the number of higher education institutions from the time period 2000 to 2019. For example, the number of students has increased from 9.398 million to 40.02 million, and the number of institutes has increased from 1813 to 2688 during this period. Furthermore, the expenditures on the education sector have increased up to 26% of the GDP in this period. Regardless of the anticipated theoretical importance of education, the CO2 emission and education association remain widely unexplored in the literature of environmental economics. The empirical findings from available studies provide mixed evidence on the linkage between education and CO2 emissions. Some scholars reported a positive impact of education on CO2 emissions, and others demonstrated the negative impact of education on CO2 emissions (for instance, Balaguer and Cantavella 2018; Sippel et al. 2018; Li and Zhou 2019). In the case of China, there is only one study available that investigated the impact of higher education on CO2 emissions for East China by using total student enrollments in thirteen higher education institutes as a proxy for measuring higher education and reported a negative impact of higher education on CO2 emissions.

The linkage between CO2 emissions and educational attainment has been explored in previous literature, but there exist various shortcomings that need to be considered. Firstly, existing studies have explored the nexus between human capital and CO2 emissions at an aggregate level; little attention is given to disaggregated level analysis. Secondly, although existing studies have explored the impact of educational attainment on CO2 emissions, the importance of educational attainment in the EKC hypothesis framework is still unexplored. Thirdly, considerable literature has explored the association between educational attainment and CO2 emissions, but no clear consensus has been achieved yet. Lastly, a lot of literature addresses the educational attainment implications for CO2 emissions based on cross-country data, but the said relationship has not been explored for China yet. Thus, our primary objective of the study is to fill the knowledge gap in environmental economics by examining the EKC hypothesis for different levels of education and economic development.

This study contains both practical applications and theoretical contributions. Considering the theoretical contribution, this study will strengthen and enrich the existing literature in the context of EKC hypothesis using aggregate and disaggregate levels of education. It will help in understandings the effects of educational attainment on CO2 emissions. On the practical ground, this study aims at providing important implications for China to design more appropriate strategies for educational attainment, environmental sustainability, and green growth. Moreover, these implications can also be considered for other developed and developing high-polluted economies. This study endorses the significant role of educational attainment for environmental sustainability and design appropriate policy measures to achieve a clean environment.

Model and method

The theoretical base of the model is derived from the EKC model. The EKC model was first presented by Grossman and Krueger (1995), which confirmed the inverted U-shaped relationship between GDP and CO2 emissions. According to the EKC hypothesis, at the initial level of economic development, CO2 emissions rises, and at the later stage of the economic development, CO2 emissions decline (Hamid et al. 2021 and Isik et al. 2021). Balaguer and Cantavella (2018) augmented the EKC model for Australia with the variables of education. Following them, we also have augmented the EKC model of China with different variables of education. Hence, the baseline looks as follows:

$${\mathrm{CO}2}_{\mathrm{t}}= {\upbeta }_{0}+ {\upbeta }_{1}{\mathrm{X}}_{\mathrm{t}}+{\upbeta }_{2}{{\mathrm{LnY}}^{2}}_{\mathrm{t}}+ {\upbeta }_{3}{\mathrm{LnZ}}_{\mathrm{t}}+ {\upvarepsilon }_{\mathrm{t}}$$
(1)

where the dependent variable is CO2 emissions, a proxy of environmental degradation. X is a set of main variables that include primary education (PE), secondary education (SE), tertiary education (TE), the average total years of schooling (AE), and economic growth (GDP). Then, Y2 is a set of variables where we have taken the square of each of the main variables such as a square of primary education (PE2), square of secondary education (SE2), square of tertiary education (TE2), square of average total years of schooling (AE2), and economic growth (GDP2). Lastly, Zt is a set of control variables that include energy consumption (EC), individuals using the internet (Internet), and government expenditure (GE). Model (1) is only offering the long-run estimates. To find the short-run outcomes with the long run, we reformulate the basic model in an error correction format. Our extended model is shown below:

$${\Delta \mathrm{CO}2}_{\mathrm{t}}= {\upbeta }_{0}+ \sum_{\mathrm{k}=1}^{\mathrm{n}1}{\upbeta }_{1\mathrm{k}}{\Delta \mathrm{CO}2}_{\mathrm{t}-\mathrm{k}}+ \sum_{\mathrm{k}=0}^{\mathrm{n}2}{\upbeta }_{2\mathrm{k}}{\Delta \mathrm{X}}_{\mathrm{t}-\mathrm{k}}+ \sum_{\mathrm{k}=0}^{\mathrm{n}3}{\upbeta }_{3\mathrm{k}}{{\mathrm{Y}}^{2}}_{\mathrm{t}-\mathrm{k}}+ \sum_{\mathrm{k}=0}^{\mathrm{n}4}{\upbeta }_{4\mathrm{k}}{\mathrm{Z}}_{\mathrm{t}-\mathrm{k}}+ {\upgamma }_{1}{\mathrm{CO}2}_{\mathrm{t}-1}+ {\upgamma }_{2}{\mathrm{X}}_{\mathrm{t}-1}+ {\upgamma }_{3}{{\mathrm{X}}^{2}}_{\mathrm{t}-1}+ {\upgamma }_{4}{\mathrm{Z}}_{\mathrm{t}-1}+ {\upvarepsilon }_{\mathrm{t}}$$
(2)

Equation (2) is an error-correction model where coefficient estimates \({\upbeta }_{1\mathrm{k}}\), \({\upbeta }_{2\mathrm{k}}\), \({\upbeta }_{3\mathrm{k}}\), and \({\upbeta }_{4\mathrm{k}}\) reveal short-run effects and estimates of \({\upgamma }_{1}\),\({\upgamma }_{2}\),\({\upgamma }_{3}\), and \({\upgamma }_{4}\) are long-run effects. A former study of Pesaran et al. (2001) recommends two diagnostic tests (e.g., F-test and t-test) to establish cointegration (Sohail et al. 2021). F-test null hypothesis is that H0: \({\upgamma }_{1}\)=\({\upgamma }_{2}\)=\({\upgamma }_{3}={\upgamma }_{4}\)=0 but the alternative is H1:\({\upgamma }_{1}\)≠0,\({\upgamma }_{2}\)≠0,\({\upgamma }_{3}\)≠0, and \({\upgamma }_{4}\)≠0. The alternative hypothesis infers the existence of a long-run relationship among concern variables. Indeed, under the ARDL approach, variables could be a mixture of I(1) and I(0). Long-run as well short-run effects can be easily judged in a single step via ARDL. The ARDL method has been used in empirical research works in numerous contexts (Usman et al. 2021a, b) which advice that the ARDL model is more suitable for short- and long-run analysis. The ARDL offers different estimates in short and long run at different lags order in a model. The ARDL diagnostic tests are also equally suitable with short- and long-run estimates (Lei et al. 2021). Thus, to check for serial correlation, we employ the Lagrange multiplier (LM) test. We use the Breusch–Pagan (BP) test to measure the heteroskedasticity and Ramsey’s RESET test to find misspecification. Finally, CUSUM and CUSUM-sq tests are described to judge the stability of the long- and short-run coefficient estimates.

Data

The study aims to investigate the role of educational activities in determining the EKC in case of China for time horizon 1991 to 2019. The dependent variable EKC is measured by carbon dioxide emissions and greenhouse gas emissions. The study utilizes four proxies to measure the role of educational activities in China such as primary education, secondary education, tertiary education, and average education. These education levels are measured as school enrollments at gross percentage of primary, secondary, and tertiary levels; however, average education is measured as average total years of schooling. Some other variables namely GDP per capita measured at constant 2010US$, energy consumption in kg of oil equivalent per capita, use of internet in the percentage of the population, and general government final consumption expenditures in the percentage of GDP are taken as control variables. Table 2 gives us information about variables and definitions. Data for all variables have been sourced from the World Bank and the human development index.

Table 2 Variables and definitions

Results and discussion

Our aim in this study is to investigate the role of school enrollment at various levels in the formation of the EKC in China. To get estimates of the variables, we employ the ARDL model. ARDL has various advantages over other techniques, but the foremost advantage of this technique is that it performs well even if the variables are a mixture of I(0) and I(1). However, we cannot include the variables of I(2) in the analysis. Therefore, to confirm that none of the included variables is I(2), we employ two unit root tests, one without a structural break and another with a structural break. Table 3 provides the findings of both the unit root tests. From the results, we can deduce that most of the variables are I(1) and some are I(0) by using either of the tests; however, none of the variables in the analysis is I(2). These findings give us a positive signal that we can apply the ARDL model. Another important thing that needs to be decided before starting our formal discussion is selecting the appropriate lag. Our data is annual, and we apply a maximum of two lags, and to choose the correct number of lags, we use Akaike information criterion (AIC).

Table 3 Unit root testing

We have used four different proxies for education for empirical analysis: enrollment at primary, secondary, and tertiary levels. Moreover, we also include the average total years of schooling to get the accumulative impact of education on the CO2 emissions. First, we discuss the short-run estimates, and after that, we discuss the ones in the long run.

Table 4 illustrates the short-run and long-run results of the ARDL model. In short run, the estimated coefficient of D(PE) is positively significant, and the estimate of D(PE2) is negatively significant. Similarly, the estimate attached to D(SE) is positive, and the estimate attached to D(SE2) is negative and significant. Then the estimate attached to D(TE) is positively significant, whereas the estimate attached to D(TE2) is negatively significant. Lastly, the estimates attached D(AE) is positive at first lag, and the estimate attached to D(AE2) is negatively significant at first lag. These findings confer that increased enrollment at all levels does increase the CO2 emissions; however, at the later stage, increased enrollment at all levels decreases the CO2 emissions. The estimated coefficients of D(GDP) are positively significant in all models, and the estimates attached to D(GDP2) are negatively significant, confirming the presence of EKC in the short run. Given the importance of long-run results, we now pay attention to the long-run estimates provided in Table 4. The validity of the long-run results depends on the confirmation of cointegration between them. To that end, we rely on two tests of cointegration, i.e., F-test and ECMt-1, and the results of both the tests are provided in Table 4, which confirm that cointegration exists among the CO2, SE, PE, TE, AE, GDP, EC, Internet, and GE.

Table 4 ARDL estimates of CO2 emissions

The long-run estimates attached to PE, SE, TE, and AE are positively significant, or more precisely, we can say that a 1% rise in primary, secondary, tertiary, and aggregate enrollment increases the CO2 emissions by 1.570%, 1.218%, 0.008%, and 0.580%. Conversely, the estimates attached to PE2, SE2, TE2, and AE2 are negatively significant, or in terms of elasticity, we can say that a 1% rise in PE2, SE2, TE2, and AE2 causes the CO2 emissions to decline by 2.058%, 0.171%, 0.023%, and 0.580%. This result is consistent with Duarte et al. (2012), who infer that a lower level of educated consumers produces more CO2 emissions in Spain. Our findings are also backed by various studies such as Li et al. (2021a, b), who argued that improvement in human capital through higher education can raise awareness among people regarding the utilization of environment-friendly technologies, thus reducing CO2 emissions. Furthermore, it is supported by the argument that development in human quality reduces the abatement activities that tends to a significant decline in CO2 emission. In contrast, Li and Zhou’s (2019) study argues that higher education tends to increase CO2 emissions in the highest income most developed region of China.

Generally, our findings imply that an increase in educational activities at all levels increases the CO2 emissions at the early stages; however, later on, education reduces the CO2 emissions. In other words, the relationship between CO2 emissions and all levels of education follows an inverted U-shaped path, implying that CO2 emissions increase in the early parts of increased educational activities and decline at the later stages. As the demand for education in the country increases, on one side, more educational infrastructure such as schools, colleges, universities, and hostels are required; on the other side, demand for transportation facilities, laundry, dry cleaning, and saloon services will also increase (Sippel et al. 2018). As a result, the energy demand rises, which is a primary driver of CO2 emissions. However, the positive effects of education on the environment may come later once the education sector starts producing more trained, skilled, capable, and efficient human resources that can replace the more energy-intensive inputs in the production process and ultimately reduce CO2 emissions.

Social and economic activities performed by humans cause emissions, and education is an important source that can positively alter human behavior (Jian et al. 2021). Moreover, education help raise the technical skills and capabilities that will improve human efficiency in all walks of life and contribute to economic development (Usman et al. 2021a, b). Experts of economics strongly agree that human capital is essential for the economic growth of a country in the long run. They also agree that human capital is a by-product of formal education, trained and experienced labor, and research and development, which are fundamental parts of the inputs used in the production function (Wang and Shao 2019). Most developing economies have replaced their labor-intensive production techniques with human capital intensive ones and achieved the economic goals with a more clean environment. However, the evidence suggests that education may pollute the environment in developing and emerging economies due to energy-intensive infrastructure and low economic development (Mahalik et al. 2021). The energy consumed by educational activities at various levels may differ at different stages of economic development (Inglesi-Lotz and Morales, 2017), which may form an inverted U-shaped relationship between education and CO2 emissions.

As far as the long-run relationship between economic development and environmental quality is concerned, we can see that the estimates attached to GDP are positive but insignificant in most models. However, the estimates attached to GDP2 are negatively significant in all models confirming the presence of an inverted U-shaped relationship between economic development and CO2 emissions in China. Such an inverted U-shaped relationship is known as the EKC of Grossman and Krueger (1995), which implies that the early part of economic growth pollutes the environment and later improves it. This result also implies that China is heading towards sustainable development, i.e., achieving economic growth without polluting the environment further. The estimate of the control variable of EC suggests that increased energy consumption causes the CO2 emissions to rise; however, the estimates of Internet and GE are significant and negative, implying that increased internet subscriptions and government expenditures cause the CO2 emissions to decline in China.

Finally, to confirm the efficiency of our estimates, some diagnostic tests are also outlined in Table 4. Firstly, Lagrange multiplier (LM) test confirms that our residuals are free from first-order serial correlation. Secondly, the Ramsey RESET test confirms that no misspecification is found in our model. Thirdly, Breusch Pagan (BP) test approves that the variance of the error terms is homoscedastic. Finally, the CUSUM and CUSUMSQ confirm the parametric stability of the models where “U” represents the stable parameters and “US” represents the unstable parameters. Sensitivity analysis is also done in order to gauge the robustness of empirical results by using greenhouse gas emissions as dependent variables. The findings of the robust model are given in Table 5. The effects of education variables and control variables remain consistent with the previous model findings as shown in Table 4. The robustness of the findings is confirmed through sensitivity analysis.

Table 5 ARDL estimates of greenhouse gas emissions (GHG)

Conclusion and implications

Human capital is a by-product of education, and it serves as an input in the production function. Many advanced economies have moved from labor-intensive production methods to human capital intensive, which has transformed their economies and helped them achieve sustainable development. Following the footprint of many developed economies, emerging markets and developing economies started to develop human capital through education, technical training, and skill development programs. As a result, even the production techniques in the emerging economies are now becoming more environmentally friendly and emitting fewer carbons. Education provides the base for human capital development; however, investment in education infrastructure may hurt the environmental quality at the earlier stages and improve at the later stages. Empirics also have linked the positive impact of education on the environmental quality with the level of economic development, i.e., the positive effects of education on the environment are visible at a higher level of economic development.

China is an emerging economy growing fast, and the role of human capital is on a high in China’s economic development. However, the role of different levels of education in the formation of EKC in China is underexplored. Therefore, the primary focus of this study is to evaluate the impact of various levels of education on the CO2 emissions in China. Moreover, the study also tested the EKC hypothesis for different levels of education and economic development. The analysis employed disaggregate and aggregate data for education that included enrollment at primary, secondary, and tertiary levels and the average year of schooling. For empirical analysis, we employed an error correction model and bounds testing approach to cointegration. The results of the study provided some useful information both in the short and long run. All the proxies of education positively impact CO2 emissions at the initial level both in the short and long run; however, when we take the square of these variables, the effects of education on CO2 emissions become negative. Similarly, the impact of economic growth on CO2 emissions is positive in the short and long run, and the square of economic growth on CO2 emissions is negative, supporting the EKC hypothesis. These findings imply that both education and economic growth follow an inverted U-shaped path while affecting the CO2 emissions, i.e., both education and economic growth degrades the environmental quality at the initial stages and improves at the later stages.

Based on the findings, we also provide some practical policy implications. Both education and development are interconnected, and both can affect the environmental quality via increased energy demand. Economic growth requires energy, which deteriorates the environmental quality at the initial stage due to energy-intensive infrastructure in the developing economies. Energy conservation policy may reduce CO2 emissions, but it can also deter China’s economic growth because China heavily relies on non-renewable energy consumption for its economic development. This problem can be addressed by depending more on renewable energy projects and increasing the amount of renewable energy in the total energy mix. On the other side, an increase in educational infrastructure also pushes the energy demand upward, which may deteriorate the environmental quality at the initial stages. Energy is an essential demand of educational institutions; therefore, a prudent energy policy is required to improve energy performance and efficiency in educational institutions. In this regard, an energy policy should mainly focus on evaluating energy performance and conservation, attaining decreases in fossil fuel expenses, investing more in renewable energy projects, and installing clean and green energy sources for educational institutions and projects.

Based on limitations, the study proposes some future directions. Firstly, this research is done only for China; similar research can be replicated for other developed and developing countries. Secondly, this study gauged the symmetric association between educational attainment and EKC; in the future, an asymmetric association between educational attainment and EKC can also be estimated. In the future, a similar study can be conducted for provincial and regional levels of China.