1 Introduction

Since the Industrial Revolution, although significant breakthroughs have been made in the development of human society, the deterioration of the Earth's ecological environment has also reached an unprecedented level. This is mainly reflected in two aspects: firstly, global climate change mainly caused by increased fossil fuel consumption (Anshasy et al., 2014; Khan et al., 2021; Rahman et al., 2022; Sarsa et al., 2021), and secondly, various damages to the natural environment caused by anthropogenic emissions of pollutants, such as marine (Yang et al., 2022; Alam et al., 2021; Dasgupta et al., 2022; Islam et al., 2022) and soil pollution (Shi et al., 2023; Zheng et al., 2023). According to IPCC predictions, the global average temperature is at least 1.5 ℃ higher than the pre-industrial benchmark (Allen et al., 2018). In order to achieve sustainable development of human society, it is necessary to strengthen environmental regulation (ER) to protect the environment. However, with the strengthening of ER, certain production activities will inevitably be negatively affected in the short term (Xie et al., 2023; Liu et al.,2022; Sheng et al., 2022; Qiang et al., 2021). Therefore, it is often necessary to balance environmental protection with economic development. So, are there any production activities that can still promote economic development under high-intensity ER? If so, what type of industry does it belong to, and why is it possible? It is necessary to find these productive activities that have the potential to achieve sustainable development of the economy.

In recent years, countries worldwide have continuously strengthened ER (Li et al., 2022; Perino et al., 2012; Morgan et al., 2023; Depren et al., 2022). The most significant milestones were the 1992 United Nations Framework Convention on Climate Change, the 1997 Kyoto Protocol, and the 2015 Paris Agreement. China has also taken on the burden of reducing pollution and carbon as a world power. In September 2013, China first launched the "Air Pollution Prevention and Control Action Plan" with the "Ten Measures for the Atmosphere", which set a target of reducing the concentration of inhalable particulate matter by 10%. Subsequently, in 2018, the "Three Year Action Plan for Winning the Blue Sky Defense War" and the "Implementation Regulations on Environmental Protection Tax Law" were released to implement environmental protection goals further and improve the environmental regulatory system. However, surprisingly, China has supported an annual economic growth rate of 6.5% with an average annual energy consumption growth rate of 3% over the past decade, and its energy intensity has decreased by 26.2% (See Fig. 1), making it one of the countries with the fastest reduction in energy intensity in the world. This has also led many scholars to study why China can achieve rapid economic development while maintaining high-intensity ER. It includes many aspects, such as the relationship between China's energy consumption and economic growth (Zhang et al., 2011), the impact of coal use on carbon dioxide emissions and economic growth (Kartal et al., 2023), the relationship between China's ER and economic growth (Lu et al., 2022; Ma et al., 2022), the impact of economic growth and energy consumption on environmental degradation in China (Nurgazina et al., 2022), the determinants of China's carrying capacity (Pata et al., 2021), the impact of reducing fossil energy intensity on China's ecological efficiency (Pata et al., 2021), the impact factors of China's carbon emissions (Liu et al., 2022), testing the effectiveness of the Environmental Kuznets Curve (EKC) hypothesis for China (Yılancı et al., 2020), and so on. and further focus on the production level (Ju et al., 2022; Zhang et al., 2018; Li & Tao, 2020; Yuan et al., 2017; Huang et al., 2022).

Fig. 1
figure 1

Energy intensity and economic development trends in China

There is no doubt that studying China's past can be beneficial in identifying productive activities with the potential for sustainable development. However, as far as the existing studies about production-level are concerned, the vast majority of them focus only on a single or a small number of specific industries. Even so far, there are still some industries that have not been studied. Given this, this study provides an idea to provide a unified comparative framework through the division of the three industries (2003) and use the total factor productivity (TFP) of the three industries as an entry point to explore in depth at the production level why China has been able to reconcile environmental protection with rapid economic development, and which industries contribute to this. Therefore, this study aims to search for production activities with the potential for sustainable development through a complete research framework and to analyze why they can achieve development under high-intensity ER and what other characteristics exist.

To sum up, this study uses the data of 31 provinces in the Chinese Mainland from 2011 to 2020, considering both static and dynamic panels, and using two-way fixed effects and system GMM and other models, analyzing the relationship between ER and TFP of the three industries to find production activities with the potential of sustainable development. Furthermore, based on existing literature, analyze the pathways and other characteristics of ER affecting the TFP of the three industries. Therefore, the potential contributions of this paper are: (1) Based on a complete analytical framework, the differences in the impact of ER on the three industries are analyzed, which facilitates the search for production activities with the potential of sustainable development. (2) Based on the connotation of generalized cost in the new institutional economics, the interaction between the production cost effect (PCE), compensation effect of innovation (CEI) and compensation effect of optimizing resource allocation (CEORA) caused by ER is considered, and the mechanism of ER on the TFP of three industries is clarified further. (3) The nonlinear effects of ER on TFP in three industries under the empowering effect of the DE are analyzed. (4) The differences in the effects of ER on the TFP of the three industries in different geographical locations are analyzed, which provides a reference for local governments to formulate policies further. In addition, the novelty of this study is that (1) compared with previous fragmented studies at the production level, all industries are included in the research framework based on the division of the three industries (2) Based on the impact of ER on PEA, a generalized production cost analysis method is provided.

The remainder of this paper is organized as follows. The second part is the literature review; the third part is the study design; the fourth part is the analysis of results, which includes benchmark analysis, mechanism analysis, threshold analysis and heterogeneity analysis; the fifth part is the discussion. The sixth part is the conclusion and recommendation.

2 Literature review and research objectives

The concept of ER was first introduced by Dasgupta (1980), who argued that ER is simply a push–pull effect of government policies (i.e., policies or coercive instruments developed by the government to balance economic development and ecological environment to reduce external diseconomies caused by pollutant emissions). Subsequently, Michael Porter (1990) found that the countries with the most competitive products in the world (Germany, Switzerland, Japan, and the United States) have not only good product quality but also rank high in environmental quality. As a result, Michael Porter came up with the famous "Porter Hypothesis" (1995), which suggests that appropriate ER can lead to more innovative activities. These innovative activities would increase firms' productivity, thus offsetting environmental protection costs and further increasing their profitability in the market (i.e., CEI). In the same year, Panayotou (1993) drew inspiration from the inverted U-shaped curve between per capita income and income inequality defined by Kuznets and proposed the Environmental Kuznets Curve Hypothesis (EKC). EKC indicates an inverted U-shaped relationship between environmental quality and economic development, which pioneered research on coordinating environmental protection and economic development. Subsequently, academic research on ER continued to deepen based on CEI in the Porter hypothesis. Shenggang Ren et al. (2018) combined their previous theoretical studies to classify ER into three categories: mandatory ER, market-based ER, and voluntary ER. In the same period, Peng Wenbin et al. (2017) classified ER into formal and informal ER based on the thinking of new institutional economics. It is precisely based on these studies that the continuous exploration of the connotation of ER enables many scholars to analyze the impact of ER on various production activities. As the research object of this study is China, Table 1 summarizes the relevant research on China.

Table 1 Relevant research on China

Previous research has focused chiefly on agriculture and industry and has rarely involved the tertiary industry. From the current situation, the impact of ER on green TFP in agriculture and industry is inverted U-shaped, which means that these two industries may have significant sustainable development potential. However, it is undeniable that most of the existing research is insufficient, and even some industries have not been studied. In order to find more production activities with sustainable development potential, it is necessary to locate their industry categories through a large analytical framework initially. This helps to reveal the impact of ER on various production activities on a larger scale, thus providing an essential basis for the government to formulate appropriate policies. Therefore, this study is expected to contribute to coordinating environmental protection and economic development.

3 Research design

This study investigates the differences in the effects of ER on TFP in three industries by estimating a static panel and a dynamic panel separately through a two-way fixed effects model and a systematic GMM model. This helps to search for production activities with sustainable development potential directly. Then, based on the existing literature and empirical results, analyze how ER indirectly affects TFP in the three industries by influencing public environmental attention (PEA) and foreign direct investment (FDI). In addition, based on the effects of DE development on PEA and FDI, a threshold model is developed to analyze the potential nonlinear effects of ER on the TFP of the three industries under different levels of DE. Finally, the differences in the effects of ER on the TFP of the three industries in different geographical locations are further analyzed.

3.1 Variable selection

3.1.1 Explained variable

The interpreted variable in this study is the TFP of the three industries, measured by the super-efficient SBM-DEA model (Tone, 1997). This model solves the problem of slackness of decision units, avoids the bias caused by direction and angle problems, and can compare the efficiency differences between frontier surfaces in more detail so that the efficiency value can be greater than one, which is more conducive to the subsequent analysis. The input variables selected in this paper use the number of employees and the fixed capital stock of each industry, and the output variables use the gross product of each industry. The fixed capital stock of each industry is calculated using the perpetual inventory method by combining the methods of Xu Xianxiang (2007), Zong Zhenli (2014), and Shan Haojie (2008). Equation (1) shows the super-efficient SBM-DEA model.

$$ \begin{gathered} \quad \quad \quad \quad \theta^{*} = \mathop {\min }\limits_{{\lambda ,s^{ - } ,s^{ + } }} \frac{{1 + \frac{1}{m}\sum\nolimits_{i = 1}^{m} {\frac{{s_{i}^{ - } }}{{x_{io}^{t} }}} }}{{1 - \frac{1}{q}\sum\nolimits_{r = 1}^{q} {\frac{{s_{r}^{ + } }}{{y_{ro}^{t} }}} }} \hfill \\ s.t.\;\left\{ \begin{gathered} x_{io}^{t} \ge \sum\limits_{t = 1}^{T} {\sum\limits_{j = 1,j \ne o}^{n} {\;\lambda_{j}^{t} x_{ij}^{t} - s_{i}^{ - } \quad \left( {i = 1,\;\;2,\;\; \ldots \;\;,\;\;m;} \right)} } \hfill \\ y_{ro}^{t} \le \sum\limits_{t = 1}^{T} {\sum\limits_{j = 1,j \ne o}^{n} {\lambda_{j}^{t} y_{rj}^{t} + s_{r}^{ + } \quad \left( {r = 1,\;\;2,\;\; \ldots \;\;,\;\;q;} \right)} } \hfill \\ \sum {\lambda_{j}^{t} = 1} ,\;\;\lambda_{j}^{t} \ge 0\left( {\forall j} \right),\;\;s_{i}^{ - } \ge 0\left( {\forall i} \right),\;\;s_{r}^{ + } \ge 0\left( {\forall r} \right) \hfill \\ \end{gathered} \right. \hfill \\ \end{gathered} $$
(1)

where \(\theta^{*}\) is the optimal solution of the model (i.e., the efficiency value); \(i\) represents the input variables; \(r\) represents the output variables; \(t\) represents the year; \(s_{i}^{ - }\) is the slack variables for the inputs; \(s_{r}^{ + }\) is the slack variables for the outputs; \(s.t.\) represents the constraints; \(x_{io}^{t}\) represents the input variables for the \(o\) th decision unit (here, the province) in year \(t\); \(y_{ro}^{t}\) represents the output variables for the \(o\) th decision unit in year \(t\); \(\lambda_{j}^{t}\) is the target parameter to be solved for; and \(\sum {\lambda_{j}^{t} = 1}\) represents the scale variable payoff. The variables required for the calculations and their meanings are shown in Fig. 2. Appendices 1, 2, and 3 provide the TFP trends of three industries in various provinces of China from 2011 to 2020.

Fig. 2
figure 2

Computational design framework for three industries' TFP

3.1.2 Explanatory variable

The explanatory variable in this study is the intensity of ER, which refers to the strength of the government in combating various types of environmental pollution. Most existing literature uses variables such as environmental governance investment and pollution emissions (Guo et al., 2022; Wang et al., 2023) to measure the level of ER. This can to some extent reflect the intensity of ER, but the connotation of ER should not be limited to a specific part. In addition, taking the investment in environmental governance as an example, it depends more on the local economic development level. Thus endogeneity issues will inevitably arise when making econometric estimates.

To avoid the above issues, this study draws on the approach of scholars such as Chen Shiyi (2018) and takes the work reports of provincial and municipal governments as the starting point. Through Python, crawls the vocabulary related to ER and calculates their proportion to reflect the intensity of ER. Because the government work report is an important manifestation of the goals and results of governing society, it can reflect the government's focus and achievements in a certain period in an all-around way. In order to more comprehensively reflect the connotation of ER in this variable, 48 relevant terms have been determined after careful consideration, as shown in Table 2.

Table 2 Vocabulary related to ER

To verify the rationality of the variables, the changes in word frequency kernel density between 2012 and 2014 were plotted, as shown in Fig. 3. As shown in the figure, the frequency of words has been increasing yearly, which is highly consistent with China's increasingly strict ER during this period. Therefore, choosing this variable to measure ER is reasonable.

Fig. 3
figure 3

Kernel density map of ER word frequencies

3.1.3 Mechanism variable

The mechanism variables selected in this study are PEA and FDI, measured by the Baidu Index and the proportion of FDI to GDP, respectively. Today, the internet can reflect public attention to various fields. Therefore, the data of search engines can comprehensively reflect the PEA. This study draws inspiration from the approach of Wu Libo et al. (2022), using "environmental pollution" as a search term to collect daily average data and sum it up to obtain annual index data.

3.1.4 Threshold variable

The threshold variable selected in this article is DE. Referring to the approach of Chuanyu Zhao et al. (2023), the development level of DE is measured using the level of internet development and the degree of financial inclusion. The specific indicators are shown in Table 3. Then, the principal component analysis method is used to condense all variables into one variable to represent the level of DE. The inspection results of the principal component analysis method are shown in Table 4.

Table 3 Indicators for measuring the level of DE
Table 4 Test results of the principal component analysis method

As shown in Table 4, the KMO value > 0.6, and Bartlett's test rejects the original hypothesis, indicating a correlation between the indicators and is suitable for principal component analysis.

3.1.5 Control variable

By referring to existing literature and combining actual situations, the following control variables were ultimately selected in this study:

  1. 1.

    Advanced level of human capital: Advanced level of human capital refers to the continuous improvement of a person's comprehensive literacy level. This study uses the ratio of the number of people in higher education to the total number of people to measure. There are many advantages to improving people's comprehensive literacy, such as innovation (Fonseca et al., 2019; Hu, 2021; Ma et al., 2019; Mariz-Pérez et al., 2012) and environmental protection (Pata et al., 2023). Generally speaking, having higher comprehensive qualities can enable them to master new skills faster and apply them to the production activities of various industries, thereby effectively improving the TFP of each industry.

  2. 2.

    Level of factor market development: Factor market, i.e., the market of production factors, is one of the main conditions for playing a role in allocating market resources. On the one hand, the better developed the factor market is, the better it is for the industries to procure factors in their production activities and thus effectively maintain the production patterns of each enterprise and avoid losing unnecessary additional adjustment costs (Qiao et al., 2021). Eventually, the TFP of each industry is indirectly improved. This study uses the factor market development score from the China Marketization Index database to measure each province's factor market development level.

  3. 3.

    Level of investment in fixed assets: Investment in fixed assets is the cost of building the items necessary for productive activities. More and better investment in fixed assets improves the production environment of each industry, which in turn increases the TFP of each industry (Liu et al., 2010). This study uses the ratio of fixed asset investment in each industry to GDP to measure each industry's fixed asset investment level.

  4. 4.

    Financial efficiency: Financial efficiency is the efficiency with which funds are transferred from the redundant to the demander of funds. Improving financial efficiency can effectively reduce the cash flow pressure of business operations, which can maintain the survival of enterprises, provide security for their production activities, and ultimately improve the TFP of industries (Liu et al., 2021). This study selects the ratio of bank loans to deposits in each province to measure financial efficiency. The larger the value of this variable, the more efficient the bank's working capital is.

  5. 5.

    Level of technological progress: The impact of technological progress on TFP is self-evident (Si & Wang, 2011). However, technological progress is often difficult to measure, and previous studies usually use R&D inputs to measure technological progress. However, the level of progress from the input perspective to represent the output is more or less inconsistent with reality. Therefore, this paper uses the number of granted invention patents to measure the level of technological progress in each province in the current period. The granted patents indicate that the technology has tended to be complete, while the invention patents are more representative of improving technology levels and technological breakthroughs.

3.2 Research methodology

3.2.1 Baseline regression model

To explore the impact of ER on TFP in three industries, this study uses static and dynamic panels for analysis. On the one hand, production patterns are difficult to change in a short period, and there is path dependence or pattern inertia, such as the adjustment of capital stock. Therefore, the production efficiency of each industry will be influenced by the past in a certain period in the future; on the other hand, certain factors have not only a current impact on production efficiency but also may have a lagged impact. For example, new technologies often play a minor role in the initial input stage. They have to go through a certain period of application improvement before significantly improving each industry's production efficiency. Therefore, it is necessary to consider the dynamic relationship between variables to simulate the actual situation for modeling better. To this end, the following static and dynamic panel models (i.e., using two-way fixed effects estimation and system GMM estimation models (Blundell et al., 1998)) are constructed as shown in Eqs. (2) and (3), respectively.

$$ Tfp_{ijc} = \alpha + \beta_{1} Er_{ij} + \sum\limits_{p}^{5} {\beta^{\prime}_{p} Control_{ijp} } + \mu_{i} + \lambda_{j} + \varepsilon_{ij} $$
(2)
$$ Tfp_{ijc} = \alpha + \beta_{2} Er_{ij} + \beta_{3} Tfp_{ijc}^{t - 1} + \sum\limits_{p}^{5} {\beta^{\prime}_{p} Control_{ijp} } + \mu_{i} + \lambda_{j} + \varepsilon_{ij} $$
(3)

Further considering the lag effect of new technologies on the dynamic model (controlling the fluctuations caused by technological progress variables lagging for one and two periods), Eq. (3) is converted to Eq. (4).

$$ Tfp_{ijc} = \alpha + \beta_{2} Er_{ij} + \beta_{3} Tfp_{ijc}^{t - 1} + \sum\limits_{p}^{5} {\beta^{\prime}_{p} Control_{ijp} } + \beta_{4} Tp_{ij}^{t - 1} + \beta_{5} Tp_{ij}^{t - 2} + \mu_{i} + \lambda_{j} + \varepsilon_{ij} $$
(4)

where \(i\) denotes the time label; \(j\) denotes the province label; \(c\) is the industry label, with values of 1, 2 and 3 corresponding to the primary, secondary and tertiary industries, respectively. \(p\) is the control variable label, with different values (1, 2, 3, 4, and 5) representing different control variables. \(Tfp_{ijc}\) denotes the industry TFP; \(Tfp_{ijc}^{t - 1}\) denotes the industry TFP with one lag; \(Control_{ijp}\) denotes the set of control variables; \(Tp_{ij}^{t - 1}\) and \(Tp_{ij}^{t - 2}\) denote the lagged one-period and \(\beta_{1}\), \(\beta_{2}\), \(\beta_{3}\), \(\beta_{4}\), and \(\beta_{5}\) are all coefficients to be estimated; \(\beta^{\prime}_{p}\) denotes the set of coefficients to be estimated for each control variable; \(\mu_{i}\) and \(\lambda_{j}\) denote time and province fixed effects, respectively; and \(\varepsilon_{ij}\) denotes the random disturbance term.

3.2.2 Threshold model

In order to explore the differences in the impact of ER on the TFP of three industries under different levels of DE, a threshold regression model (Seo et al., 2016) was established, taking into account both static and dynamic panels respectively. The estimation models for the three thresholds are shown in Eqs. (5) and (6). In this section, determine whether there is a threshold based on the threshold test results, then perform threshold regression.

$$ Tfp_{ijc} = \left\{ \begin{gathered} \alpha + \beta^{\prime\prime}_{1} Er_{ij} + \sum\limits_{p}^{5} {\beta^{\prime}_{p} Control_{ijp} } + \mu_{i} + \lambda_{j} + \varepsilon_{ij} ,\quad {\text{Dlde}}_{ij} \le \gamma_{1} \hfill \\ \alpha + \beta^{\prime\prime}_{2} Er_{ij} + \sum\limits_{p}^{5} {\beta^{\prime}_{p} Control_{ijp} } + \mu_{i} + \lambda_{j} + \varepsilon_{ij} ,\quad \gamma_{1} {\text{ < Dlde}}_{ij} \le \gamma_{2} \hfill \\ \alpha + \beta^{\prime\prime}_{3} Er_{ij} + \sum\limits_{p}^{5} {\beta^{\prime}_{p} Control_{ijp} } + \mu_{i} + \lambda_{j} + \varepsilon_{ij} ,\quad \gamma_{2} {\text{ < Dlde}}_{ij} \le \gamma_{3} \hfill \\ \alpha + \beta^{\prime\prime}_{4} Er_{ij} + \sum\limits_{p}^{5} {\beta^{\prime}_{p} Control_{ijp} } + \mu_{i} + \lambda_{j} + \varepsilon_{ij} ,\quad \gamma_{3} {\text{ < Dlde}}_{ij} \hfill \\ \end{gathered} \right. $$
(5)
$$ Tfp_{ijc} = \left\{ \begin{gathered} \alpha + \beta^{\prime\prime}_{1} Er_{ij} + \beta_{3} Tfp_{ijc}^{t - 1} + \sum\limits_{p}^{5} {\beta^{\prime}_{p} Control_{ijp} } + \beta_{4} Tp_{ij}^{t - 1} + \beta_{5} Tp_{ij}^{t - 2} + \mu_{i} + \lambda_{j} + \varepsilon_{ij} ,\quad {\text{Dlde}}_{ij} \le \gamma_{1} \hfill \\ \alpha + \beta^{\prime\prime}_{2} Er_{ij} + \beta_{3} Tfp_{ijc}^{t - 1} + \sum\limits_{p}^{5} {\beta^{\prime}_{p} Control_{ijp} } + \beta_{4} Tp_{ij}^{t - 1} + \beta_{5} Tp_{ij}^{t - 2} + \mu_{i} + \lambda_{j} + \varepsilon_{ij} ,\quad \gamma_{1} {\text{ < Dlde}}_{ij} \le \gamma_{2} \hfill \\ \alpha + \beta^{\prime\prime}_{3} Er_{ij} + \beta_{3} Tfp_{ijc}^{t - 1} + \sum\limits_{p}^{5} {\beta^{\prime}_{p} Control_{ijp} } + \beta_{4} Tp_{ij}^{t - 1} + \beta_{5} Tp_{ij}^{t - 2} + \mu_{i} + \lambda_{j} + \varepsilon_{ij} ,\quad \gamma_{2} {\text{ < Dlde}}_{ij} \le \gamma_{3} \hfill \\ \alpha + \beta^{\prime\prime}_{4} Er_{ij} + \beta_{3} Tfp_{ijc}^{t - 1} + \sum\limits_{p}^{5} {\beta^{\prime}_{p} Control_{ijp} } + \beta_{4} Tp_{ij}^{t - 1} + \beta_{5} Tp_{ij}^{t - 2} + \mu_{i} + \lambda_{j} + \varepsilon_{ij} ,\quad \gamma_{3} {\text{ < Dlde}}_{ij} \hfill \\ \end{gathered} \right. $$
(6)

where \(\beta^{\prime\prime}_{1}\), \(\beta^{\prime\prime}_{2}\), \(\beta^{\prime\prime}_{3}\) and \(\beta^{\prime\prime}_{4}\) denote the estimated coefficients of the threshold variables at different regions; \(\gamma_{1}\), \(\gamma_{2}\) and \(\gamma_{3}\) denote different threshold values; \({\text{Dlde}}_{ij}\) denotes the level of digital economic development; and the rest of the symbols are the same as in Eq. (4).

3.3 Sample and data

In this paper, 31 provinces in mainland China from 2011 to 2020 are selected as the research sample, the data used are obtained from various Chinese statistical yearbooks, and a few are obtained through China WIND, EPS and other databases. For the data on fixed asset capital stock, the data on fixed asset investment by industry from 2010 to 2016 are obtained from the China Statistical Yearbook of Fixed Asset Investment 2011–2017. The data for 2013 are missing and found in the China Statistical Yearbook 2014. The data on fixed asset investment by industry in 2017 were obtained from the China Statistical Yearbook 2018. The data on fixed asset investment for 2018–2020 are missing, so the investment data are calculated using the "growth of fixed asset investment by industry by region over the previous year" in the China Statistical Yearbook. The 2021 Statistical Yearbook no longer provides the price index for fixed asset investment, so the consumer price index (CPI) is used instead. The word frequency of ER is obtained from each province and city's local government work reports by summing the total number of words in each province and calculating the percentage. The factor market development scores are obtained from the China Marketization Index database, and the financial inclusion index is obtained from the Digital Finance Center of Peking University. In addition, some missing data are filled by linear interpolation. More importantly, the Jarque Bera statistics of many variables in this study are large, which means that the original hypothesis of the normal distribution is rejected. Therefore, this study takes logarithm or co-multiplication for non-ratio variables, which is significant for stabilizing variance and eliminating nonlinear function forms, thus improving Goodness of fit. The definitions and descriptive statistics for each variable are shown in Table 5.

Table 5 Definition and descriptive statistics of each variable

4 Analysis of result

4.1 Baseline regression analysis

The results of the static panel model (i.e., the model estimated using two-way fixed effects) are presented in Table 6, where items (1), (3), and (5) represent the estimated results of the impact of ER on TFP in the three industries when no fixed effects are added, respectively, and items (2), (4), and (6) represent the estimated results of the impact of ER on TFP in the three industries when fixed effects are added, respectively.

Table 6 Estimation results of the static panel model

From Table 6, without controlling variables, ER significantly affects TFP in both primary and tertiary industries, except for the secondary industry. After adding the control variables, the coefficients of ER on TFP in the primary industry are significant at the 1% level and −0.05; on TFP in the secondary industry, are significant at the 5% level and 0.016; and on TFP in the tertiary industry are significant at the 1% level and −0.019. This indicates that ER can promote TFP in the secondary sector but also inhibits TFP in the primary and tertiary sectors. In addition, ER has a more significant inhibitory effect on the primary industry than on the tertiary industry. This is different from the subconscious result, which tends to assume that ER has some negative impact on the TFP of the secondary sector, so we consider the dynamic panel scenario immediately afterward.

The results of the dynamic panel model (i.e., the model estimated using the system GMM) are shown in Table 7. This study estimates the \(t\)- statistic instead of the \(z\)-statistic. The \(F\)-test is used instead of the Wald-cardinal test for the overall model fit, which improves the estimation accuracy when the sample is small. Among them, items (1), (4), and (7) represent the estimated results of the impact of ER on the TFP of each industry without adding control variables, respectively; items (2), (5), and (8) represent the estimated results of ER on the TFP of each industry when controlling variables are added, respectively; items (3), (6), and (9) represent the estimated results of the impact of ER on the TFP of each industry after incorporating control variables and the lagging impact of technological progress, respectively. L. and L.2 represent variables with one and two lag periods, respectively.

Table 7 Estimation results of dynamic panel model

From Table 7, the second-order autocorrelation test (i.e., AR(2)-test-p) of the difference of the disturbance terms of each model has a p value greater than 0.1, indicating that each model does not reject the original hypothesis of "no second-order autocorrelation of the difference of the disturbance terms," so each model satisfies the autocorrelation test of the disturbance terms of the system GMM model, i.e., the disturbance terms are not autocorrelated. In the over-identification test for instrumental variables (i.e., Hansen-test-p), the p values of all models are more significant than 0.1, indicating that the original hypothesis of "all instrumental variables are valid" is not rejected. Furthermore, the p values are less than 0.25, indicating that all models are not invalidated due to the excessive selection of instrumental variables. So the probability of all models being effective is at least 90%.

More importantly, the first-order lag term of the dependent variable is significant, at least at the 1% level, indicating that considering the inertia of production activities is reasonable. In addition, the estimated coefficients of the bidirectional fixed effects model are generally underestimated, while the absolute value of the estimated coefficients of the system GMM model is significantly greater than that of the bidirectional fixed effects model. Therefore, the rationality of the model was indirectly verified.

Most importantly, the estimation results of dynamic and static panels are consistent. Specifically, all the core explanatory variables are significant, at least at the 5% level, indicating that ER has some impact on TFP in all three industries. Among them, ER positively impacts the TFP of the secondary industry in all models; In contrast, ER negatively impacts the TFP of both the primary and tertiary industries. Furthermore, ER has a more significant negative impact on the primary industry than on the tertiary industry.

4.2 Mechanism analysis

This section analyzes the impact paths of ER on the TFP of the three industries from two aspects: PEA and FDI.

Specifically, this study first demonstrates the impact of ER on PEA and FDI through empirical analysis in the first stage and then uses theoretical analysis in the second stage to demonstrate how PEA and FDI further affect TFP in the three industries in the context of the actual situation. The empirical results of the mechanism analysis are presented in Table 8. The model is consistent with the baseline regression, except that the explained variables are replaced with PEA and FDI. Items (1) and (2) represent the static panel estimation results of ER on PEA; items (3), (4), and (5) represent the dynamic panel estimation results of ER on PEA. Besides, items (6) and (7) represent the static panel estimation results of the FDI by ER; items (8), (9), and 10) represent the dynamic panel estimation results of the FDI by ER.

Table 8 Empirical results of the first stage of mechanism analysis

From Table 8, in the first stage, all models passed the autocorrelation tests of the nuisance term and over-identification tests of the system GMM. The estimation results show that all models are significantly positive, at least at the 5% level, when PEA is the explained variable, and all models are significantly negative at the 5% level when FDI is the explained variable, except for the static panel model with no control variables. Therefore, the results indicate that ER can increase PEA and reduce FDI in China.

Subsequently, this study will proceed with the second stage of argumentation based on the actual situation. Previous studies have mainly focused on the penalty costs incurred by enterprises in discharging pollutants, making it easy to overlook the negative impact of ER on the primary and tertiary industries. This study is based on the connotation of generalized cost in new institutional economics, incorporating mental cost to comprehensively reveal PCE caused by ER. Firstly, the government's strengthening of ER has increased PEA, which has been proven in the first stage. Therefore, workers will pay more attention to environmental behaviors in production activities (because they will be subject to self-moral constraints or other punishments). Then, these other environmental behaviors will increase the enterprise's cost of production (Because workers consume mental and physical costs in other environmental behaviors). Finally, ER lead to a decrease in the TFP of various industries. However, due to CEI generated by the secondary industry, ER has actually increased the TFP of the secondary industry.

On the other hand, the government has strengthened ER, thus reducing FDI, as demonstrated in the first phase. This is because intense ER has reduced the profitability of pollution-intensive firms that rely on external diseconomies, reducing FDI quickly. However, due to transfer costs, these reduced investments will continue to remain in China. Finally, these stray investments will be placed in under-resourced areas to create more value. So, in addition to PCE, high-strength ER also generates CEORA, which is conducive to improving the TFP of various industries.

Based on the above analysis, this study believes that the results presented by benchmark regression are a comprehensive effect of PCE, CEI, and CEORA. Due to the need for more time to fully generate CEORA, there was only CEI generated by the secondary industry during this period. Therefore, the total compensation effect cannot compensate for PCE for the primary and tertiary industries, making ER's impact on the primary and tertiary industries negative during this period. In contrast, the compensation effect generated by the secondary industry is greater than PCE, making ER' impact on the secondary industry positive during this period.

4.3 Threshold analysis

After analyzing the impact mechanism, this study further considers the potential nonlinear impact of ER on the TFP of various industries. Since the DE can remove information barriers and thus influence PEA and FDI, it is necessary to investigate further whether ER has a nonlinear effect on TFP in the three industries at different levels of DE.

In this section, this study first verifies the impact of DE on PEA and FDI through the model of benchmark regression. Then it uses threshold regression to analyze whether ER has a nonlinear impact on the TFP of the three industries. The estimated results of the impact of DE on the mechanism variables are shown in Table 9. Among them, Items (1), (2), (3), and (4) represent the estimated results of the impact of DE on PEA; Items (5), (6), (7), and (8) represent the estimated results of DE on FDI.

Table 9 Results of estimation of threshold variables on mechanism variables

From Table 9, it can be seen that the GMM model of the system has passed the autocorrelation test and over-identification test of the disturbance term. More importantly, the estimated results of all models are significantly positive at the level of at least 5%, indicating that developing DE can increase PEA and FDI. Therefore, considering the impact of DE on mechanism variables, this study continues to analyze the nonlinear impact of ER on the TFP of the three industries. Specifically, we first identify whether there is a threshold when DE is used as a threshold variable and then conduct a threshold regression to analyze the nonlinear effects of ER on TFP in three industries. The recognition results of the threshold are shown in Table 10.

Table 10 Identification results of threshold effects

From Table 10, it can be seen that when DE is used as a threshold variable, the impact of ER on the TFP of the primary industry does not have a threshold. In addition, since there is no threshold for the impact of ER on the TFP of the tertiary industry in the dynamic panel, it can be considered that ER has no non-linear impact on the TFP of the primary and tertiary industries. In contrast, the impact of ER on the TFP of the secondary industry has two thresholds at the 10% level. Appendices 4, 5, 6, and 7 provide visualizations of thresholds. Specifically, there are two thresholds when DE is approximately 1 and 2.4. Therefore, the non-linear effects of ER on the TFP of the secondary industry under different levels of DE are analyzed next, and the results are presented in Table 11. Among them, items (1) and (2) represent the estimated results of the static panel, while items (3), (4), and (5) represent the estimated results of the dynamic panel.

Table 11 Estimation results of the threshold regression

From Table 11, it can be seen that when only considering significant estimation results, regardless of whether the control variable is controlled or not, the estimation results of the static panel model before and after each threshold are significantly positive at the level of at least 5%, and the estimation coefficient continuously increases with the appearance of the threshold. Specifically, (1) the estimated coefficient of the item changes from the initial 0.155 to 0.520, and (2) the estimated coefficient of the item changes from the initial 0.122 to 0.247 and then to 0.550. In addition, the estimation results of the dynamic panel model also exhibit the same trend as the static panel. Therefore, developing DE can significantly amplify the positive impact of ER on the TFP of the secondary industry. At the same time, this indirectly confirms the correctness of previous scholars' findings that developing DE can promote economic development (Guo et al., 2023; Myovella et al., 2020). Because developing DE can amplify CEI generated by the secondary industry, thereby ensuring economic development.

4.4 Impact heterogeneity analysis

In addition to human factors, natural factors such as geographical location often potentially impact the development of different regions. Therefore, it is necessary to explore the heterogeneity of the effects of ER on the TFP of the three industries in different regions. The results of the heterogeneity analysis are shown in Table 12. Considering that the sample size of the heterogeneity analysis is too small and the number of provinces size is too close to the length of the time series (i.e., N≈T), it is not appropriate to use either the system GMM or the bias-corrected LSDV method to estimate the dynamic panel at this time, so only the two-way fixed effects model is used in this section to analyze the static panel.

Table 12 Estimated results of regional impact heterogeneity of ER

Table 12 shows that the estimated TFP of the primary industry in the eastern region by ER is significantly −0.310 at the 10% level, while the estimated TFP of the primary industry in the central region by ER is significantly 0.382 at the 5% level. However, the estimated TFP of the primary industry in the western region by ER is insignificant. This indicates that ER has a negative impact on the TFP of the primary industry in the eastern region but a positive impact on the TFP of the primary industry in the central region. In addition, the estimated results of ER on the TFP of the secondary industry in all geographical locations are significantly positive at the level of at least 5%, and the magnitude of the estimation coefficient indicates that the western region is greater than the eastern region and ≈ the central region. This indicates that ER can significantly improve the TFP of the secondary industry, and its positive impact on the western region is more significant than that of the central and eastern regions. Finally, the estimated results of ER on the TFP of the tertiary industry in each region are insignificant, which may be related to the small sample size.

5 Discussion

The complete framework of results can be inferred from the above (see Fig. 4). Previous studies have mainly used CEI to explain the shift in the impact of ER on TFP in some industries. This study additionally proposes CEORA and PCE to comprehensively analyze the impact and mechanism of ER on TFP in three industries and further consider the enabling role of DE. It is worth mentioning that this study only uses one stage of empirical analysis in the mechanism analysis section because there are no variables that can reflect the pure value of a single effect (for example, CEORA should have been reflected by investments that cause external diseconomies).

Fig. 4
figure 4

The complete framework of results

The regression results for this period show a negative impact of ER on FDI, which implies that CEORA is not yet fully formed (only when the impact is U-shaped or positive can it reflect the formation of CEORA since the transfer of investment is a process that decreases and then increases). Whereas strengthening ER will undoubtedly boost PEA, which in turn will generate more PCE. Therefore, CEORA in the primary and tertiary industries cannot compensate for PCE in that period (i.e., the impact of ER on TFP is negative). However, the secondary industry is forced to generate CEI through technological innovation due to urgent survival issues, making the secondary industry's CEI plus CEORA greater than PCE (i.e., ER positively affects TFP). In addition, the development of DE promotes both CEORA, CEI and PCE. From the regression results, the promotion effect of DE on CEORA and CEI is significantly greater than that of PCE. This is because the higher the degree of DE development, the greater the positive impact of ER on TFP in the secondary industry. Although in the short term, ER can only enhance the TFP of the secondary industry. However, in the long run, ER undoubtedly can reduce external diseconomies and thus improve the overall TFP of society (as CEORA will continue to increase to the point where it eventually surpasses PCE). On the other hand, the positive impact of ER on the TFP of the primary industry in the eastern region confirms this trend. This is reflected in the fact that the CEORA caused by ER to the primary industry has gradually started to exceed the PCE. (Figs. 5, 6, 7, 8, 9, 10 and 11)

6 Conclusions and recommendations

This study uses the data of 31 provinces in the Chinese Mainland from 2011 to 2020, considering both static and dynamic panels, and using two-way fixed effects and system GMM and other models, analyzing the relationship between ER and TFP of the three industries to find the production activities with the potential of sustainable development. The results indicate that ER positively impacts the TFP of the secondary industry. However, ER also hurt the TFP of the primary and tertiary industries, and the negative impact on the primary industry is more significant than that on the tertiary industry. What is more, this results from the comprehensive effect of PCE, CEI, and CEORA caused by ER. In addition, the higher the level of development of DE, the more significant the positive impact of ER on the TFP of the secondary industry. Most importantly, the impact of ER on the TFP of the primary and secondary industries exhibits significant regional heterogeneity.

The results of this study provide important insights for policymakers to coordinate environmental protection and economic development. From the research results, there is no doubt that the core reason why China has maintained strong ER over the past decade while still achieving rapid economic development is the improvement of TFP in the secondary industry. Furthermore, this also benefits from the development of DE. Therefore, industries in the secondary industry have tremendous potential for sustainable development, and the government should pay more attention to these industries. Specifically, the government should further strengthen the secondary industry's environmental supervision and law enforcement and improve environmental protection regulations and standards. Of course, green transformation must be gradual. Otherwise, it will directly affect the survival of these enterprises. Secondly, the government should continue encouraging enterprises to produce green products, provide environmental services, and promote green technologies and low-carbon equipment. The most important thing is that the government should provide more subsidies and investment for low-carbon and renewable energy industries, as these industries are the focus of future development.

In addition, over the past decade, ER have hurt the TFP of both the primary and tertiary industries due to the compensation effect being smaller than PCE. In order to reverse this situation, it is necessary to reduce cost effects and accelerate the generation of compensation effects. Specifically, ER increase PEA, resulting in additional production costs (including mental and physical costs) for various industries. Therefore, the government and industry leaders must establish a unified green production model and regulate environmental behaviour. This helps to reduce the uncertainty caused by ER and improve the work efficiency of workers. In terms of CEI caused by ER affecting the transfer of FDI, the government should guide investment through targeted support policies. This is beneficial for reducing information asymmetry during investment and accelerating resource allocation optimisation. More importantly, enterprises should fully use the advantages of DE to eliminate information barriers through Digital transformation. Finally, in response to the regional heterogeneity of the impact of ER on the TFP of the three industries, local governments should formulate policies tailored to local conditions instead of blindly following suit. Because natural conditions such as geographical location can lead to potential differences among different regions, it is advisable to find commonalities and explore development paths that are suitable for the local area.

In summary, this study identified the main driving force behind China's green development over the past decade, which directly contributes to adjusting the focus of production activities in the future to achieve a balance between the environment and the economy. However, this study still has some limitations. Firstly, due to data sample limitations, this study did not analyze the potential U-shaped impact of ER on the TFP of the three industries over a long period. From previous literature, the primary and secondary industries are most likely to have this characteristic. Secondly, this study did not explore the spatial spillover effect of ER on the total factor productivity of the three industries. More importantly, this study mainly focuses on China. Therefore, it is recommended to enrich research in this area in the future.