Keywords

10.1 Introduction

Sustainable development is the development that meets the needs of the present generation without compromising the ability of future generations to meet their own needs, which is defined by the Brundtland commission in 1987 (Brundtland et al. 1987). Here, we define sustainability risks as the factors that can pose risks to achieve a better and more sustainable future of a given system (e.g., an economy, a corporation, and a community). These sustainability risks can be related to economy, environment, and society. Resource-exhausted cities indicate the cities whose natural resources are exhausting and are in a later, late, or even end stages after continuous exploitation, and whose accumulated exploitation reserves have reached more than 70% of the recoverable reserves. China is not the only country that has resource-exhausted cities, in fact, these types of cities exist in many countries around the world. According to the National Development and Reform Commission of China, 69 resource-exhausted cities have been identified in this country (The Chinese Government 2011). These cities may encounter many sustainability risks caused by the resource curse, which have attracted great attention from many scholars and policy-makers (The Chinese Government 2008; Zhang et al. 2018).

The term ‘curse of natural resource’ is proposed by Auty et al. (1998) denoting the paradoxical situation that the development of resource-rich regions falls behind those with relatively poor natural resources. Some studies have found that the resource curse happens in China, especially in the central and western regions (Shao and Qi 2009; Zhang and Brouwer 2020). Shao and Qi (2009) suggested that there was a negative relationship between economic development and energy exploitation in Western China since the 1990s. The sustainability risks of resource-exhausted cities can be closely related to the negative impacts exerted by the resource curse. The transmission effects of the resource curse can be categorized into three main mechanisms (Shao et al. 2020; Szalai 2018), that is Dutch disease, crowding-out effect, and institutional weakening effect.

With China’s economic restructuring and fundamental change of supply–demand in the resources market, resource-exhausted cities of China are faced with a series of dilemmas regarding the economy, society, and environment (Dong et al. 2007; He et al. 2017; Li et al. 2020). The dilemmas include slowing economic development, difficulty in economic transformation, sluggish in fostering new growth points, rising unemployment, insufficient innovation capacity, institutional issues, and deterioration of environmental and ecological systems. However, the sustainability risks of resource-exhausted cities are still unclear, even though numerous studies have examined these risks for Chinese resource-based cities (Lu et al. 2016; Qin et al. 2019).

To fill this research gap, the study constructs a framework covering economic, social, and environmental dimensions for the sustainability evaluation of resource-exhausted cities in China. The sustainability risks of resource-exhausted cities are multidimensional and complicated. The principal component analysis (PCA) is capable of summarizing the information of large dimensions to smaller dimensions while retaining the data information to a maximum degree (Liou et al. 2004; Vega et al. 1998). As such, the study uses the PCA method to evaluate the sustainability risks of resource-exhausted cities in China from 2005 to 2016. The results can be a reference to take stock of where resource-exhausted cities stand in terms of sustainability, identify the potential risks, and further promote sustainable development.

10.2 Methodology and Data

10.2.1 The Principal Component Analysis

The PCA conducts dimension reduction by discarding the highly correlated data information and generates irrelevant components. Thus, PCA can be regarded as one of the most widely adopted statistical tools to reduce dimensions and improve working efficiency for the dataset with many indicators. Considering these advantages of PCA, it has been widely applied in many study areas for performance evaluation and ranking (Omrani et al. 2019; Zhu 1998). In this study, PCA is used to extract information from the dataset with 14 indicators and reduce the dimensions to 7. The procedure of PCA used to evaluate the sustainability risk is provided and some steps are based on Zhu (1998).

Step 1 (Constructing the origin matrix \(X{\text{ = }}\left[ {x_{{ij}} } \right]_{{n \times p}}\)). Suppose the dataset has \(n\) decision-making unit (DMU) and \(p\) indicators. The origin matrix can be constructed as follows.

$$X = \left[ {\begin{array}{*{20}c} {x_{{11}} } & {x_{{12}} } & \cdots & {x_{{1p}} } \\ {x_{{21}} } & {x_{{22}} } & \cdots & {x_{{2p}} } \\ \vdots & \vdots & \ddots & \vdots \\ {x_{{n1}} } & {x_{{n2}} } & \cdots & {x_{{np}} } \\ \end{array} } \right]$$
(10.1)

where \(x_{{ij}}\) is the value of indicator \(j\) under DMU \(i\).

Step 2 (Transform all indicators to positive dimension indicator and obtain \(Y = \left[ {y_{{ij}} } \right]_{{n \times p}}\)). The indicators in negative dimension mean the increase of the value of these indicators could deteriorate the performance of DMUs. For consistency, we need to transform the indicators in negative dimension to positive dimension, as follows.

$$y_{{ij}} = \left\{ {\begin{array}{*{20}c} {x_{{ij}},\;\;{\text{for}}\;{\text{indicators}}\;{\text{in}}\;{\text{positive}}\;{\text{dimension}}\;} \\ { - x_{{ij}}, \;\;{\text{for}}\;{\text{indicators}}\;{\text{in}}\;{\text{negative}}\;{\text{dimension}}} \\ \end{array} } \right.$$
(10.2)

Step 3 (Standardize the matrix \(Y\) and obtain standardized matrix \(Z{\text{ = }}\left[ {z_{{ij}} } \right]_{{n \times p}}\)). Without loss of generality, all the variables should be standardized to ensure each of them has sample mean 0 and variance of 1. The standardization formula is as follows.

$$z_{{ij}} = \frac{{y_{{ij}} - \overline{y} _{j} }}{{s_{j} }}\;\;\;\;i = 1,2,...,n\;\;\;and\;\;\;j = 1,2,...,p$$
(10.3)
$$Z{\text{ = }}\left[ {z_{{ij}} } \right]_{{n \times p}} = \left[ {\begin{array}{*{20}c} {z_{{11}} } & {z_{{12}} } & \cdots & {z_{{1p}} } \\ {z_{{21}} } & {z_{{22}} } & \cdots & {z_{{2p}} } \\ \vdots & \vdots & \ddots & \vdots \\ {z_{{n1}} } & {z_{{n2}} } & \cdots & {z_{{np}} } \\ \end{array} } \right]$$
(10.4)

\(\overline{y}\) and \(s_{j}\) indicate the average value \(\left( {\frac{{\sum\limits_{{i = 1}}^{n} {y_{{ij}} } }}{n}} \right)\) and standard deviation \(\left( {\sqrt {\frac{{\sum\limits_{{i = 1}}^{n} {\left( {y_{{ij}} - \overline{y} _{j} } \right)^{2} } }}{{n - 1}}} } \right)\) of indicator \(j\).

Step 4 (Construct the covariance matrix \(R = \left[ {r_{{ij}} } \right]_{{p \times p}}\)). The covariance matrix among the \(p\) indicators can be expressed as follows:

$$R = \left[ {r_{{ij}} } \right]_{{p \times p}} {\text{ = }}\frac{{Z^{T} Z}}{{n - 1}} = \left[ {\begin{array}{*{20}c} {\frac{1}{{n - 1}}\sum\limits_{{i = 1}}^{p} {\left( {z_{{i1}} } \right)^{2} } } & {\frac{1}{{n - 1}}\sum\limits_{{i = 1}}^{p} {z_{{i1}} } z_{{i2}} } & \cdots & {\frac{1}{{n - 1}}\sum\limits_{{i = 1}}^{p} {z_{{i1}} } z_{{ip}} } \\ {\frac{1}{{n - 1}}\sum\limits_{{i = 1}}^{p} {z_{{i2}} } z_{{i1}} } & {\frac{1}{{n - 1}}\sum\limits_{{i = 1}}^{p} {\left( {z_{{i2}} } \right)^{2} } } & \cdots & {\frac{1}{{n - 1}}\sum\limits_{{i = 1}}^{p} {z_{{i2}} } z_{{ip}} } \\ \vdots & \vdots & \ddots & \vdots \\ {\frac{1}{{n - 1}}\sum\limits_{{i = 1}}^{p} {z_{{ip}} } z_{{i1}} } & {\frac{1}{{n - 1}}\sum\limits_{{i = 1}}^{p} {z_{{ip}} } z_{{i2}} } & \cdots & {\frac{1}{{n - 1}}\sum\limits_{{i = 1}}^{p} {\left( {z_{{ip}} } \right)^{2} } } \\ \end{array} } \right]$$
(10.5)

Matrix \(Z^{T}\) is the transpose of the matrix \(Z\). The diagonal elements of the matrix \(Z\) indicate the variance of a specific indicator, while other elements mean the covariance between two indicators. If the covariance between indicators is high, this dataset is more suitable for conducting PCA. To be noticed, the covariance matrix \(Z\) is equal to its correlation matrix using the Pearson correlation method since the matrix \(Z\) has been standardized in  step 3. To be specific, the formula of correlation between indicator \(a\) and indicator \(b\) is \(\rho _{{a,b}} = \frac{{COV(a,b)}}{{\sigma _{a} \sigma _{b} }}\) where \(\sigma _{a} = 1\) and \(\sigma _{b} = 1\). Therefore, \(\rho _{{a,b}} = \frac{{COV(a,b)}}{{\sigma _{a} \sigma _{b} }} = COV(a,b)\).

Step 5 (Calculate the eigenvalue (\(\lambda _{k}\)) and eigenvector (\(\vec{b}_{k}\))). \(k\) indicates the newly constructed components. The number of the components is the same as the number of indicator \(j\). The eigenvalue of the component \(k\) is denoted as \(\lambda _{k}\), while the eigenvector is denoted as \(b_{k}\). According to the properties of the matrix, the number of eigenvalues of a matrix is the same as its order. The eigenvalues can be the same (the multiple roots). As such, we can obtain \(p\) eigenvalues and \(\lambda _{1} \ge \lambda _{1} \ge \cdots \ge \lambda _{p} \ge 0\), the calculation process of the eigenvalue is as follows.

$$\left| {R - \lambda I_{p} } \right| = 0$$
(10.6)

where \(I_{p}\) is the identity matrix of size \(p\). The eigenvector (\(\vec{b}_{k}\)) corresponding to the \(k{\text{th}}\) eigenvalue (\(\lambda _{k}\)) can be obtained as follows.

$$R\vec{b}_{k} = \lambda _{k} \vec{b}_{k} ,\;\;\vec{b}_{k} = \left[ {\begin{array}{*{20}c} {b_{{1k}} } \\ {b_{{2k}} } \\ \vdots \\ {b_{{pk}} } \\ \end{array} } \right],\;\;\;\;\;\;\;k = 1,2,...,p$$
(10.7)

Since the covariance matrix \(R\) is symmetric, these \(p\) eigenvectors are perpendicular and not correlated with each other. Then we can obtain a matrix \(B = \left[ {b_{{jk}} } \right]_{{p \times p}}\) containing \(p\) eigenvectors as follows.

$$B = \left[ {b_{{jk}} } \right]_{{p \times p}} {\text{ = }}\left[ {\vec{b}_{1} ,\vec{b}_{2} , \cdots ,\vec{b}_{p} } \right] = \left[ {\begin{array}{*{20}c} {b_{{11}}^{{}} } & {b_{{12}}^{{}} } & \cdots & {b_{{1p}}^{{}} } \\ {b_{{21}}^{{}} } & {b_{{22}}^{{}} } & \cdots & {b_{{2p}}^{{}} } \\ \vdots & \vdots & \ddots & \vdots \\ {b_{{p1}}^{{}} } & {b_{{p2}}^{{}} } & \cdots & {b_{{pp}}^{{}} } \\ \end{array} } \right],\;\;\;j = 1,2,....p,\;\;\;k = 1,2,...,p$$
(10.8)

Step 6 (determine the number of principal components to be considered). The number of the components of the dataset is equal to the number of the eigenvalues/the eigenvectors/the indicators (\(p\)). The eigenvalues can be the same (the multiple roots), and the eigenvectors obtained based on the multiple roots are also irrelevant since the covariance matrix \(R\) is a real symmetric matrix. We choose the top \(m\) components as the principal components based on the descending order of the eigenvalue that can cumulatively express more than 80% of data variance. In other words, more than 80% of the information of the dataset can be explained based on these \(m\) principal components.

$$\frac{{\sum\limits_{{k = 1}}^{m} {\lambda _{k} } }}{{\sum\limits_{{k = 1}}^{p} {\lambda _{k} } }}{\text{ = }}\frac{{\sum\limits_{{k = 1}}^{m} {\lambda _{k} } }}{p}0.8,\;\;\;\;\;\;\;\;k = 1,2,...,m\;\;\;{\text{and}}\;\;\;m \le p$$
(10.9)

Step 7 (Construct the decision matrix \(B^{0} = \left[ {b_{{jk}}^{0} } \right]_{{p \times m}}\)). Unit vector can ensure that the importance level of indicators can be compared among different eigenvectors. Therefore, we transformed the vector \(b_{k}\) into a unit vector as follows.

$$\vec{b}_{k}^{0} = \frac{{\vec{b}_{k} }}{{\left| {\vec{b}_{k} } \right|}}{\text{ = }}\left[ {\begin{array}{*{20}c} {b_{{1k}}^{0} } \\ {b_{{2k}}^{0} } \\ \vdots \\ {b_{{pk}}^{0} } \\ \end{array} } \right],\;\;\;\;\;\;\;\;\;k = 1,2,...,m\;\;\;{\text{and}}\;\;\;m \le p$$
(10.10)

where \(\left| {\vec{b}_{k} } \right|\) is the magnitude (modulus) of the vector \(b_{k}\). \(b_{k}^{0}\) is the unit vector, indicating that the square sum of its components equals 1. The absolute value of the component in the vector \(b_{k}^{0}\) reflects the importance level of the corresponding indicators. The decision matrix \(B^{0}\) can be constructed based on the following equation.

$$B^{0} = \left[ {b_{{jk}}^{0} } \right]_{{p \times m}} {\text{ = }}\left[ {\vec{b}_{1}^{0} ,\vec{b}_{2}^{0} , \ldots ,\vec{b}_{m}^{0} } \right] = \left[ {\begin{array}{*{20}c} {b_{{11}}^{0} } & {b_{{12}}^{0} } & \cdots & {b_{{1m}}^{0} } \\ {b_{{21}}^{0} } & {b_{{22}}^{0} } & \cdots & {b_{{2m}}^{0} } \\ \vdots & \vdots & \ddots & \vdots \\ {b_{{p1}}^{0} } & {b_{{p2}}^{0} } & \cdots & {b_{{pm}}^{0} } \\ \end{array} } \right]$$
(10.11)

To be noticed, the results obtained using SPSS software are the ‘component matrix’ (\(F = \left[ {\vec{f}_{1} ,\vec{f}_{2} , \ldots ,\vec{f}_{m} } \right]_{{p \times m}}\)) with the loadings as its elements instead of the decision matrix (\(B^{0} = \left[ {\vec{b}_{1}^{0} ,\vec{b}_{2}^{0} , \ldots ,\vec{b}_{m}^{0} } \right]_{{p \times m}}\)). The relationship between the ‘component matrix’ and the decision matrix is as follows.

$$\vec{b}_{k} = \frac{{\vec{f}_{k} }}{{\sqrt {\lambda _{k} } }},{\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} {\kern 1pt} \;\;\;\;k = 1,2,...,m\;\;\;{\text{and}}\;\;\;m \le p$$
(10.12)

Step 8 (Construct the primary component models). The principal component is a linear combination of all the indicators that go through the origin. The \(m\) principal components of DMU \(i\) can be formulated as follows.

$$\left\{ {\begin{array}{*{20}c} {F_{{i1}} = b_{{11}}^{0} z_{{i1}} + b_{{21}}^{0} z_{{i2}} + \cdots + b_{{p1}}^{0} z_{{ip}} } \\ {F_{{i2}} = b_{{12}}^{0} z_{{i1}} + b_{{22}}^{0} z_{{i2}} + \cdots + b_{{p2}}^{0} z_{{ip}} } \\ \vdots \\ {F_{{im}} = b_{{pm}}^{0} z_{{i1}} + b_{{p2}}^{0} z_{{i2}} + \cdots + b_{{pm}}^{0} z_{{ip}} } \\ \end{array} } \right.$$
(10.13)

where \(F_{{i1}} ,\;F_{{i2}} , \ldots ,F_{{im}}\) are the \(m\) principal components of DMU \(i\). \(z_{{i1}} ,z_{{i2}} ,\ldots,z_{{ip}}\) indicates the value of indicators from 1 to \(p\) for DMU \(i\). The rank of principal components (\(F_{{i1}} ,\;F_{{i2}} , \ldots ,F_{{im}}\)) is based on the extent of variance included in the components measured by eigenvalue (\(\lambda _{1} \ge \lambda _{1} \ge \cdots \ge \lambda _{m}\)). In other words, \(F_{{i1}}\) includes the maximum variance of the dataset, while \(F_{{im}}\) has the minimum variance. These principal components are not correlated but perpendicular to each other, indicating that they contain the information of different ‘statistical dimensions’.

Step 9 (Evaluate the performance of DMUs). The study constructs the comprehensive component model of DMU \(i\) as follows (Zhu 1998).

$$\begin{gathered} F_{i} = \sum\limits_{{k = 1}}^{m} {W_{k} } \times F_{{ik}} = a_{1} z_{{i1}} + a_{2} z_{{i2}} + \cdots + a_{p} z_{{ip}} ,\;\;\; \hfill \\ i = 1,2,...,n,\;\;\;\;k = 1,2,...,m\;\;\;{\text{and}}\;\;\;m \le p \hfill \\ \end{gathered}$$
(10.14)
$$W_{k} = \left\{ {\begin{array}{*{20}c} {\lambda _{k} /\sum\limits_{{k = 1}}^{m} {\lambda _{k} } ,\;\;\;\;\;if\;\;\;\sum\limits_{{j = 1}}^{p} {b_{{jk}}^{0} } \ge 0\;\;} \\ { - \lambda _{k} /\sum\limits_{{k = 1}}^{m} {\lambda _{k} } ,\;\;\;\;\;if\;\;\;\;\sum\limits_{{j = 1}}^{p} {b_{{jk}}^{0} } < 0} \\ \end{array} } \right.$$
(10.15)

The percentage of variance explained by each component represents its relative importance. Therefore, the study uses the \(\lambda _{k} /\sum\nolimits_{{k = 1}}^{m} {\lambda _{k} }\) as the weight of the primary component \(k\), which is denoted as \(W_{k}\). The sign of \(W_{k}\) depends on the \(\sum\nolimits_{{j = 1}}^{p} {b_{{jk}}^{0} }\), and a negative sign will be added to \(W_{k}\) once \(\sum\nolimits_{{j = 1}}^{p} {b_{{jk}}^{0} }\) is negative. The sign change does not change the explanation of the principal component (Zhu 1998). \(a_{j}\) means the importance level of the indicator \(j\). \(F_{i}\) indicates the performance scores of DMU \(i\). The higher the \(F_{i}\), the better the performance is. \(z_{{ij}}\) is the standardized value of the indicator \(j\) for DMU \(i\) based on step 3.

Step 10 (Standardize the performance of DMUs to make scores range from 0 to 100). The performance score of DMUs based on step 9 can be negative or positive, which makes it not straightforward to compare and demonstrate. Therefore, we further standardize the scores to make them range from 0 to 100. The higher the standardized scores of a DMU (\(F_{i}^{'}\)), the better the sustainability is. 0 indicates the worst performance, while 100 means the best performance. The standardization process is as follows.

$$F_{i}^{'} = 100 \times \frac{{F_{i} - \mathop {\min }\limits_{{1 \le i \le n}} (F_{i} )}}{{\mathop {\max }\limits_{{1 \le i \le n}} (F_{i} ) - \mathop {\min }\limits_{{1 \le i \le n}} (F_{i} )}}$$
(10.16)

where \(\mathop {\min }\limits_{{1 \le i \le n}} (F_{i} )\) and \(\mathop {\max }\limits_{{1 \le i \le n}} (F_{i} )\) indicate the minimum and maximum value among all the \(F_{i}\) of DMU \(i\).

10.2.2 Sample Cities and Data

Table 10.1 presents the indicators of three aspects, including economy, environment, and society. 14 indicators are included in this study. The data of these indicators are collected from the statistical yearbook of each city. The innovation capacity indicates the patent per capita. The patent data are collected from the State Intellectual Property Office of China, which includes invention patents, utility model patents, and design patents. The GDP has been converted to the 2005 constant price based on the GDP index. CO2 emissions are sourced from Chen et al. (2020).

Table 10.1 List of indicators regarding economy, society, and environment

According to ‘The Plan for the Sustainable Development of Chinese Resource-based Cities (2013–2020)’ issued by the State Council, there are 262 resource-based cities in China, among which 69 are resource-exhausted. In this study, owing to the data unavailability at the county level, we only take the prefectural level cities into consideration. A prefectural-level city doesn’t indicate the usual meaning of the term (i.e., urban settlement), but means an administrative unit that with not only urban area but rural area. Daxinganling city is also not included because of insufficient data. Thus, a total of 24 prefectural level cities are studied and some descriptions of these 24 cities can be found in Table 10.2.

Table 10.2 List of resource-exhausted cities at the prefectural level of China

10.3 Structure Detection of Dataset

Table 10.3 shows the results of Bartlett’s test and Kaiser–Meyer–Olkin measure, which are tested using the SPSS software. The Kaiser–Meyer–Olkin measure is a test that shows the share of variance in the indicators which could be caused by underlying factors (Dziuban and Shirkey 1974). The result of the Kaiser–Meyer–Olkin measure ranges from 0 to 1 (Dziuban and Shirkey 1974). It is generally more convincing to conduct PCA analysis if the value of the result becomes higher. If the value is larger than 0.6, indicating the sample complies with the requirement of data structure and can use the PCA (Dziuban and Shirkey 1974). For Bartlett's test of sphericity, the null hypothesis is that the correlation matrix of the dataset is an identity matrix. If it cannot reject the null hypothesis, it means that your indicators are not related and thus it is not appropriate for structure detection. Small values (less than 0.05) of the significance level indicate that factor analysis may be useful with this dataset.

Table 10.3 Results of the Kaiser–Meyer–Olkin measure

The results in Table 10.3 show that it is suitable for the dataset to conduct structure detection since the value of the Kaiser–Meyer–Olkin measure is 0.630 and the null hypothesis is rejected at a 1% significant level.

Table 10.4 can be a reference to show the meaning of value based on the Kaiser–Meyer–Olkin measure.

Table 10.4 Interpretation of the results using Kaiser–Meyer–Olkin measure (Dziuban and Shirkey 1974)

Table 10.5 shows the correlation matrix of 14 indicators. The order of the variables shown in Table 10.5 is the same as in Table 10.1. The bottom left triangle indicates the Pearson correlation, while the top right triangle means the Spearman correlation. The eigenvalue and eigenvector are based on the correlation matrix measured by the Pearson correlation and this matrix is symmetric. The symbols *, **, and *** indicates the value is significant at 10%, 5%, and 1% levels, respectively. The value in the matrix shows that many indicators are significantly correlated with each other, and the PCA can be used to reduce the correlation and simplify the dimensions.

Table 10.5 Correlation matrix of the 14 variables

10.4 The Weights and Sustainability Performance Based on the Principal Component Analysis

Table 10.6 shows the communalities of 14 indicators. Extraction commonalities can be used to evaluate the variance of each indicator that can be extracted by the factors in the PCA. If the extraction is low, this indicator is not fit well with the factor solution and can be discarded in further analysis. The results in Table 10.6 show that the initial variance included in the dataset is 1 for all indicators. The extraction commonalities in the PCA are fit well, with the lowest level at 0.736 for the indicator ‘GDP growth rate’.

Table 10.6 Communalities of 14 indicators

Table 10.7 shows the total variance explained for each component. The leftmost section of Table 10.7 demonstrates the variance explained by the 14 components. There are 14 components in total, among which five components have eigenvalues larger than 1. The share of variance is equal to the share of the corresponding eigenvalue in the total eigenvalues considered. The total value of the eigenvalues is 14, and thus the share of variance for the first component is 24.226%, which means that 24.226% of the variance in the dataset can be explained by this component. For the second component, 19.541% of the variance can be explained using this component. The share of cumulative variance explained by the first and the second components is 43.767%.

Table 10.7 Total variance explained

The second section of Table 10.7 shows the variance that can be explained by the extracted components before conducting rotation. The cumulative variability explained by the top seven components in the extracted solution is about 81.131%, which is the same as the initial solution. Thus, there is no loss in the variation explained by the initial solution. In the case there is a difference between the values in the first section and the second section, it can be interpreted that the latent components are unique to the original variables and variability that simply cannot be explained by the component model. This study considers the top seven components as the principal component, which take up 81.131% of the variability of the dataset. The rightmost section of this table shows the variance explained by the extracted factors after rotation. The rotated factor model makes some small adjustments for all these 7 components.

The results of the decision matrix can be found in Table 10.8. There are seven principal components considered, each of which has an eigenvalue and an eigenvector (Table 10.8). Table 10.9 shows the component matrix obtained using SPSS software instead of the MATLAB code shown in Sect. 10.6. The results obtained using SPSS software are the ‘component matrix’ (\(F = \left[ {f_{1}^{{}} ,f_{2}^{{}} , \ldots f_{m}^{{}} } \right]_{{p \times m}}\)) instead of the decision matrix (\(B^{0} = \left[ {b_{1}^{0} ,b_{2}^{0} , \ldots b_{p}^{0} } \right]_{{p \times m}}\)). The ‘component matrix’ can be transferred to the decision matrix based on \(\vec{b}_{k} = \frac{{\vec{f}_{k} }}{{\sqrt {\lambda _{k} } }}\), as shown in step 7 in Sect. 10.2.1. Based on Table 10.8, the first principal component (PC1) is mainly explained by Var 10 (0.490), followed by Var 9 (–0.410), and Var 12 (0.394). Var 10, Var 9, and Var 12 indicate ‘CO2 emissions per capita’, Number of public transportation vehicles per 10,000 persons’, and ‘Volume of industry sulfur per capita’, respectively. For the second principal component (PC2), it is primarily composed of Var 7 (–0.394), Var 14 (–0.369), and Var 1 (–0.369). Var 7, Var 14, and Var 1 means ‘Innovation capacity’, ‘Ratio of wastewater centralized treated of sewage work’, and ‘ GDP per capita’, respectively. The top three compositions of each principal component have been highlighted in bold as shown in Table 10.8.

Table 10.8 Decision matrix of the seven principal components
Table 10.9 Component Matrix

Table 10.10 shows the transformation matrix of the seven principal components. This matrix can be used to demonstrate the rotation level of the component matrix compared with the unrotated component matrix. If the values of off-diagonal elements are small, it indicates there are smaller rotations between these two matrices. By contrast, the large values in the off-diagonal elements mean larger rotations.

Table 10.10 Component transformation matrix

Table 10.11 shows the sustainability performance of 24 resource-exhausted cities from 2005 to 2016, while Table 10.12 shows the sustainability performance after standardization. The average sustainability of these cities is 53.783 in 2015, while the level increases to 64.297 in 2016 (see Table 10.12). However, there is still much room for improvement. Among the 24 cities, Shuangyashan, Fushun, Fuxin, Shizuishan, and Wuhai see the most significant risk in sustainability, at a mere 52.242, 52.447, 47.371, 34.062, and 4.113 in 2016 (see Table 10.12).

Table 10.11 The sustainability scores of 24 resource-exhausted cities from 2005 to 2016
Table 10.12 The standardized scores of sustainability of 24 resource-exhausted cities from 2005 to 2016

Figure 10.1 compares the sustainability level of the 24 resource-exhausted cities in 2005 with that in 2016. Most of the resource-exhausted cities see an increase in sustainability from 2005 to 2016, except Panjin, Shuangyashan, and Yichun-HLJ (see Fig. 10.1). This indicates that Panjin, Shuangyashan, and Yichun-HLJ face significant sustainability risks and prompt actions should be taken to reverse this deterioration. Panjin, Shuangyashan, and Yichun-HLJ are located in Liaoning province, Heilongjiang province, and Heilongjiang province, respectively. These two provinces belong to Northeast China and are used to be the old industrial base of China. Because of the decline of its once-powerful resource-related sectors, the Northeast region is named the Rust Belt of China (Campbell 2005; Xiao et al. 2019a), whose development path is different from the Yangtze River Delta region (Xiao et al. 2019b). Many cities in Northeast China used to be abundant in natural resources. However, after decades of the exploitation of natural resources, many cities are facing resource depletion problems and Panjin, Shuangyashan, and Yichun-HLJ are the typical cases. The sustainability risks of these cities threaten the long-term development and integrated measures should be taken to improve sustainability and reduce the risk of deterioration.

Fig. 10.1
figure 1

The sustainability scores of 24 resource-exhausted cities in 2005 and 2016. The value in the figure is based on Table 10.12

10.5 Conclusions and Policies for Mitigating Sustainability Risks

Considering the significant sustainability risks faced by the resource-exhausted cities, the study constructs an evaluation framework covering economic, social, and environmental dimensions to evaluate the sustainability of resource-exhausted cities and identify potential risks. The PCA is used to evaluate the sustainability risks of 24 resource-exhausted cities in China from 2005 to 2016. The main findings are as follows:

The sustainability of resource-exhausted cities sees an increasing trend. The average sustainability of these cities is 53.783 in 2015, while the level increases to 64.297 in 2016. However, there is still much room for improvement. Among the 24 cities, Shuangyashan, Fushun, Fuxin, Shizuishan, and Wuhai see the most significant risk in sustainability, at a mere 52.242, 52.447, 47.371, 34.062, and 4.113 in 2016.

Panjin, Shuangyashan, and Yichun-HLJ are the only three cities whose sustainability sees a decreasing trend from 2005 to 2016. This indicates that Panjin, Shuangyashan, and Yichun-HLJ face relatively significant sustainability risks and prompt actions should be taken to reverse this deterioration. These three cities are located in the Northeast region, the Rust Belt of China. Many cities in Northeast China used to be abundant in natural resources. However, after decades of the exploitation of natural resources, many cities are facing resource depletion problems, and Panjin, Shuangyashan, and Yichun-HLJ are the typical cases. The sustainability risks of these cities could threaten the long-term development and integrated measures needed to be taken to improve sustainability and reduce the risk of deterioration.

Some policies are proposed to mitigate the sustainability risks of resource-exhausted cities. First, policy-makers should keep a close eye on the economic, social, and environmental development of resource-exhausted cities. The evaluation framework used in this study can be a reference to take stock of where resource-exhausted cities stand in terms of sustainability, identify the potential risks, and further promote sustainable development. Second, for the laggard cities in sustainability, relevant policies should be implemented to mitigate the sustainability risks and then reverse the deterioration process. For the other cities, they should plan in advance to avoid getting into similar situations.

10.6 Code Availability

The MATLAB code used to evaluate the performance of the DUMs using principal component analysis (PCA) is published for transparency, as follows.

figure afigure a