Introduction

Knowledge accumulation and technological progress are sources of regional sustained economic growth (Lucas 1988; Romer 1990). As carriers of knowledge and technology, highly educated talents are key drivers of national and regional development. How to attract talents, retain talents, and make use of talents effectively is a common concern of governments around the world (Lee et al. 2004). The concentration of talents can bring about a variety of agglomeration economies, such as knowledge spillovers, low-cost communication and learning externalities, and a specialized talent pool for certain industries that need intelligence (Fujita and Thisse 1996). China has experienced dramatic economic restructuring and industrial upgrading in recent years, and technological innovation has therefore become the core driving force for national and regional development (Duan et al. 2016; Liu and Shen 2014; Xu and Ouyang 2018). Talents are undoubtedly the first element to promote technological innovation and regional economic and social development (Florida et al. 2012). Therefore, Chinese central government has implemented several policies to illustrate the importance of highly educated talents to China’s regional economic growth in the transition period. For example, in 2010, the State Council of China promulgated the first medium- and long-term talent-development planning document, namely National Medium- and Long-Term Talent Development Plan (2010–2020). Its outline points out that China must change from a country with huge human resources to a country with a huge amount of talents, and win an active strategic battle in the fierce international competition. Furthermore, in recent years, Chinese local governments have formulated a series of public policies for attracting and retaining highly educated talents (Yuan and Wan 2007), which have set off an intensified wave of competition for talents.

In recent years, the increase in China’s highly educated talents has become an ongoing trend. According to the 1% national sample survey of the population in 2015, China has a population with a college education or above reaching 170.93 million, an increase of 42.87% over the same period in 2010. A burgeoning body of literature has focused on the spatial patterns of highly educated people at provincial or prefecture level in China and drivers of those patterns (Bao et al. 2007; Gu et al. 1999; Liu and Shen 2014; Liu et al. 2017; Miu et al. 2013; Wang and Zhang 2003; Zhang et al. 2011). According to these studies, there is an unbalanced trend of talent distribution in China, which has led to a regional imbalance of talent demand (Liu and Shen 2014; Liu et al. 2017). The concentration effect of talents in those developed cities has further squeezed the development room for undeveloped small and medium-sized cities. Furthermore, there is likely to be a positive spatial spillover effect in the distribution of talents across space because of a positive externality in the number of talents or a positive spatial autocorrelation in determinants associated to the number of talents in neighboring cities (Miguélez et al. 2010). However, few researches have been conducted on the spatial spillover effect in Chinese highly educated talents at city level. In addition, due to data availability, most studies focus on the distribution and determinants of all kinds of China’s talents before 2010 (e.g., Liu and Shen 2014; Liu et al. 2017). Under this context, we hope to contribute to previous studies in following aspects: (a) using data from 1% national population survey, this paper examines the distribution and drivers of highly educated talents at city level for the first time; (b) using spatial econometrics methods, this study attempts to investigate whether the distribution of highly educated talents between cities shows a spatial spillover effect.

The main object of this paper is to examine the spatial pattern, determinants and spatial spillover effect of highly educated talents at prefecture level units in China, by using data derived from the 2015 national 1% population sampling survey data. The next section begins with a systematic summary of the spatial pattern and influencing factors of highly educated talents. The third section explains the research design, including the research data, regions and research methods. Then, there is a description of several GIS spatial analysis methods involved in the spatial analysis of the distribution of talents. The fifth section constructs a series of econometric models (including nonspatial models and spatial econometric models) to explore the impacts of the concentration of highly educated talents. Then, there are some conclusions, and the final section offers relevant talent management recommendations.

Literature Review

Since the pioneering of population economic theory, many have seen economic opportunities as the primary conditions affecting the migration of talents (e.g., Lewis 1954; Todaro 1969). Regional economic opportunities include specific elements, such as wages relating to the human capital of the talents themselves, employment environment conditions, the openness of job market, opportunities for talent future development, and regional industrial structure. In the middle of the twentieth century, economists, demographers, and geographers explored the mechanism behind labor mobility from the perspectives of macroeconomics and microeconomic individuals, based on hypothesis of rational-economic man, and they proposed a series of classical theories. From the microeconomic perspective, economists believed that each worker has sufficient economic rationality, and that the motivation for migration comes from economic benefits (Lewis 1954; Stark and Bloom 1985; Todaro 1969). When wage income is greater than migration cost, labor has sufficient motivation to move (Esenwein-Rothe 1964). Todaro (1969) set out the migration choices of laborers from a microscopic perspective. The motivation for labor migration lies in expected income rather than absolute income. The labor migration decision is the result of comparing expected income with migration costs. In the 1980s, the focus of migration economics theory shifted from the individual to the family. Stark and Bloom (1985) argued that labor migration is the result of household income diversification and risk minimization decisions. Labor migration occurs not only to maximize individual economic benefits, but also to achieve household economic income goals. Once laborers achieve their expected income targets, they may choose to settle down.

Krugman (1991) proposed the concept of new economic geography. In the framework of monopolistic competition, he put the transportation cost and manufacturing scale return increment into the model, and he proposed a core-periphery model (Krugman 1991). Regardless of externalities, the model can explain whether the spatial pattern of labor force is agglomerated or discrete simply by looking at increasing returns of scale and improved transportation costs (Fujita and Thisse 1996; Krugman 1991).

Another group of scholars began to study the impact of amenity factors on the migration of talents (e.g., Glaeser et al. 2001). Based on the perspective of equilibrium theory, Graves (1976) optimized the migration model of Cebula and Vedder (1973). Assuming that individuals’ utilities in every region are the same in equilibrium, Graves (1976, 1979) found that the economic income of each region compensated for its level of amenities for laborers. Therefore, the migration of laborers is essentially the result of the need for regional amenities rather than economic income (Knapp and Gravest 1989; Mueser and Graves 1995). On the basis of Graves’s (1976) equilibrium theory, Glaeser et al. (2001) put forward the hypothesis of the consumer city. They believed that the key to urban sustainable development lies in the quality of services and consumer goods available in the city. A city with a higher degree of amenities will therefore attract more talents to live in it, thus placing its economic development and innovation ahead of cities with low amenity levels.

Florida (2002) defined the conception of creative class, which transcended the definition of traditional highly skilled labors with academic or occupational divisions. He found that the reason for the accumulation of innovative talents was a city’s facilities, including both its natural conditions, such as urban climate, and the inclusive humanistic environment of the city. Clark et al. (2002) pointed out that improvements in transportation technologies have reduced the costs of transporting goods and migrating, and a multi-center urban pattern has gradually formed. At this stage, the key to attracting talents lies in the supply of public goods and the physical facilities of the city (Clark et al. 2002).

Despite the increased focus on amenity factors in recent years, scholars have not reached a consensus on the incidences of economic opportunities and amenity factors, and therefore they have launched a heated discussion around this topic (Arntz 2010; Faggian et al. 2017; Liu and Shen 2014; Niedomysl et al. 2010; Scott 2010). Although several scholars acknowledged the importance of amenity factors, these factors should be secondary considerations in labor migration decision-making (Arntz 2010; Greenwood and Hunt 1989; Niedomysl and Hansen 2010). Other scholars believed that the impact of amenities, especially the natural environment, on the distribution of talents is still unverified (Liu and Shen 2014). For example, although several industrial cities in the interior of the United States do not have good living environments, they have attracted a large number of talents to work because they can provide a variety of employment opportunities (Chiquiar and Hanson 2005; Davies et al. 2001). In addition, the factors influencing talents gathering may be different at different historical stages, and the impact of job opportunities and local, quality-related factors varies among different life stages of the talents. There are also differences in the types of talents (e.g., high-educated immigrants and low-educated immigrants) (Chen and Rosenthal 2008; Ferguson et al. 2007; Niedomysl and Hansen 2010; Scott 2010). Gottlieb and Joseph (2006) found that compared with other academic talents, doctoral students pay less attention to wages when choosing employment locations, but instead they focus on amenity factors such as climate, public services, and crime rates.

Because of the spatial correlation of driving forces that affect talent distribution, such as economic linkages and policy response, there is also likely to be a spatial autocorrelation (i.e. spatial spillover) effect in the spatial distribution of talents. So far, a burgeoning but small body of literature has paid attention to the spatial spillover effect in the distribution of talents. Miguélez et al. (2010) have confirmed that there were a strong and positive spatial autocorrelation in the inflows of inventors across European regions, and that this spatial autocorrelation effect came from three major reasons: the attractiveness of a certain region, a certain region’s congestion effect, and the influence of country specific features. Rodríguez-Pose and Tselios (2011) also found a positive spatial spillover effect in the distribution of education attainment in 102 regions in Western Europe. In recent years, scholars have begun to conduct several empirical studies using China as a case. First, research showed that economic opportunities are the primary force shaping the migration and distribution of talents in China (Gu et al. 2019a; Liu and Shen 2014; Yu et al. 2019). According to this stream of research, regional GDP, per capita income, unemployment rate, and industrial structure are the key forces influencing the spatial pattern of Chinese talents. Some scholars found that the western region in China was unable to attract talents because of its unsatisfactory talent market environment, talent entrepreneurial environment, and talent economic development environment (Bao et al. 2007). Some scholars started from the micro level, and they conducted research through questionnaires to determine the economic factors such as job opportunities, living conditions and wage levels, which are the main drivers of migration, further (Gu et al. 1999; Wang and Zhang 2003). In addition, scholars have further demonstrated the decisive role of economic opportunities through research on different types of talents, including highly skilled labor, scientific research talents, and highly educated talents (Miu et al. 2013; Zhang et al. 2011).

In addition, domestic scholars have begun to explore the role of amenity factors in the distribution of talents (e.g., Liu and Shen 2014). First, they found that public services such as health and education reflect the impacts of amenities. The attraction of talents in big cities is mainly the result of the supply of social security (Deng et al. 2001; Luo 2001; Zhang et al. 2017). Second, despite the importance of social and cultural factors, the impact of these variables on talents is not easy to observe through quantitative data; thus, a few scholars have used questionnaire methods to conduct research from the scope of micro-individuals (Li et al. 2018). Third, consumer facilities reflect the convenience of urban life, and it is the common psychology of talents to live in convenience and comfort (Liu et al. 2017). Fourth, because of the increasingly long commute time between cities, the impact of transportation facilities on talents is increasingly significant (Deng et al. 2001; Xu and Zhang 2010).

Furthermore, the relevant literature on comparing the impacts of economic opportunities and amenities agrees that economic opportunities are the main determinants shaping Chinese talents, whereas amenity variables, although significant in some studies (Song et al. 2016; Yu et al. 2019), are less attractive to talents (Liu and Shen 2014). However, existing research mainly uses the census data of 2000, 2005, or 2010. After the Chinese government proposed the National Medium- and Long-Term Talent Development Plan (2010–2020) in 2010, it is still unclear which forces actually affect the distribution of Chinese talents due to the lack of the necessary census data. In addition, the measurement of amenity in previous studies is not comprehensive. For example, in previous studies, researchers paid less attention to consumption diversity and regional traffic accessibility.

Also, a small body of studies have been conducted on the spatial spillover effect in the distribution of talents. A positive spatial spillover effect is also found in the spatial patterns of the birthplace and the workplace of academicians of Chinese Academy of Sciences over the period of 1955 to 2011 (Li et al. 2013). Using provincial level panel data, Song et al. (2016) has shown that a spatial autocorrelation effect was also seen in the employment of IT service sector and IT manufacturing sector. Research on spatial spillover effect is necessary for better understanding of the driving forces of talents’ spatial moves or developing regional talent policies in China. However, there is little literature on spatial spillover effect in the distribution of highly educated talents. This paper hopes to address this literature gap by using spatial econometrics models.

Research Data, Area and Methodologies

Research Data

The basic data for our paper came from the 2015 national 1% population sample survey. From the dataset, there were 147,139 thousand samples with a college education or above in 2015, accounting for 10.71% of the total population. We define talents as “people with junior college degrees or above.” The benefits of using academic qualifications to define talents are reflected in the fact that academic qualifications can better represent the human capital and labor capital of the labor force. In addition, relevant talent policies in Chinese cities often use academic qualifications to divide talents. Using academic qualifications to define talents can better match the actual talent needs of each city.

To ameliorate the endogeneity problem of reverse causality, the dependent variable in our article has a one-period lag from the independent variable. The independent variables came from the China City Statistical Yearbook from 2015, reflecting the economic and social conditions of cities at the end of 2014. The meteorological data for each city for 2014 came from the National Weather Station of the National Meteorological Information Center. We aggregated the weather data to local cities. The national road network (provincial road, national highway, highway) data necessary for calculating road network accessibility came from the China Traffic Atlas of 2015. Specifically, variables mainly fell into three sets: economic opportunities variables, amenity variables, and control variables.

Research Area

Our research area covers 31 provinces in China, excluding Hong Kong, Macao, and Taiwan. The basic geographic unit is the prefecture level, because the city is the basic unit for formulating talents and population management policies in China, and the differences in factors that influence the distribution of talents in various cities are huge. We aggregated highly educated talents with a college education or above to prefecture-level units according to the regional administrative code of each prefecture-level city, and we finally obtained 309 prefecture-level units as our basic units. We measured the spatial pattern of highly educated talents according to two types of statistic specifications: the number of highly educated talents and the density of highly educated talents (persons/km2). In the regression model, considering the availability of data and the continuity of space, we selected a subset of 224 prefecture-level cities as the sample for regression analysis.

Research Methodologies

Gini Index (GI)

The Gini coefficient measures the imbalance of highly educated talent distribution in 309 prefecture-level units in China, with its formula as follows (Liu and Gu 2019):

$$ G=\frac{\sum_{i=1}^n{lnx}_i\times {lny}_i/{\sum}_{i=1}^n{\left({lnx}_i\right)}^2-1}{\sum_{i=1}^n{lnx}_i\times {lny}_i/{\sum}_{i=1}^n{\left({lnx}_i\right)}^2+1} $$
(1)

where xi is the cumulative proportion of highly educated talents in each municipal unit until city i and yi is the cumulative proportion of highly educated talents in each city until city i.

Exploratory Spatial Data Analysis

Spatial autocorrelation of highly educated talents among cities is an aspect of understanding its spatial pattern. Spatial autocorrelation stems from the first law of geography, which means that the attribute values of objects have correlations with each other in neighboring regions (Tobler 1970). Derived from the Pearson correlation coefficient, Moran’s I Coefficient (MC) is the most popular method of spatial autocorrelation (Gu et al. 2019b). The formula is:

$$ MC=\frac{X^{\prime}\boldsymbol{W}X}{X^{\prime }X} $$
(2)

where X represents the column vector of observations and W represents the standardized spatial weight matrix. MC can measure the spatial autocorrelation of highly educated talent density in cities, and, to a certain extent, it represents whether the density or the number of highly educated talents in each city is agglomerated, randomly distributed, or discretely distributed.

Furthermore, we use local spatial autocorrelation (LISA) to measure the association of the density of highly educated talents in each city with its neighboring cities and to identify the spatial cluster pattern (Anselin 1995). The Formula is as follows:

$$ Local\ {Moran}^{\hbox{'}}s\ I=\frac{n\left({x}_i-\overline{x}\right){\sum}_{j=1}^m{W}_{ij}\left({x}_j-\overline{x}\right)}{\sum_{i=1}^n{\left({x}_i-\overline{x}\right)}^2},\left(i\ne j\right) $$
(3)

Where xi and xj are the density of talents of city i and city j; n is the number of cities; Wij is the spatial weight matrix, and \( \overline{x} \) is the average of x. Using LISA, four cluster patterns can be detected according to the association of talent density in one city and its neighboring cities. A High-High (HH) city has high talent density with average neighboring talent density above the mean. A Low-Low (LL) city has low talent density with average neighboring talent density below the mean. A High-Low (LH) city has high talent density with average neighboring talent density below the mean. A Low-High (LH) city has a low talent density with average neighboring talent density above the mean. Cities that fail statistical tests are indicated as not significant (NS).

Spatial Econometrics Models

Spatial econometrics models deal with spatial effects in data by adding operators that reflect spatial autocorrelation in model settings (Stillwell et al. 2018). When using traditional linear or nonlinear models to examine economic drivers and amenity drivers of the spatial distribution of highly educated talents, it is easy to overlook the spatial autocorrelation problem in the data due to externality, unobservable variables, or missing variables. Therefore, the error term in the traditional regression does not satisfy the assumption of i.i.d., which leads to a larger estimation error or a decrease in the validity of the model. We employed a general spatial model (GSM), considering two spatial spillover effects (spatial lag and spatial error effects) together. The formula is as follows:

$$ \mathrm{y}=\rho {W}_1y+ x\beta +\mu, \mu =\lambda {W}_2\mu +\varepsilon, \varepsilon \sim \left(0,{\sigma}^2{I}_n\right) $$
(4)

where ρW1y and λW2μ represent spatial lag terms of the dependent variable and the random error term respectively, and ε is a random error term that satisfies the independent and identical distribution. Supported by these spatial econometrics, it is possible to examine the driving mechanism of talent distribution.

Spatial Pattern of Chinese Highly Educated Talents

A Spatially Concentrated and Unbalanced Pattern of Highly Educated Talents

Firstly, we calculated the GI of talent density (persons/km2) of China’s cities. The GI of talents reached 0.567 (> 0.5), indicating that the highly educated talents are very unevenly distributed among prefecture-level units. Secondly, we mapped the density of highly educated talents in each city using ArcGIS. As Fig. 1 shows, China’s highly educated talents are highly concentrated in a small subset of cities. High-ranking cities include first-tier cities such as Beijing Shanghai, Guangzhou and Shenzhen, capital cities in the central and eastern regions such as Zhengzhou, Taiyuan, Wuhan, and Nanjing, and Dongguan, Zhuhai, and Zhongshan in the Pearl River Delta. In summary, the distribution of talent density of cities demonstrates a spatially concentrated and unbalanced pattern.

Fig. 1
figure 1

Spatial distribution of density of highly educated talents in 2015

A Clustered Spatial Pattern of Highly Educated Talents

The global spatial autocorrelation analysis measures the overall level of spatial autocorrelation of the density and the number of highly educated talents among cities. Using Geoda software, it is possible to calculate the two spatial autocorrelation values. The MC value of the density of talents in China is 0.145 (p < 0.001), while this index for the number of highly educated talents reaches 0.073 (p < 0.001). The above results show that there is a significant spatial spillover effect in both the density and the number of high-level talents in prefecture-level cities. Furthermore, the LISA map can measure the similarity and dissimilarity of the attributes of particular geographic units to their surrounding units. In Geoda software, we obtained the LISA result of spatial clusters of Chinese talents. As Fig. 2 shows, the red parts are HH clusters, including several cities in the Yangtze River Delta urban agglomerations such as Shanghai and Hangzhou, and a few cities in the Pearl River Delta urban agglomerations like Zhongshan and Zhuhai. These two urban agglomerations have played a strong joint driving role in attracting talents. Some capital cities (e.g., Xi’an, Chengdu and Guiyang) in central and western provinces are detected as HL cluster areas (pink parts). These cities act as growth poles in these provinces, with adequate development endowments compared to the neighboring cities. The green parts are LH clusters, including those areas surrounding HH clusters like the western and eastern part of Guangdong, the northern part of Juangsu and the southern part of Zhejiang. LL clusters are covered by blue parts. The results show that the northeast regions, the western regions and parts of the central regions of China are less attracted to highly educated talents.

Fig. 2
figure 2

LISA map of talent distribution pattern

Determinants and Spatial Effects of Highly Educated Talents

Variable Selection Strategy

The dependent variable of this study is the number of highly educated talents in each city in 2015, while the independent variables mainly include economic opportunity variables, amenity variables, and control variables (Table 1). In terms of economic opportunity variables, regional GDP (GDP) is the representative of regional economic development level and the future development prospect of a region, which many consider the main economic driver affecting the distribution of talents. In addition, average wage (WAGE) and unemployment rate (UNEMP) of a region are also important factors affecting the settlement of talents. With the development of the economy, the labor force will shift from primary and secondary industries to tertiary industry, thus industrial structure (INDUSTRY) may also exert a certain impact on talents. In summary, we selected the above five variables to represent economic opportunities.

Table 1 Description of variables and expected effect

According to Gleaser et al.’s (Glaeser et al. 2001) definition, amenities include four parts: a good natural environment, a supply of public services, a high speed of transportation and communication, and a variety of consumer facilities. With the improvement of people’s living standards, talents are paying increasing attention to the comfort of the natural environment. The following formula gives the comfort index of each city:

$$ I=T-0.55\left(1- RH\right)\left(T-58\right) $$
(5)

where I denotes the comfort index, T is the temperature (Fahrenheit), and RH is the relative humidity (%). Bai et al. (2009) further divided the human comfort level into nine levels, and they considered 65 as the absolute comfort value. We constructed a continuous comfort index, calculating the distance from each city’s comfort index to the absolute comfort value as a quantitative indicator of urban comfort. The smaller the distance, the more comfortable this city is.

Finally, we selected four indicators to evaluate the comfort of the natural environment, including air quality (SO2), water quality (WATER), urban greening rate (GREENING), and comfort index (AMENITY).

Furthermore, public services include basic services such as medical care and education. In China, public services and their infrastructure provide a good guarantee for talents to work and live. We selected the ratio of per capita science and education expenditure (SEBYEXP), the ratio of per capita fiscal expenditure (EXPBYINC), the number of teachers per 10,000 primary school students (TEACHER), and the number of doctors per capita (DOCTOR) to measure the impact of public services on the distribution of talents.

Glaeser et al. (2001) believed that contemporary urban growth depends on its ability to provide services and consumer goods. We used the number of star-rated hotels (CONSUME) as a proxy variable to measure the ability of a city to provide consumer services. In addition, transportation is an important medium for the mobility of labor. Highways and railways are the commonest means of transportation for talents. We used road accessibility (SPEED) and the number of railway passengers (RAIL) to reflect a city’s traffic development, within which sDNA software calculated road accessibility as follows:

$$ SPEED(x)={\sum}_{y\in Rx}\frac{p(y)}{d\left(x,y\right)} $$
(6)

where p(y) is the weight of node y in the radius R. In continuous space analysis, p(y) ∈ [0,1]; d(x,y) is the shortest topological distance from node x to node y; SPEED(x) is the accessibility index. We set 100 km as the radius.

In addition, some factors have a latent impact on the distribution of talents, including population density (DENSITY), fixed investments (FINVEST), administrative factors (PROVINCE), and talent supply (UNISTU). Thus, we included four control variables in our model.

Model Processing

First, we tested whether there was significant spatial autocorrelation in the model error terms based on a classical OLS regression model. The result shows that the residuals still have strong spatial autocorrelation, which confirms our speculation. The existence of spatial autocorrelation in the error terms violates the assumption of i.i.d. in the model. To solve this problem, it is necessary to use a spatial econometric model considering spatial autocorrelation.

We used GeodaSpace software to support the selection and estimation of spatial econometrics models. First, we added the above 19 explanatory variables to an OLS model, and we tested its spatial spillover effect and possible sources (Table 2). The results showed that there is a spatial dependence effect in the model residual, and the Lagrange Multiplier and Robust LM tests for spatial lag and spatial error effects were both significant, indicating that these two mechanisms may be leading to spatial autocorrelation. In addition, we tested for heteroscedasticity. The results of the Breucsh-Pagan test and the Koenker-Bassett test were significant, indicating that the data contained strong heteroscedasticity. Therefore, we used robust standard errors in estimating coefficients. Finally, we performed a strict multicollinearity test. The results showed that there was no strict multicollinearity problem in the data. According to the above tests, the application of GSM is suitable for our case.

Table 2 Tests for spatial econometrics model

The regression strategy was as follows. Model 2 only included four variables that reflect economic opportunities. Model 3 only included 11 variables that reflect amenities. Model 4 incorporated both economic opportunity variables and amenity variables. We further added control variables to Model 5 at the end of model process. Through this kind of strategy, it is possible to examine the mechanism of economic opportunity and amenity factors to some extent.

Result Analysis

The regression results are in Table 3. In Model 2, the economic opportunity variables (except for UMEMP) had a significant impact on the number of highly educated talents. Specifically, improvement in regional development level and regional wage level, and the optimization of the regional industrial structure both increased the number of highly educated talents. From the perspective of the spatial spillover effect, both the spatial lag effect of the dependent variable (ρ) and the spatial error effect of the model error term (λ) were significant, but the two influences were opposite: an increase in the number of talents in a region resulted in a decrease in talents affected by spatial lag effect, while an increase of talents affected by spatial error effects was also significant. This is because in increasingly fierce talent competition, an increase in the number of talents in a certain city will likely inhibit the number of talents in its surrounding area, resulting in a negative coefficient of spatial lag. However, other factors may lead to a positive impact of talents in one city and its neighbors, such as cross-regional talent policies, positive policy responses of neighboring cities to a certain city that has already promulgated a talent policy, or a cross-city social network of talents.

Table 3 Results of OLS model and GSM models

In Model 3, when we added the amenity variables, we found that public service variables, such as SEBYEXP, EXPBYINC, and TEACHER were not significant. In addition, environment variables such as GREENING, SO2, and AMENITY did not meet our expectations. An increase in the greening rate and the improvement of natural comfort significantly inhibited the number of talents, while an increase in sulfur dioxide emissions promoted the number of talents. This is because, if we do not control for the effects of economic factors, environment variables, especially those variables representing pollution, have close relationships with the level of regional economic development. Unlike public and environment variables, traffic variables such as SPEED and RAIL were positively associated to the number of talents. Furthermore, CONSUME also had a positive impact on talents. As in Model 2, the spatial lag effect was significantly negative, while the spatial error effect was positive, and the coefficient of spatial error term was greater than that of the spatial lag term.

Model 4 considered economic opportunity variables and amenity variables together. We found that the impact of economic opportunity variables was still robustly significant. After controlling for the impacts of economic factors, environment-related variables (e.g., SO2, GREENING, WATER, and AMENITY) did not have significant impacts of the number of talents. However, among public service variables, in line with our expectations, EXPBYINC, TEACHER, and DOCTOR had links with the number of talents. Inconsistent with our expectations, SEBYEXP had a negative correlation with the number of talents. This may be because this variable only represents the degree to which the regional government attaches importance to science and education, but it does not represent the actual amount of science and education expenditure. In addition, the role of SPEED in the region was still significant. In terms of spatial overflow variables, like Model 2 and Model 3, ρ and λ were negative and positive, respectively.

Finally, Model 5 further incorporated relevant control variables, including per capita fixed asset investment, population density, the number of college students, and city administrative level. The results showed that regional gross GDP and average wage still had a significant impact on the number of talents, and their coefficients were relatively high. In addition, one cannot ignore the impact of industrial structure. In terms of amenity factors, basic public services such as education and medical care had significant associations with talents, and road accessibility in a region was positively related to the number of talents. After adding the control variables, AMENITY and CONSUME did not show significant relationships with talent distribution.

On the other hand, when we added the control variables, the spatial lag term did not pass the significance test any more, while the spatial error term was still robustly significant, and the coefficient reached 0.683. According to Model 5, it seems that the space spillover effect of highly educated talents in Chinese cities comes mainly from the impacts of the spatial error term. To be specific, it comes from influences of several unobserved factors such as cross-regional or country specific policies, policy responses, social network linkages. With a background of talent competition, cities with talent policies will more or less affect the policy-making of their surrounding cities, thus promoting the number of talents in their neighborhoods. In addition, the promotion of cross-city or country specific public service policies (e.g., Internet plus policy) has led to a reduction in barriers to the interactions between cities, leading to the spillover of the spatial distribution of talents (Miguélez et al. 2010). Last but not least, the cross-city social networks talents build also matter. The linkage of talents in neighboring cities can help to offset the risks of integration and employment for talents (Gu et al. 2018; Miguélez et al. 2010).

In general, the economic opportunity variables were significant in Model 2, Model 4, and Model 5, and the coefficients were relatively large, showing a certain degree of robustness. In terms of amenity variables, a considerable number of variables did not meet the expectations in Model 2. After considering the impacts of economic opportunities, the results show that public service variables and traffic variables significantly affected the gathering of talents, whereas there is no evidence showing that regional natural comfort and consumption made a difference. In summary, economic opportunities mainly affected the spatial cluster of talents, although one cannot ignore the impacts of public services and accessibility in amenity factors. The demand for talents of economic conditions was more fundamental, and only after the economic conditions are right will non-movable amenities such as public services become important forces for talents.

Conclusion

We first used GIS spatial analysis methods to explore the spatial distribution pattern of highly educated talents at the prefecture level. Second, we employed a GSM model to discuss the driving roles of economic opportunities and amenity in the distribution of talents, and we discussed the spatial spillover effects of talents. Our conclusions are as follows:

First, the spatial pattern of highly educated Chinese talents is highly concentrated and unbalanced. Cities with higher administrative levels and larger populations have more talents living there. The MC of the density and quantity of highly educated talents in prefecture-level cities in China is significantly positive, indicating that there is a cluster pattern of talents, where the spatial spillover effect is obvious. The result of the LISA analysis shows that the urban agglomerations are HH clusters of talent density, and a few inland growth poles are covered by HL clusters. Cities surrounding to HH clusters are detected as LH clusters, and the southeastern regions, western regions and part of central regions of China are less attracted to Talents.

Second, economic opportunities are still the main forces affecting the cluster patterns of highly educated talents. Economic opportunity variables have significant positive impacts on the number of talents. Among them, wages are the core driving force. The distribution and migration of talents depend on rational decisions involving economic opportunities. In addition, variables like regional development level and industrial structure matter.

Third, when controlling for the influence of economic opportunities, amenity variables (e.g., basic public services and accessibility of transportation) play significant roles in the spatial agglomeration of highly educated talents. Only when the requirements for economic opportunities are right will amenities work. When governments do not consider economic opportunities, the impacts of amenities on talents is not clear. In addition, variables such as natural comfort and consumption ability have no significant effect on talent agglomeration.

Fourth, the spatial spillover effect of China’s highly educated talents mainly comes from the influences of cross-regional or country specific talent, economic development policies, and social network linkages effects of talents. For a city, a talent policy may lead to a positive response from its surrounding cities, which will lead to an increase of talents in these cities. In addition, the promotion of cross-city public services has led to a reduction in barriers for talents to communicate between cities. Cross-city social networks talents build in neighboring cities have similar effects.

Discussion

Generally speaking, the unbalanced patterns of cities’ economic opportunity factors and amenity factors are the main reasons for the uneven distribution of Chinese talents. In recent years, the trend of highly educated talents gathering together is obvious. The southeastern coastal areas have attracted a large percentage of talents because of its developed economic level and the high amenities in these regions. On the one hand, the concentration of talents has had a large effect in these developed regions, and this has strengthened the knowledge spillover effect, which has stimulated the innovation and economic development of these regions (Zeng et al. 2019). On the other hand, despite the agglomeration of talents in developed areas, there is a lack of talents in the central and western regions, which has squeezed the development space of these regions.

Having realized the unbalance spatial distribution of highly educated talents, the central government should optimize this pattern by formulating appropriate talent development policies, and it should improve the efficiency of talent allocation. For highly concentrated areas of talents, we must pay attention to their positive externalities, and maybe it is not appropriate to intervene excessively through policies. However, for the central and western regions, where talent distribution is sparse, it may be necessary to formulate talent policies to give certain economic rewards to young graduates to guarantee their basic living conditions. Local governments should support preferential economic policies, such as returning hometown subsidies, returning to the countryside, and reemployment economic subsidies, and they should encourage migrants working in developed areas to return to their hometowns to work and start businesses, thereby narrowing the regions’ differences in China.

Finally, we must pay more attention to the spatial spillover effect of talents in neighboring regions. Governments should formulate a series of coordinated talent policies to strengthen the cooperation and the joint driving role within the region, considering that adjacent cities and regions may have natural geographical advantages, similar cultural genes, and adjacent social networks. The application of integrated regional talent policies is conducive to stimulating the coordinated development of regions.

Our paper still has the following shortcomings. First, the empirical study used survey data, and there is a problem with missing variables. In particular, there are some key factors we cannot include in the variables of amenities that may affect the concentration of talents. Second, this paper only builds a cross-sectional model based on the 2015 census data, and it lacks panel empirical research. Third, regarding the definition of talents, maybe highly educated talents cannot fully represent talent groups. The extent to which our research results can explain the distribution of talents needs further consideration. In the future, we will carry out research on the following aspects. We should further enrich talent data sources through field research or big data platforms to improve the accuracy of our results.