Introduction

Land is a precious resource. Pressures on land use have inevitably intensified with the land being a fixed resource and with a growing population (Maria et al. 2016; Yang et al. 2018a, b, c; He et al. 2019). In order to adapt to the new norms of economic development, optimize the land supply structure, and ensure the rational and healthy development of urban and rural construction lands, China has successively issued a series of related policy documents. According to the “Guidance on Land Conservation and Intensive Use” released by the Ministry of Land and Resources, China, in 2014, transformation and optimization of land use patterns and structures should promote transformation and optimization of economic development. Based on coordination of urban space, scale, and industry proposed by Chinese government in 2015, urban land supply structure and industrial structure evolution should match each other. According to the “National Overall Land Use Plan (2006~2020)” released by the Ministry of Land and Resources in 2016, Beijing-Tianjin-Hebei Urban Agglomeration should coordinate industrial land allocation and control urban-industrial land sprawl.

According to the “Code for Classification of Urban Land Use and Planning Standards of Development Land (GB 50137–2011)” (Ministry of Housing and Urban-Rural Development of China 2011), urban construction land can be subdivided into multiple categories. Research on specific urban construction land prediction is increasing gradually. Nevertheless, a comprehensive study on the prediction of total urban construction land and specific urban construction land is yet to be carried out. Moreover, specific urban construction land prediction not only emerges late but also the research methods and contents need to be improved. Urban-industrial land is the main type of urban construction land (Lin 2009; Xiao 2012; Shu and Xiong 2019) and is also the main space for urban non-agricultural activities (Zhao 2012; Xie et al. 2014; Yong et al. 2018). As an expected indicator in land use planning, urban-industrial land is used to provide the judging criteria for construction land scale in the planning period when urban population, industrial development, investment scale, and other conditions are uncertain (Qiu et al. 2015). Regulation zone refers to further arrangements of land use, population distribution, public facilities, and urban infrastructure allocation in local areas based on the overall urban planning (Panigrahi and Mohanty 2012). After the completion of the urban master plan, large- and medium-sized cities can prepare regulation zoning plans according to their needs. The regulation zone should be carried out simultaneously within the urban area. The zone should be coordinated in time during the preparation process (Pignatti et al. 2016).

Land demand prediction is a complex decision-making process with many random and disorderly factors (Bartoli et al. 2016; Hermanns et al. 2017; Yang et al. 2018a, b, c). Agricultural land allocation involves assessing land potential and land demand for various crops to determine the best land unit for each type. The current allocation model tends to focus on the perspective of land demand growth (Pilehforooshha et al. 2014). In order to achieve the above objectives, scholars used Markov chain (Aurbacher and Dabbert 2011), GIS raster analysis (Pilehforooshha et al. 2014), cellular automata (Kelly et al. 2018; Yang et al. 2019), goal programming (António et al. 2018), fuzzy rule–based system (Reshmidevi et al. 2009), etc.

The rapid urbanization makes urban construction land demand increasing, so it is of great significance to predict urban construction land growth (Kun et al. 2018; Yang et al. 2018a, b, c). The prediction of urban construction land growth is the basic work of urban overall land use plan. At present, research on urban construction land growth forecast can be divided into two types: (1) simulation of land temporal and spatial sprawl and (2) prediction of land scale. As early as the 1960s, urban models were widely used in urban development research, and with the support of new technologies, it was gradually possible to simulate the future development of cities. However, it was not until the CA (cellular automata) model appeared that the two-dimensional spatial feature of urban sprawl was realized. The prediction research mainly depends on spatial data and is constrained by the expansion rules. The scale prediction of urban construction land growth is based on different prediction indexes and mathematical models. The main predictors are population and socioeconomic indicators, such as non-agricultural population, total fixed assets, and GDP (Hao et al. 2014). In addition, urban construction land growth is realized by the occupation of some or most of the cultivated land, and some scholars predict urban construction land growth in respect of conversion between cultivated land and urban construction land (Zhai et al. 2015; He et al. 2018). In sum, urban construction land growth is a nonlinear and complex process influenced by various factors, and some rules, indicators, or models are needed in the prediction process, although the prediction is uncertain (Yang et al. 2019). Therefore, it provides a scientific reference for the study of land scale demand in urban development.

Recently, the actual urban-industrial land area of some cities in Beijing-Tianjin-Hebei Urban Agglomeration has exceeded the expected indicators in the overall land use plan (Li et al. 2019b). The expected indicators in the overall land use plan are indicators that should be achieved during the planning period according to economic and social development forecasts, such as total size of construction land (Li et al. 2019a). Research on expected indicator such as urban-industrial land in overall land use planning mainly focuses on qualitative analysis; quantitative analysis research has not yet been carried out. In this paper, the prediction method of mathematical models (neural network model and gray model) is used to predict urban-industrial land demand of cities in Beijing-Tianjin-Hebei Urban Agglomeration. Using MATLAB R2016a software modeling tools to establish GM (1, 1) model and RBF neural network model, respectively, the demand of urban-industrial land in Beijing-Tianjin-Hebei Urban Agglomeration is predicted. After testing the accuracy of the two prediction models and comparing the predicated results with the actual value of urban-industrial land in Beijing, Tianjin, and 11 prefecture-level cities in Hebei Province, the reasonable prediction model for urban-industrial land is determined. Based on the results of prediction, urban-industrial land regulation zones are divided. The policy implications for urban-industrial land in each regulation zone are provided.

Study area and methods

Study area

Beijing-Tianjin-Hebei Urban Agglomeration, covering a surface area of 217,158 km2, is an important core area of northern China, including two municipalities directly under the Central Government of Beijing and Tianjin, and 11 prefecture-level cities of Hebei Province (including Shijiazhuang, Tangshan, Qinhuangdao, Handan, Xingtai, Baoding, Zhangjiakou, Chengde, Cangzhou, Langfang, and Hengshui) (Fig. 1). According to the overall land use plan of Beijing, Tianjin, and Hebei, there are significant differences in the total amount of urban-industrial land of cities in Beijing-Tianjin-Hebei Urban Agglomeration from 2009 to 2015 (Fig. 2). In 2015, the total area of urban-industrial land in Tianjin, Tangshan, and Beijing reached 1974.67 km2, 1747.33 km2, and 1452.01 km2, respectively. Based on land use survey, problems on scattered layout, irrational structure, and low utilization efficiency of urban-industrial land have emerged in many cities of Beijing-Tianjin-Hebei Urban Agglomeration. Unreasonable use of urban-industrial land has become a roadblock for the sustainable land use in the Beijing-Tianjin-Hebei Urban Agglomeration.

Fig. 1
figure 1

Location map of the study area

Fig. 2
figure 2

The area of industrial and mining land of cities in Beijing-Tianjin-Hebei Urban Agglomeration from 2009 to 2015

Methods

The mathematical method mainly includes the neural network model and the gray model. Artificial neural network has become an important way to process information. Using the neural network model and the gray model’s highly nonlinear mapping ability can predict the construction land demand. Due to the high degree of nonlinear mapping capability, the artificial neural network can realize the nonlinear mapping of the dependent variable and the independent variable without the relationship between the dependent variable and the independent variable (Zhang et al. 2018).

Gray relational analysis

In order to measure the correlation degree of various related factors and overcome the defects of traditional multivariate correlation analysis, gray correlation analysis method was used to select factors for urban-industrial land demand prediction (Hao et al. 2014; Li et al. 2018; Fu et al. 2019). This method mainly includes 3 steps (Yin et al. 2018):

  1. 1.

    Data conversion. Convert raw data into dimensionless and comparable data. Assumed reference data were listed as follows:

$$ {x}_0=\left({x}_0(1),{x}_0(2),{x}_0(3),\dots, {x}_0(n)\right) $$

Comparative data were listed as follows:

$$ {x}_i=\left({x}_i(1),{x}_i(2),{x}_i(3),\dots, {x}_i(n)\right),\left(i=1,2,\dots, n\right) $$
  1. 2.

    Calculate correlation coefficient in Eq. (1).

$$ {l}_{oi(t)}=\frac{\Delta _{\mathrm{min}}+\rho {\Delta }_{\mathrm{max}}}{\Delta _{oi(k)}+\rho {\Delta }_{\mathrm{max}}} $$
(1)

where loi(k) is the absolute difference between two data in time t, namely, loi(k) = |x0(t) − xi(t)|(1《in); max and min refers to the maximum and minimum values of the absolute differences for all data, respectively. Because comparison data intersects each other, min = 0 and ρ is the resolution coefficient, it could increase the significance of difference between the relevant data, ρϵ(0, 1); ρ is used to prefer 0.5 in the previous research (Yin et al. 2018).

  1. 3.

    Relation testing. Based on loi(k), relation, which is the average value of correlation coefficients, can be calculated in Eq. (2):

$$ {r}_{oi}=\frac{1}{N}{\sum}_{k=1}^N{l}_{oi(t)} $$
(2)

where roi is the relation between sub-sequence i and total sequence o. N is the length of the comparative sequence (number of data).

GM (1, 1) prediction model

Based on gray system theory, GM (1, 1) prediction model can use insufficient information to build a model with possible full information by transforming time series into differential equations through differential fitting method (Chen and Pai 2015). GM (1, 1) prediction model could be expressed in Eq. (3):

$$ {\hat{X}}^{(1)}(t)=\left({X}^{(0)}(1)-\frac{u}{a}{e}^{-a\left(t-1\right)}\right)+\frac{u}{a} $$
(3)

where X(0)(1) is the raw data in urban-industrial land demand system. \( {\hat{X}}^{(1)}(t) \) is the accumulated value of raw data in urban-industrial land demand system. t refers to time. a and u are the development gray number and the endogenous control gray number.

RBF neural network prediction model

Compared with the widely used BP neural network model, RBF neural network prediction model is more advanced in terms of learning speed and parameter settings with a single hidden layer feedforward network (Ghritlahre and Prasad 2018). The framework of RBF neural network prediction model is shown in Fig. 3.

Fig. 3
figure 3

Framework of RBF neural network model

Learning process of RBF includes two stages: unsupervised learning and supervised learning. The main calculation process includes three steps.

  1. 1.

    Calculate the output value Yb of the neuron b in the output layer in Eq. (4):

$$ {Y}_a=f\left({\sum}_{a=1}^r{W}_{ba}{Y}_{ha}\right) $$
(4)

where Yha is the output value of the neuron a in the hidden layer. Wba is the weight from the neuron a in the hidden layer to the neuron b in the output layer. We use Sigmoid to express the function in Eq. (5):

$$ f(x)=1/\left(1+\exp \left(-x/{x}_0\right)\right) $$
(5)
  1. 2.

    Calculate error of the output layer in Eq. (6):

$$ \Delta b={Y}_b\cdot \left(1-{Y}_b\right)\cdot \left({y}_b-{Y}_b\right) $$
(6)

where yb is the actual value of the neuron b.

  1. 3.

    Adjust weight coefficient ΔW until the error is minimized. It can be express in Eq. (7):

$$ \Delta W=\varepsilon \cdot \Delta {Y}_b\left(1-{Y}_b\right)\cdot \left({y}_b-{Y}_b\right),{W}_b^{\prime }={W}_b+\Delta W $$
(7)

where \( {W}_b^{\prime } \) is the adjusted weight. ε is the learning rate. ε is selected based on the comparison method. The value of ε ranges between 0.01 and 0.8 (Monteiro et al. 2018). ε is set as 0.03 in this study. The model can be used for prediction when cluster center Ca and weight Wa are defined.

Methods for prediction accuracy testing

The accuracy of the prediction model is verified by the mean absolute error (MAE) and the error root mean square (RMSE). They are expressed in Eqs. (8) and (9):

$$ \mathrm{MAE}=\frac{1}{n}{\sum}_{i=1}^n\left|\hat{Z_i}-{Z}_i\right| $$
(8)
$$ \mathrm{RMSE}=\sqrt{\frac{1}{n}{\sum}_{i=1}^n{\left(\hat{Z_i}-{Z}_i\right)}^2} $$
(9)

where \( \hat{Z_i} \) is the predictive value by the RBF model. Zi is the actual value. n is the total number of samples taken for training and testing. The smaller the value of MAE and RMSE is, the smaller the error is, and the higher the prediction accuracy of the model is.

Data collection and processing

Data collection

Data about urban-industrial land from 2009 to 2015 were collected from the national land utilization conveyance data. Statistical data related to socioeconomic conditions were collected from Beijing Statistical Yearbook (2010~2016) (Beijing Bureau of Statistics 2017), Tianjin Statistical Yearbook (2010~2016) (Tianjin Bureau of Statistics 2017), Hebei Statistical Yearbook (2010~2016) (Hebei Bureau of Statistics 2017), and China Statistical Yearbook for Regional Economy (2010~2016) (NBSC 2017). Expected data of urban-industrial land in the planning target year 2020 were obtained from the overall land use plan (2006–2020) of Beijing, Tianjin, as well as 11 prefecture-level cities in Hebei Province.

Data processing

According to national land utilization conveyance data originally derived from the first national land survey in 1996, urban-industrial land, including urban land, town land, rural residential land, and independent industrial and mining land, is the space that carries the main production and living activities of urban and rural residents (Lin 2009). Urban-industrial land, similar to urban construction land, including urban land, town land, and independent industrial and mining land, is in line with the urban population in terms of statistical caliber (Li et al. 2018). The main factors affecting urban construction land change are economic growth, the secondary and tertiary industry development, and urbanization (Chen et al. 2017). Eleven indicators, including GDP (x1), the secondary industry added value (x2), the tertiary industry added value (x3), total investment in urban fixed assets (x4), real estate development investment (x5), per capita disposable income of urban residents (x6), per capita consumption expenditure of urban residents (x7), highway mileage (x8), number of industrial enterprises above designated size (x9), industrial output (x10), and the total retail sales of social consumer goods (x11), were selected from four respects of economic growth, industrialization, industrial structure adjustment and upgrading, and urbanization.

Each indicator was converted by using the extremum method. The calculation processes were expressed in Eqs. (10) and (11).

$$ x=\frac{\left[{x}_{ij}-\min \left({x}_j\right)\right]}{\left[\max \left({x}_j\right)-\min \left({x}_j\right)\right]} $$
(10)
$$ x=\frac{\left[\max \left({x}_j\right)-{x}_{ij}\right]}{\left[\max \left({x}_j\right)-\min \left({x}_j\right)\right]} $$
(11)

where x is the value of indicator xij processed by the extremum method; xij is the actual value of indicator i in the year j; max (xj) is the maximum actual value of indicator i in the year j; and min (xj) is the minimum actual value of indicator i in the year j.

GM (1, 1) model was established by using the MATLAB R2016a software modeling tool. According to the principle of RBF neural network model, using Newrb toolbox in the MATLAB R2016a software, RBF neural network model, where the SPREAD value in the Newrb function was chosen to be the default value 1 (Morteza et al. 2018), was established by choosing 11 influencing factors as input samples and urban-industrial land demand as output samples. Taking actual area of urban-industrial land from 2009 to 2014 and its influencing factors as training samples, the area of urban-industrial land from 2010 to 2015 was simulated. The simulation results showed that RBF neural network model, of which the output could well approximate the nonlinear function Yj after repeated tests, was provided with an excellent fitting ability. Urban-industrial land data from 2010 to 2015 of cities in Beijing-Tianjin-Hebei Urban Agglomeration were imported from the GM (1, 1) model and RBF neural network model, respectively. The conditions for the termination of training in the model is set in terms of the actual situation (Morteza et al. 2018). The maximum number of training cycles is 5000 in this study.

Results and discussion

Gray relational grade of the indicators

Gray relational grades of the indicators were conducted by using gray relation system software developed by Liu et al. (2004). The results of gray relational grades showed that the correlation value of each influencing factor, which is greater than 0.7, met three-level precision (Table 1). Therefore, there was a relatively strong correlation between the 11 factors and the demand for urban-industrial land, namely, the 11 factors were applicable to the demand prediction of urban-industrial land in Beijing-Tianjin-Hebei Urban Agglomeration.

Table 1 The gray correlation degree of urban-industrial land and driving factors of cities in Beijing-Tianjin-Hebei Urban Agglomeration

Demand prediction of urban-industrial land

Results of the GM (1,1) prediction model

The prediction results of urban-industrial land area from 2009 to 2015 by using the GM (1, 1) model are shown in Table 2.

Table 2 GM (1, 1) model prediction results

Results of RBF neural network prediction model

The prediction results of urban-industrial land area from 2009 to 2015 by using the RBF neural network model are shown in Table 3 and Fig. 4.

Table 3 RBF model prediction results
Fig. 4
figure 4

RBF model prediction simulation results. PV: predicted value; Av: actual value; a–m: Beijing, Tianjin, Shijiazhuang, Tangshan, Qinhuangdao, Handan, Xingtai, Baoding, Zhangjiakou, Chengde, Cangzhou, Langfang, and Hengshui

Prediction accuracy testing for the two models

Statistic error results of the two models showed that the RBF neural network model, of which the prediction accuracy was greater than that of the GM (1, 1) model, was more suitable for urban-industrial land demand prediction (Table 4). The RBF neural network model was selected to predict the demand of urban-industrial land. Hao et al. (2014) found that construction land demand could be predicted by using the RBF neural network. Therefore, artificial intelligence approach, including artificial neural network, has been confirmed to be an effective method to predict land resources demand.

Table 4 Statistical errors of the two models

Regulation zoning of urban-industrial land

A city is a unified organism, and the various parts of the function which are used in different functions are reasonably organized (Panigrahi and Mohanty 2012). When starting to plan a city, according to the different characteristics and requirements of the city, comprehensive planning should be carried out for the industrial land, residential land, transportation land, etc. However, a series of problems will emerge in cities without a regulation zone of urban land. Li et al. (2019a) proposed that urban-industrial land use efficiency of Beijing-Tianjin-Hebei Urban Agglomeration was relatively low. Therefore, zoning is necessary in urban-industrial land in Beijing-Tianjin-Hebei Urban Agglomeration.

Based on the established RBF neural network model, 13 cities in Beijing-Tianjin-Hebei Urban Agglomeration have been trained and predicted with different models; the urban-industrial land demand of the 13 cities in 2020 was predicted. The comparison of the predicting results and expected indicators of urban-industrial land in the overall land use plan of cities in Beijing-Tianjin-Hebei Urban Agglomeration in 2020 showed that the expected indicator of urban-industrial land is approximately equal to the predicted value by the RBF neural network model. Those indicated that the RBF neural network model was available (Table 5). The predicted values of Tianjin, Tangshan, Cangzhou, Zhangjiakou, and Chengde were obviously different from the expected values, which reflected the deficiency of the overall land use plan.

Table 5 Comparisons between the predicated demand of urban-industrial land and the expected indicators of urban-industrial land in the overall land use plan in 2020

Combined with the actual scale of urban-industrial land in 2015 and the predicated scale of urban-industrial land in 2020, the remaining usable time of each city’s urban-industrial land was calculated in terms of the average annual growth rate of urban-industrial land from 2009 to 2015. According to the comparative relationship between the remaining usable time and the remaining time of the overall land use plan (5 years), the ArcGIS natural breakpoint method was used to divide regulation zones of urban-industrial land. Three regulation zones were obtained (Fig. 5):

  1. (1)

    Reasonable reduction zone (0.55 year ≤ remaining usable time ≤ 2.68 years). This zone covers Tianjin, Tangshan, Zhangjiakou, and Chengde;

  2. (2)

    Optimized adjustment zone (2.68 years ≤ remaining usable time ≤ 4.36 years). This zone covers Beijing, Cangzhou, and Hengshui;

  3. (3)

    Core development zone (4.36 years ≤ remaining usable time ≤ 6.22 years). This zone covers Shijiazhuang, Baoding, Xingtai, Langfang, Handan, and Qinhuangdao.

Fig. 5
figure 5

Regulation zones of urban-industrial land of Beijing-Tianjin-Hebei Urban Agglomeration

Policy implications

The policy implications for urban-industrial land in the three regulation zones are put forward as follows:

  1. (1)

    Reasonable reduction zone. Covering Tianjin, Tangshan, Zhangjiakou, and Chengde. This zone, of which the actual scale of urban-industrial land has exceeded the expected target of the overall land use plan, should implement measures for reduction of urban-industrial land. Tianjin and Tangshan are important industrial cities in China, and they are also one of the important carrying places for the industrial transfer of Beijing-Tianjin-Hebei cooperative development. The proportion of industrial land among urban-industrial land is relatively high. It is urgent to promote the internal potential of urban-industrial land and improve the level of intensive utilization of urban-industrial land (Li et al. 2019a). Although the intensity of urban-industrial land development in Zhangjiakou and Chengde is not high, it is very important to have a large number of ecological functions in these regions. With frequent ecological vulnerability and high geological hazards, these regions are not suitable for large-scale development and construction of urban-industrial land.

  2. (2)

    Optimized adjustment zone. Covering Beijing, Cangzhou, and Hengshui. This zone, of which the surplus scale of urban-industrial land is relatively insufficient, should activate the existing urban-industrial land, promote the adjustment of industrial structure, and carry out land transformation of old towns, old villages, and old factories with the historical opportunities of industrial transfer and non-capital functions in the coordinated development of Beijing, Tianjin, and Hebei. As the main industry transfer place of Beijing-Tianjin-Hebei coordinated development, Beijing should activate the stock of construction land, reduce the proportion of industrial and storage land, and optimize land distribution with the requirements of “orderly relaxing the non-capital functions of Beijing, optimizing and upgrading the core functions of the capital.” Formulate policies to guide land withdrawal for general industries, especially “two high” industries (high pollution, high energy consumption), labor-intensive industries, partial public service functions, partial service agencies, and enterprise headquarters.

  3. (3)

    Core development zone. Covering Shijiazhuang, Baoding, Xingtai, Langfang, Handan, and Qinhuangdao. The expanding space of urban-industrial land of this zone is relatively larger. At present, the intensity of development is moderate, the resources and environment are relatively low, and there is some suitable space for construction. Meanwhile, according to the industrial belt, the industrial chain, and the specific industrial undertaking platform proposed by the guidelines on industrial transfer among Beijing, Tianjin, and Hebei and the opinions on strengthening the construction of key platforms for industrial transfer among Beijing, Tianjin, and Hebei, the region is along the development axis of Beijing and Tianjin, development axis of Beijing-Bao-Shi, and the development axis of Beijing-Tang-Qin, and is the main region to undertake the industries transfer of Beijing-Tianjin-Hebei. Owing potential for future development and should guide the rational agglomeration of industries, the scale of urban-industrial land should be moderately increased in the region.

Because the spatial data of urban-industrial land of the study area are not available, this paper only predicts urban-industrial land in respect of quantitative scale. It could be studied in respect of spatial scale in the future. The spatial distribution of urban-industrial land could be predicted with the quantitative scale, so as to further improve its effectiveness.

Conclusions

Using the MATLAB R2016a software modeling tools to establish the GM (1, 1) model and RBF neural network model, respectively, this paper predicted the demand of urban-industrial land in Beijing-Tianjin-Hebei Urban Agglomeration. Comparing the predicted results with the actual value of urban-industrial land in Beijing, Tianjin, and 11 prefecture-level cities in Hebei Province, we determined the reasonable prediction model for urban-industrial land after testing the accuracy of the two prediction models. The results showed that RBF neural network model was the more reasonable prediction model for urban-industrial land. Using the predicted results of the RBF neural network model, combining expected indicators of “Overall Land Use Plan (2006–2020)” in Beijing and Tianjin, as well as in 11 prefecture-level cities in Hebei Province in the planning target year, determined remaining usable time of urban-industrial land. Finally, combined with the actual scale of urban-industrial land in 2015 and the predicated scale of urban-industrial land in 2020, the remaining usable time of each city’s urban-industrial land was calculated in terms of the average annual growth rate of urban-industrial land from 2009 to 2015. According to the comparative relationship between the remaining usable time and the remaining time of the overall land use plan (5 years), urban-industrial lands were divided into three kinds of regulation zones: reasonable reduction zone, optimized adjustment zone, and core development zone. This paper can provide reference for regulation zoning of urban-industrial land in developing countries and regions.