Keywords

1 Introduction

With the rise of northeast Asia economic circle, Dalian port logistics will meet greater development prospects and challenges. Port logistics demand is an important indicator in the port logistics system. Accurate prediction of port logistics demand will provide an important basis for the development of port logistics and logistics infrastructure planning. However, port logistics is a complex nonlinear system, which is influenced by social, economic, natural and other factors, and the mapping relationship between these factors cannot be described in an accurate mathematical language, which leads to the prediction difficulty.

At present, the methods used in logistics demand forecasting can be divided into two categories: qualitative forecasting and quantitative forecasting. Qualitative prediction methods mainly include expert investigation method, Delphi method, subjective probability method and so on. Qualitative forecasting method is more flexible and simpler, but it is difficult to accurately describe the logistics demand due to the influence of subjective factors. Quantitative prediction methods mainly include moving average method, regression analysis method, exponential smoothing method, grey theory model, neural network prediction model and so on. Using the GM (1, 1) model to predict the port logistics demand of Guangxi Beibu gulf, providing decision-making basis for relevant government departments [1]. Establishing the BP neural network model to predict the port logistics demand of Cao Feidian port and put forward policy Suggestions for the future development of modern logistics [2]. Considering the characteristic of logistics demand with nonlinear changes, proposing a combined forecasting model based on BP and RBF neural network, the empirical results show that the combination forecast model than single prediction model has higher prediction accuracy, reducing the probability of error effectively in larger, making the forecast results closer to reality, and propose the port logistics development planning for the future [3]. Neural network can be used to solve nonlinear and complicated prediction problems, but it has some disadvantages such as large amount of data required for training, high cost of data collection, and easy occurrence of local optima. Although the grey model requires only a small amount of data, the prediction accuracy is not high. The Grey Neural Network (GNN) model makes up for the above defects to some extent, combining the advantages of strong nonlinear fitting ability of Neural Network and high accuracy of Grey Model under the condition of small amount of calculation and few samples. The Grey BP Neural Network model has higher prediction accuracy than the single grey prediction model [4, 5]. The grey neural network model could help enterprises predict the market demand better after transportation interruption, and then empirical research tested its possibility [6]. However, due to the randomness of the determination of initial weight and threshold of GNN, the network is prone to fall into the local optimal, the results of each prediction are different and the deviation is large. Particle Swarm Optimization (PSO) is an optimization algorithm based on swarm intelligence theory, which has good robustness and global search ability. Optimization of GNN weights and thresholds by PSO can make up for the above shortcomings. Particle swarm optimization grey neural network model to predict proton exchange membrane fuel cell (PEMFC) degradation, and the results showed that the prediction accuracy of this method was high [7].

Because the port logistics demand data is relatively small and has strong non-linear relationship, it is very important to establish a suitable model to predict the future port logistics demand, according to the port logistics demand data. GNN has the advantages of strong non-linear fitting ability, small calculation amount and high calculation accuracy for small sample data. PSO has good robustness and global search ability, which is fully suitable for port logistics demand prediction. However, the standard PSO is easy to fall into local minimum point and the problem such as premature convergence, so this article on the basis of the above study, the improved PSO algorithm (IPSO) optimizes geri weis-corbley weights and thresholds by the weights of the nonlinear regressive strategy and strategy of dynamic accelerated learning factor, adjusting and balancing between global and local search capabilities, building the improved particle swarm optimization of grey neural network forecast model, and applied to port logistics demand forecasting in order to improve the prediction accuracy.

2 Prediction Model Based on IPSO-GNN

2.1 Gray Neural Network

The principle of GNN is to map the whitening equation of Grey Model (GM) into a BP neural network. When the network is trained and converges, the corresponding connection weight coefficient \(a, b_{1} , b_{2} , \ldots , b_{n - 1}\) is extracted from the trained network to obtain a whitening differential equation, and then the data fitting and prediction are performed according to this differential equation [8, 9]. The topology of the gray neural network is shown in Fig. 1.

Fig. 1
figure 1

The topology of the grey neural network

In the figure, t is the serial number of the input parameter, \(y_{2} \left( t \right), y_{3} \left( t \right), \ldots ,y_{n} \left( t \right)\) are the network input parameter, \(\omega_{21} , \ldots ,\omega_{3n}\) are the network weight, \(\omega_{11} = a,\omega_{21} = - y_{1} \left( 0 \right)\text{,}\omega_{22} = \frac{{2b_{1} }}{a}\text{,} \ldots \text{,}\omega_{2n} = \frac{{2b_{n - 1} }}{a}\text{,}\omega_{31} = \omega_{32} = \cdots = \omega_{3n} = 1 + e^{ - at} ,y_{1}\) is the network prediction value, and \(\theta = \left( {1 + e^{ - at} } \right)\left( {d - y_{1} \left( 0 \right)} \right)\) is the output node threshold of LD layer. The error criterion of the algorithm adjusts the weight to minimize the deviations. Therefore, it is crucial to obtain the initial value of \(a, b_{1} , b_{2} , \ldots , b_{n - 1}\) of GNN.

2.2 Improved Particle Swarm Optimization Algorithm

The basic idea of PSO [10, 11] is to use the information contained in each particle to express the optimal solution of the optimization problem. The important parameters affect the performance of PSO including flight speed, inertia weight and learning factor. Therefore, optimizing the inertia weight \(\omega\) and learning factor can improve the performance of the algorithm. Inertia weight \(\omega\) is used to control the exploration and convergence ability of the algorithm. When the \(\omega\) is larger, it has stronger global search ability, it is more conducive to local search when \(\omega\) is smaller. Learning factor is used to guide the algorithm to search for the optimal solution. In the early stage of optimization, particles should be encouraged to search in a larger space to keep the diversity of particles. Therefore, \(c_{1}\) is larger and \(c_{2}\) is smaller to speed up the search speed of particle swarm. In the later stage of optimization, the center of gravity of particle swarm is in the global optimal solution, so \(c_{1}\) is smaller and \(c_{2}\) is larger to keep particle swarm accurate search. Based on the algorithms proposed in literature [12,13,14], this paper firstly adopts an improved weight nonlinear descending strategy for setting the inertial weight \(\omega\), so that the PSO algorithm can better balance the global and local search capabilities. Secondly, adopting the learning factor strategy of dynamic acceleration, \(c_{1}\) and \(c_{2}\) change linearly with the number of iterations. The improved particle swarm optimization algorithm is shown as follows:

  • In this paper, inertial weight \(\omega\) can better balance and adjust the local and global search capability of PSO algorithm by introducing tangent function. The calculation formula is as follows:

    $$\omega = \omega_{\text{min} } + \left( {\omega_{\text{max} } - \omega_{\text{min} } } \right)*\tan \left( {\frac{\pi }{4}*\left( {1 - \frac{t}{{T_{\text{max} } }}} \right)^{\theta } } \right)$$
    (1)
  • The learning factor \(c_{1}\) and \(c_{2}\) of dynamic acceleration in this paper are between the minimum value and the maximum value all the time, so it is easy to avoid \(c_{1}\) and \(c_{2}\) falling into the local optimum if the learning factor is too large or too small. The calculation formula is as follows:

    $$c_{1} \left( t \right) = c_{1\hbox{max} } - \left( {c_{1\hbox{max} } - c_{1\hbox{min} } } \right)*\frac{t}{{T_{\text{max} } }}$$
    (2)
    $${\text{c}}_{2} \left( {\text{t}} \right) = {\text{c}}_{{2{ \hbox{min} }}} + \left( {{\text{c}}_{{2{ \hbox{max} }}} - {\text{c}}_{{2{ \hbox{min} }}} } \right) *\frac{\text{t}}{{{\text{T}}_{ \hbox{max} } }}$$
    (3)

Among them, \(t\) is the current iteration number, \(T_{\text{max} }\) is the maximum iteration number, \(\theta\) is the curve adjustment factor, and \(\theta\) is 2 in this algorithm after several experiments. In general, \(\omega_{\text{max} } = 1\), \(\omega_{\text{min} } = 0.5\), the values of \(c_{1\hbox{min} }\) and \(c_{2\hbox{min} }\) are 1, and the values of \(c_{1\hbox{max} }\) and \(c_{2\hbox{max} }\) are 2.

The updated formula of particle velocity and position of the improved particle swarm optimization algorithm are as follows:

$$v_{id} \left( {t + 1} \right) = v_{id} \left( t \right) + c_{1} \left( t \right)r_{1} \left( {p_{id} - x_{id} \left( t \right)} \right) + c_{2} \left( t \right)r_{2} \left( {p_{gd} - x_{id} \left( t \right)} \right)$$
(4)
$$x_{id} \left( {t + 1} \right) = x_{id} \left( t \right) + v_{id} \left( {t + 1} \right)$$
(5)

Among them, \(r_{1}\) and \(r_{2}\) is the random number [0, 1], \(x_{id}\) is the current position of the particle, \(p_{id}\) is the optimal solution of the particle history, and \(p_{gd}\) is the global optimal solution.

2.3 Improved Particle Swarm Optimization Algorithm to Optimize Grey Neural Network

GNN has a fast convergence speed, but its initial weights and thresholds are determined randomly, so that the network is prone to local optimization [14]. Therefore, Improved Particle Swarm Optimization algorithm can obtain the optimal GNN parameters. IPSO-GNN algorithm uses IPSO algorithm to optimize the \(a, b_{1} , b_{2} , \ldots , b_{n - 1}\) parameters of GNN, and takes the mean square error of training samples as the function of individual adaptive value. The global optimal solution as the initial weight and threshold of GNN to train the network, so that achieve the training goal of the network. The specific algorithm steps are as follows:

  1. (1)

    Divide the original data into training samples and test samples;

  2. (2)

    Initialization parameters: initialize the maximum and minimum value of the learning factor of the particle, the maximum and minimum value of the inertial weight, the position and speed of the particle, and assign a set of parameters corresponding to each position the initial particle passes through. Since the initial weight and threshold of GNN are determined by equal n parameters, the dimension of the particle swarm is D = n.

  3. (3)

    Set the fitness function of particle swarm, prediction mean square error (MSE) of the training sample as the function of individual adaptive value. Because there is only one output neuron of GNN, formula can be simplified as: \(MSE = \frac{1}{n}\sum\nolimits_{i = 1}^{n} {\left( {o_{i} - y_{i} } \right)}^{2}\), among them, n as the number of training samples, \(o_{i}\) is the actual output of the \({\text{i}}\) th sample, \(y_{i}\) is the expected output of \({\text{i}}\) th sample;

  4. (4)

    Compare and analyse the fitness value of each particle and its corresponding optimal value, and then judge whether satisfying the condition of iteration. If so, these are the optimal parameter combination. Go to step (6); otherwise, go to step (5);

  5. (5)

    According to formula (4) and (5), the particle speed and position are repeatedly update, and judge if meet the optimal solution conditions. When meeting the conditions that the minimum precision value of fitness or the maximum number of iterations, go to step (6); otherwise, go to step (5);

  6. (6)

    Obtain the optimal parameters;

  7. (7)

    Substitute the optimal parameters as the initial weight and threshold of GNN into the network for training until the training error (or iteration times) of the network reaches the predetermined value. The IPSO-GNN algorithm flow shows in Fig. 2.

    Fig. 2
    figure 2

    The IPSO-GNN algorithm flow

3 Selection of Logistics Demand Forecast Index of Dalian Port

3.1 Influencing Factors Analysis of Dalian Port Logistics Demand Forecast

Zhu et al. Select four indicators that are the added value of the primary industry, the added value of the secondary industry, the total amount of import and export, and the investment in fixed assets of the whole society as the indicators to predict the port cargo logistics demand [1]. Gao et al. select the port city GDP, industrial GDP, tertiary industry GDP, total foreign trade, total retail sales, per capita income, and per capita consumption level as the indicators affecting the port logistics demand of Feidian [2]. Cai and Huang Select the three industrial values, total imports and exports, total retail sales of consumer goods and fixed asset investment in the direct economic hinterland as the influencing factors of the logistics demand of Shantou port [3]. Basing on relevant literature, because the port logistics demand amount is closely related to the hinterland economic aggregate, which can predict port logistics demand rely on the correlation between the two. combined with the actual situation of Dalian port logistics development, this paper selects the total regional GDP (\(a_{1}\)), added value of primary industry (\(a_{2}\)), added value of secondary industry (\(a_{3}\)), added value of tertiary industry (\(a_{4}\)), fixed asset investment of the whole society(\(a_{5}\)), total retail sales of consumer goods (\(a_{6}\)), total imports and exports (\(a_{7}\)), annual disposable income (\(a_{8}\)), and annual per capita consumption expenditure (\(a_{9}\)) nine indicators to predict the cargo logistics demand in Dalian Port. The statistical data of logistics demand impact indicators of Dalian port in 2001–2018 is shown in Table 1. Among them, the unit of \(a_{1}\), \(a_{3}\), \(a_{4}\) and \(a_{6}\) are RMB 100 million, the unit of \(a_{5}\) and \(a_{7}\) are dollars 100 million, the unit of \(a_{8}\) and \(a_{9}\) are RMB, the unit of \(a_{0}\) is 100 million tons.

Table 1 Statistics of logistics demand impact indicators of Dalian port in 2001–2018

3.2 Analysis and Selection of Logistics Demand Index of Dalian Port

Grey relation analysis is based on the grey system theory proposed by Professor Julong Deng. Processing the various factors through data in incomplete information to find the correlation degree among them [15,16,17]. Introducing grey correlation analysis method is to further determine the correlation among the logistics demand of Dalian port and various impact indicators. The statistical data of the logistics demand impact indicators of Dalian port is shown in Table 1. It is necessary to select the indicators that have a significant impact on the logistics demand of Dalian port, namely the key indicators (the correlation degree is greater than 0.6). The correlation degree among each indicator and the logistics demand of Dalian port is determined by Calculated by DPS software, the specific results are as follows:

$$\begin{aligned} \rho_{a1} & = 0.6597 \rho_{a2} = 0.7207 \rho_{a3} = 0.5616 \\ \rho_{a4} & = 0.6578 \rho_{a5} = 0.3838 \\ \end{aligned}$$
$$\begin{aligned} \rho_{a6} & = 0.5823 \rho_{a7} = 0.6475 \rho_{a8} = 0.6406 \\ \rho_{a9} & = 0.6793 \\ \end{aligned}$$

The larger the correlation value, the greater the impact of this index on the logistics demand of Dalian port is. By sorting the above correlation values, we can get: the values of \(a_{2} , a_{4} , a_{9} ,a_{1} ,a_{7} \,\,{\text{and}}\,\, a_{8}\) six variables are relatively large, that is to say, the added value of the primary industry, the added value of the tertiary industry, the annual per capita consumption expenditure, the GDP of the whole region, the total amount of import and export, and the annual disposable income six indicators are selected as the key indicators. The logistics demand forecast index set of Dalian port is shown in Table 2.

Table 2 Dalian port logistics demand forecast index set

4 Logistics Demand Prediction Dalian Port Based on IPSO-GNN

4.1 Establish a Prediction Model Based on IPSO-GNN

Take the data in Table 1 from 2001 to 2015 as the network training data, and the data in 2016–2018 as the network test data. In order to prevent the net input absolute value too large cause the saturation of neuron output, so that reduce the convergence of training network, it is necessary to make Dalian port logistics demand indicators statistics normalization. IPSO-GNN model sets six input variables and one output variable, and network hidden layer adopts multi-layer deep learning mode.

4.2 Model Prediction Results and Analysis

Empirical research adopts the gradual model analysis method, including BP neural network model, GM (1, 1) model, GNN model, standard particle swarm optimization grey neural network (PSO-GNN) model and IPSO-GNN model to predict the logistics demand of Dalian port. Using matlabr2012b programming software realize the above algorithm. The calculation results of prediction error and mean square error of each prediction model are shown in Table 3, in which the unit of actual values and prediction values are 100 million yuan.

Table 3 Comparison of simulation results

From the Table 3, the following conclusions can be drawn:

  • From the comparison of real and predicted values in 2016–2018, the mean error of prediction value under BP neural network model, grey GM (1,1) model, GNN model, PSO-GNN model and IPSO-GNN model are 11.12%, 24.9%, 5%, 2.96% and 1.71% respectively, the mean square errors are 0.2622, 1.4138, 0.0582, 0.0184 and 0.0076. The IPSO-GNN prediction model proposed in this paper has the smallest deviation and the highest accuracy from the original data, which is suitable for the prediction of port logistics demand.

  • Although there are random fluctuations in the original port logistics demand, the mean error and mean square error of the model in this paper are the smallest. With the increase of the forecast year, the relative error of the forecast results than other forecast models have volatility, while the relative error of the forecast results of the algorithm in this paper decreases year by year, with high stability. The IPSO-GNN model prediction results are in good agreement with the original port logistics demand data, which proves the stability of the algorithm.

5 Conclusions

Based on the study of particle swarm optimization and grey neural network, combined with the characteristics of port logistics demand forecasting, this paper applies innovatively the improved particle swarm optimization grey neural network model to port logistics demand forecasting. The improved particle swarm algorithm optimizes the weight and threshold value of the grey neural network to improve the prediction accuracy, which effectively solves the problems that randomness given by the initial parameters of the grey neural network and easy to fall into local optimum. This present paper compares with BP neural network model, GM (1, 1) model, GNN model, PSO-GNN model, IPSO-GNN model in Dalian port logistics demand prediction results, shows that IPSO-GNN prediction model can significantly improve the prediction accuracy of port logistics demand, and the model stability is stronger.