1 Introduction

The silicon content of the molten iron, which is the final product of the blast furnace, not only reflects the quality of the molten iron, but also characterizes the thermal state of the blast furnace. Therefore, the accurate prediction of the silicon content of the molten iron is very important for the operation of the blast furnace [1]. However, the blast furnace smelting system involves a complex set of heat transfer and chemical reactions, with internal momentum, heat, and mass transfer coupling. During smelting, blast furnace gases rise and fall; the internal phase transformation of gas–solid–liquid phase-oriented fluidity of the blast furnace constantly evolves in an extremely complex process with nonlinear, strong coupling and large time delay characteristics. Relying solely on mechanistic analysis and mathematical models of the smelting process makes continuous and accurate prediction of the molten iron silicon content difficult [2,3,4]. Therefore, effectively using technological innovation-driven ironmaking blast furnaces and achieve continuous prediction of the molten iron silicon content is an increasingly important requirement for intelligent manufacturing in the steel industry.

Large-scale data have been linked, integrated, and fused, providing renewed impetus and opportunities for collaborative development in various industries. Given the rapid technological changes and enormous data availability, exploring fusion data techniques to support complex decision-making and provide deep insight through data mining has become key support and core competitive advantage [5]. Global science and technology development and national strategic needs have focused increasingly on big-data management. Moreover, the metallurgical industry in China is facing problems under the “new normal” of the Chinese economy. Consequently, the harsh market conditions have forced the steel industry to vigorously develop intelligent metallurgical manufacturing technologies with data at their core.

China’s steel industry generates multi-source multimodal data, accurately portraying the production process. Most data are discarded as they are considered “invalid” [6], but these data can be rich in information, making it important to combine different modal data sources to conduct deep data mining and fusion modeling [7]. Existing blast furnace big-data research uses artificial intelligence algorithms to model complex processes, ignoring complex process mechanisms and using data-driven technology to determine quantitative relationships between variables. Based on the general rules of blast furnace smelting process mechanism and transient process, a model was designed to predict the silicon content of iron 10 h ahead of time based on the specific blast furnace boundary conditions, taking into account the iron ore raw material and coke quality. The root-mean-square deviation between the obtained predicted iron silicon content and the actual measured value is 0.09% [8]. In Ref. [9], a new multi-input multi-output (MIMO) Takagi–Sugeno (T-S) fuzzy model was proposed using an output transfer matrix. By explicitly modeling inter-indicator correlations through low-rank learning of the correlation matrix, the great challenge of jointly determining the MIMO T-S model and the fuzzy rules for inter-indicator correlations is overcome. And a better prediction result for the silicon content of molten iron was achieved even with a sample missing rate of 90%. In order to overcome the difficulties caused by the nonlinear time-varying dynamics of the process and to enable random vector functional-link network (RVFLN) to learn online and avoid data saturation, by sequential learning with forgetting factors and applying the Cauchy distribution function to properly evaluate the weights of different data, good modeling accuracy was achieved for outlier contamination rates up to 50% and for anomaly magnitudes up to 5 times the difference between the maximum and minimum values of each output [10]. The T-S fuzzy modeling method based on Bayesian block structure sparsity for continuous prediction of silicon content of molten iron. By this method, the main important fuzzy rules and the corresponding pivotal result parameters can be automatically selected to obtain compact fuzzy models with good generalization performance [11]. The effect of parameter variations on the silicon content was determined by differential processing and correlation analysis [12]. The Elman silicon content prediction model and logistic furnace temperature fluctuation prediction model were developed [13]. The adaptive density peak clustering of the Bonferroni index and a multi-source path finding algorithm were combined to construct a method to predict the silicon content of molten iron in blast furnace based on optimal work migration [14].

Based on the above analysis, it is important to understand the meaning of data-driven technology: "deep mining of universal laws among sample information in known, incomplete or imprecise measurement results of big data (samples)" [15]. The data information collected during the blast furnace smelting process can be divided into "control variables" and "state variables" according to their functions; the data information collected during the blast furnace smelting process can also be divided into "sample input" and "sample output" according to the structure of the data-driven model sample set. For the "sample input" and the "sample output", the sample input index data are ahead of the sample output and are easier to obtain than the output index. Obviously, the sample input is the state variable, which is ahead of the sample output (silicon content of molten iron) for continuous prediction and needs to consider the control variables such as coal injection volume and state variables such as permeability index, but the state variables will change with the control variables. In addition, the volume of the blast furnace is huge, and the gas–solid–liquid phase fluidity exists inside the blast furnace. The physical and chemical reactions in different parts of the blast furnace are significantly different. This difference makes the blast furnace have obvious thermal hysteresis and chemical hysteresis to the step of control parameters and the fluctuation of raw and fuel properties and compositions. There are intermittent and continuous, real-time and delay relationships in the process of setting and adjusting process parameters. To this end, based on the sample information to complete the spatiotemporal alignment between sample information and form data-driven samples with certain correspondence, a path adaptive model using simulated annealing algorithm to optimize the genetic algorithm is proposed, and this improved model is coupled with wavelet neural network algorithm to deeply explore the nonlinear and strong coupling relationship between large data sample information in the blast furnace smelting process and realize a model with permeability index. The model is a continuous forecasting model with the prediction of state variables such as permeability index as the node and the prediction of silicon content of molten iron as the endpoint.

2 Model for controlling silicon content of molten iron in blast furnace

The mathematical model mechanisms of the blast furnace smelting process are primarily based on hydrodynamic and metallurgical reaction kinetics.

  1. (1)

    The blast furnace smelting process can be examined from a fluid dynamics perspective. The “downward fluid” movement comprises the movement of raw fuel loaded at the furnace top under high-temperature physicochemical conditions. The “upward fluid” movement includes movement of the blast air, coal injection entering via the air outlet, and the resulting fluid movement in the plant from the chemical reactions. The applicable universal laws of fluid science are the mass conservation equation, equation of fluid motion, and equation of state of matter [16]. Considering only the fluid in the blast furnace axial z-direction moving at a material speed (vLS) and ignoring the radial (r) and annular (θ) effects on the movement speed, the simplified blast furnace fluid dynamics mathematical model can be expressed as follows:

    $$ \begin{gathered} \frac{\partial \rho }{{\partial t}} + \frac{{\partial \rho {\varvec{v}}_{{{\text{LS}}}} }}{\partial z} = k\rho \hfill \\ \frac{{{\text{d}}{\varvec{v}}_{{{\text{LS}}}} }}{\text{d}t} = G - \frac{1}{\rho }\frac{{Q_{{{\text{fq}}}} }}{{F_{{\text{f}}} }} \hfill \\ \end{gathered} $$
    (1)

    where Qfq is the air volume; ρ is the fluid density; k is the scale factor; Ff is the permeability index, defined as the ratio of air volume to differential pressure P, that is, Ff = Qfq/grad P; t is the time; and G is gravity.

  2. (2)

    The chemical reaction in blast furnace smelting process is the chemical reaction in fluid motion. From the perspective of mathematical modeling, more than 100 chemical reactions in blast furnace are abstracted into generalized chemical reaction equations (A + B = F + G + Um ∩ (Ts and p and w))i [17], where A and B are reactants, F and G are products, and Um denotes the thermal effect of the reaction process. The model constraints are the starting temperature Ts, pressure p, and concentration of reactants w required for the ith chemical reaction to proceed. Only chemical reactions that satisfy these constraints occur. Analyzed from the viewpoint of mathematical planning models, it constitutes a special mathematical planning problem—a microscopic, large scale, dynamic linear programming problem.

The intricate correlation of the two kinetic models causes extremely complex variations in the silicon content of molten iron. Figure 1 shows the migration trends of silicon content of molten iron in the longitudinal direction of the blast furnace, starting from the middle of the furnace to the window area. The silicon content increases with increasing carbon in the molten iron, reaching a maximum in the window area, and is higher than that in the final iron. Only the silicon content in the metal directly above the window is close to the outgoing iron value as silicon continues to enter the pig iron below the window [18]. Silicon is absent in the metal in the soft melt before the drip zone, and it is only in the high-temperature zone after the drip zone that silicon gradually enters the pig iron.

Fig. 1
figure 1

Metal iron and slag composition migration trends along longitudinal direction of blast furnace

A prerequisite for predicting the silicon content of molten iron is to clarify the silicon source, which determines the interval for estimating the silicon content of molten iron. Based on the material balance, SiO2 from the coke ash can supply the silicon needed for low-silicon iron in the form of SiO. However, as it is available in limited quantities (Table 1), silicon should also be obtained from the slag when smelting other iron types. Consequently, the thermodynamically permissible silicon content in blast furnace pig iron is much greater than the actual quantity of silicon supplied via coke ash. Moreover, the silicon reduction is governed by kinetic conditions such as the molten iron temperature and SiO2 activity in the slag.

Table 1 Maximum amount of silicon supplied by different coke ratios (kg/t)

A reasonable approach is to express the relationship between the thermal regime and the silicon in the pig iron in terms of the heat balance established by the volume of gas, the amount of material, and the respective temperature levels in the high-temperature zone below the drip zone. The residence time of the iron droplets in the reaction zones, that is, the gas–liquid contact time (t) in relation to the smelting cycle (τ) and drip zone height (h), is overlooked. Under countercurrent conditions, an increase or decrease in the iron solution residence time should be expressed as an increase or decrease in the residence time of the gas in this zone. Because there is no change in the amount of heat consumed per unit time and that supplied by the gas, the residual time (τh/H, where H is the height of blast furnace body) is eliminated when calculating the silicon content in pig iron.

To determine the estimated silicon content in the molten iron based on material balance, the blast furnace supervisor can evaluate the relevant fluctuation patterns by adjusting the key controlling parameters, such as the material speed, permeability index, air volume, and coal injection amount (Qpm). The air volume and the coal injection are related to fluid dynamics, while the material speed and the permeability index are related to chemical reaction kinetics and state variables of fluid dynamics. The partial differential equation for variations in the silicon content of molten steel can be expressed as follows:

$$ \frac{{{\text{d}}w_{{[{\text{Si}}]}} }}{{{\text{d}}t}} = L(t)\frac{{\partial {\varvec{v}}_{{{\text{LS}}}} }}{\partial t} + F(t)\frac{{\partial F_{{\text{f}}} }}{\partial t} + Q(t)\frac{{\partial Q_{{{\text{fq}}}} }}{\partial t} + M(t)\frac{{\partial Q_{{{\text{pm}}}} }}{\partial t} $$
(2)

where L(t), F(t), Q(t), and M(t) represent the influence coefficients of the differential terms material speed, permeability index, air volume, and coal injection amount, respectively, and are nonlinear functions that vary with furnace temperature and index conditions.

Through the above analysis, the estimation interval and the variation mechanism of the silicon content of molten iron can be determined. Consequently, a big data mining study of the blast furnace smelting process was conducted to achieve the multi-step prediction of the silicon content of molten iron continuously and accurately by incorporating data-driven technology.

3 Time registration between big data and sample information in blast furnace smelting process

3.1 Determining data index category of blast furnace smelting process based on control theory

The blast furnace is an inertial system from a control perspective. When many control parameters (actions) change, it causes a lag of state parameters. The possible lags and the corresponding response law should be digitally expressed to realize intelligent prediction and control of the blast furnace smelting process. Moreover, a certain transition is required when the blast furnace changes from one stable state to another. The first-order response, as expressed in Eq. (3), can schematically characterize the response process of the silicon content of molten iron, as shown in Fig. 2.

$$ y\left( {t + t_{D} } \right) = Kx - T\frac{{{\text{d}}y\left( {t + t_{D} } \right)}}{{{\text{d}}t}} + y_{0} $$
(3)

where x denotes a step, that is, a control parameter; tD is the time delay; T is the time constant; y is the final effect; K is the gain; y0 denotes the response with time t = 0; and y(t + tD) denotes the response with time t. After a step is sent, a response can only occur after a time delay, reaching a stable state after a period of 4 T or 5 T. Hence, the actual period from changing a control parameter to obtaining the desired response is tD + 4 T or tD + 5 T.

Fig. 2
figure 2

Schematic of first-order response process of silicon content in molten iron

Several parameters (including interference) often play a key role in strengthening or weakening the changes in the silicon content of molten iron. Often, other parameters can change before the completion of specific behavior, which can positively or negatively impact the previous transition behavior, resulting in a complex transition behavior. Consequently, the big data from the blast furnace smelting process should be further classified from a control system perspective.

  1. (1)

    Control variables are the variables controlled when the blast furnace length adjusts the operational state of the blast furnace. It comprises the relevant variables of the upper distribution and lower blast systems. Typical indicators include blast volume, blast temperature, blast humidity, coal injection volume, oxygen enrichment rate, pulverized coal composition, hot air pressure, and air pressure.

  2. (2)

    State variables are the primary performance indicators that measure the operational state of the blast furnace. These variables are concentrated in the blast furnace body.

3.2 Big data sample set of blast furnace smelting process based on time registration

Abnormalities in the collected data can result from the failure of blast furnace detection instruments and the abnormal operation of the blast furnace [19]. This study completes the abnormal data processing information collected during the blast furnace smelting process using the following three steps:

Step 1: Abnormal value detection. The 3σ criterion is adopted as the criterion to detect abnormal values in the blast furnace sample set, X = {x2, …, xn}. When the absolute difference between the detected and average value is greater than 3σ, it is marked as an abnormal value; σ can be expressed as follows:

$$ \sigma = \sqrt {\sum\limits_{i = 1}^{n} {e_{i}^{2} /(n - 1)} } = \sqrt {\sum\limits_{i = 1}^{n} {(x_{i} - \overline{x})^{2} /(n - 1)} } $$
(4)

where n is a positive integer greater than or equal to 2; xi is the ith measurement of index \({\varvec{X}}\); \(\overline{x}\) is the average of all measurements of index \({\varvec{X}}\); and ei is the difference between the xi and the \(\overline{x}\).

Step 2: Outlier classification. This study compares the curves of the permeability index, cold air pressure, and silicon content of molten iron as indexes closely related to the molten iron content. Figure 3 shows that the molten iron silicon contents obtained from the eighteenth and thirty-second heats are at the lowest point, belonging to a suspicious point of abnormal value. Analyzing the changes in the cold air pressure and permeability index—the cold air pressure decreases considerably, accompanied by an increase in the permeability index—shows signs of cooling furnace temperature. Consequently, it was determined that the suspicious points of these two abnormal data are non-abnormal values and vice versa.

Fig. 3
figure 3

Outlier determination

Step 3: Normalization of indicator data. Based on Steps 1 and 2, the abnormal data caused by sensor failure are deleted, and the abnormal data caused by the abnormal operational state of the blast furnace are retained. The sample indexes after abnormal data processing differ owing to the differences in their dimensions. For example, the blast furnace permeability index range is [50, 100], while the cold air flow parameter range is [1800, 2300]. It is unreasonable to use them directly in predicting molten iron silicon content, as they can falsely influence the prediction results. Data normalization in Eq. (5) can be used to control the range of control parameters, such as the coal injection volume and air pressure, between [0,1] to improve the accuracy of subsequent model predictions.

$$ x_{i}^{ * } = \frac{{x_{i} - \min ({\varvec{X}})}}{{\max ({\varvec{X}}) - \min ({\varvec{X}})}} $$
(5)

where \(x_{i}^{ * }\) is the standard value of xi.

The data-driven technology and intelligent algorithms deeply mine the big data derived from the blast furnace smelting process to realize the accurate and continuous prediction of molten iron silicon content. The premise involves selecting the appropriate variables and their most appropriate time value. The correlation between the control variables, state variables, and molten iron silicon content under different time delay is obtained using the Spearman correlation analysis to realize the time registration between the index data. The principle of maximum correlation between the characterization parameters and molten iron silicon content under different time delay helps select the most appropriate time value of the variable.

Table 2 shows the sample indicator variable symbols and descriptions.

Table 2 Sample indicator variable symbols and descriptions

Step 1. Obtain 68 groups of index data—with a sampling period of 1 h—which correspond with their collection time and can be used to generate the initial sample set.

Step 2. Derive each index corresponding to the molten iron silicon content sequence under 0 delays based on eight different delay sequences. A sequence correspondence under four delays is shown in Fig. 4. The abscissa “0” refers to the current time, “1” refers to the time in the next 1 h, that is, the time of the nearest sampling in the future, and “ − 1” refers to the time of the nearest sampling from the current time in history.

Fig. 4
figure 4

Corresponding relationship diagram of correlation coefficients between index sequences under different delays to be solved

Step 3. Calculate and record the absolute correlation coefficient between each index in Fig. 4 and the molten iron silicon content sequence under eight different delays.

Step 4. Compare the absolute value of the correlation coefficient of each row in Table 3 and underline and mark bold to the largest value. If there is little difference between the two or three adjacent correlation coefficients and they are considerably higher than the coefficients under other delays, underline and mark bold to those coefficients too.

Table 3 Raw material control variable information for time registration

Step 5. Record the delay corresponding to the marked correlation coefficient in Table 3 and assign the delay directly to τi. Take the corresponding index data sequence as the sample input.

We collected a big data sample set from the smelting process of a blast furnace (3200 m3) in a steel plant in northern China. The samples comprised 55,000 hot metal sampling and testing results from November 1, 2019 to October 31, 2021 and the corresponding raw material and process information. Based on a sampling period of 1 h, a sample set using the variables shown in Table 2 as the input and the molten iron silicon content as the output for a given time sequence was constructed. Spearman analysis was used to calculate the correlation coefficient between the 68-index data shown in Table 2 (set time delay 0, 1, 2, 3, 4, 5, 6, and 7 h). After the above five steps, Table 3 is obtained, the order of delay being from small to large. We discern the following special cases.

  1. (1)

    The indexes with an absolute value of correlation coefficient greater than 0.8 with data series of the molten iron silicon content include the actual coal injection amount {y23(t), y23(t − 1)} under 0 and 1 delay, the permeability index {y41(t − 1)} under 1 delay, the heat supplied to the lower part of the cohesive zone under 1 and 2 delay {y50(t − 1), y50(t − 2)}, the proportion of the heat supplied to the lower part of the cohesive zone under 1 delay in the total heat supply {y51(t − 1)}, and the historical hot metal silicon content {Y[Si](t − 1)} under 1 delay.

  2. (2)

    At 0 and 1 time delay, the correlation coefficient with the data series of the molten iron silicon content is relatively large for the air temperature {y20(t), y20(t − 1)}, actual coal injection {y23(t), y23(t − 1)}, oxygen enrichment {y26(t), y26(t − 1)}, total load {y29(t), y29(t − 1)}, permeability index {y41(t), y41(t − 1)}, and single layer coke thickness {y44(t), y44(t − 1)}.

  3. (3)

    At 1 and 2 time delay, the correlation with the data series of the molten iron silicon content is a relatively large indicator of the heat supplied to the lower part of the cohesive zone {y50(t − 1), y50(t − 2)}.

  4. (4)

    At 2 and 3 time delay, correlations are exhibited by the differential pressure {y40(t − 2), y40(t − 3)} and the gas quantity at the air outlet {y47(t − 2), y47(t − 3)}.

  5. (5)

    At 1, 2, and 3 time delay, the correlations with the data series of the molten iron silicon content are relatively large indicators of the molten iron silicon content at historical moments {Y[Si](t − 1), Y[Si](t − 2), Y[Si](t − 3), Y[Si](t − 4), Y[Si](t − 5)}.

  6. (6)

    At 3 and 4 time delay, the correlation is relatively large for the total heat supply {y49(t − 3), y49(t − 4)}.

  7. (7)

    At 3, 4, and 5 time delay, the correlation is relatively large for the gas index of the furnace belly {y47(t − 3), y47(t − 4), y47(t − 5)}.

4 Development of a dynamic and continuous prediction model for molten iron silicon content

Based on modern cybernetics, the blast furnace smelting process can be simplified into a multiple-input single-output system, and the molten iron silicon content can be predicted using the operating parameters. When the system is in a relatively stable state, the molten iron silicon content at t + 1 can be expressed as follows:

$$ y(t) = \phi_{1} y(t - 1) + \cdots + \phi_{n} y(t - n_{\phi } ) + \gamma_{1} u(t - 1) + \cdots + \gamma_{n} u(t - n_{\gamma } ) + \theta_{1} e(t - 1) + \cdots + \theta_{n} e(t - n_{\theta } - 1) + \theta (t) $$
(6)

where y(t) and y(t − 1) denote the molten iron silicon content at t and t − 1, respectively; u denotes the input variable, that is, the operating parameter to be considered, such as wind temperature and load; e denotes the error of the estimated value; φ, γ, and θ denote the model parameters; and φn, γn, and θn denote the response order of y, u and e, respectively.

Using Eq. (6), the dynamic data system model of hot metal silicon content can be obtained as shown by \(\hat{y}(t)\) in Eq. (7) that represents the estimated value of y(t).

$$ \hat{y}(t) = \phi_{1} y(t - 1) + \cdots + \phi_{n} y(t - n_{\phi } ) + \gamma_{1} u(t - 1) + \cdots + \gamma_{n} u(t - n_{\gamma } ) + \theta_{1} e(t - 1) + \cdots + \theta_{n} e(t - n_{\theta } - 1) $$
(7)

In a complex blast furnace smelting process, the adjustment of one control variable can often cause changes in multiple state variables. In addition, each state variable can be affected by multiple control variables. Moreover, the static prediction model of molten iron silicon content based on historical data can be affected by the actual working conditions, resulting in poor robustness. By analyzing the time registration between big data in the blast furnace smelting process and studying the impact of different index data on state variables, the multi-path coupling relationship between variables, and the influence of each index on molten iron silicon content, the dynamic and continuous prediction of molten iron silicon content can be realized [20,21,22]. Through path optimization, the impact of the characterization parameters on different control variables as the starting point and the hot metal silicon content as the endpoint can be obtained. After retrieving the state variables in each path, we can use the characterization parameters before the state variables on each path as the input and the corresponding state variables as the output to predict each state variable. Finally, we integrate all control and state variables as the input samples to realize the dynamic prediction of molten iron silicon content. This study designed an improved path optimization algorithm by coupling simulated annealing algorithm (SA) and genetic algorithm (GA). The implementation framework of the wavelet neural network algorithm can be used to construct a cascade dynamic and continuous prediction model of molten iron silicon content.

More indexes related to the molten iron silicon content should be considered to form index redundancy and improve the model prediction accuracy [23]. However, as more than one index data is possible in a physical sense, and sequences with different delays strongly correlate with the molten iron silicon content, it is necessary to re-plan the variables, dividing the index data into more than one index based on the delay. The sample input set for predicting the molten iron silicon content under the time sequence constructed in this study has 83 elements (Table 3), of which 40 are control variables, and 43 are state variables, including the molten iron silicon content at time delay 1, 2, and 3). Assuming that the set of control and state variables are B and C, respectively, then A = B ∪ C. The elements in the sets of B and C can be expressed as follows:

$$ \begin{array}{*{20}c} {{\varvec{B}} = } & \{ & {y_{1} (t - 7)} & {y_{2} (t - 7)} & {y_{3} (t - 7)} & {y_{4} (t - 7)} & {} \\ {} & {} & {y_{5} (t - 7)} & {y_{6} (t - 7)} & {y_{7} (t - 7)} & {y_{8} (t - 7)} & {} \\ {} & {} & {y_{9} (t - 7)} & {y_{10} (t - 7)} & {y_{11} (t - 7)} & {y_{12} (t - 7)} & {} \\ {} & {} & {y_{13} (t - 7)} & {y_{14} (t - 7)} & {y_{15} (t - 7)} & {y_{16} (t - 7)} & {} \\ {} & {} & {y_{17} (t - 5)} & {y_{18} (t - 5)} & {y_{19} (t - 6)} & {y_{20} (t - 7)} & {} \\ {} & {} & {y_{24} (t)} & {y_{25} (t)} & {y_{25} (t - 1)} & {y_{26} (t)} & {} \\ {} & {} & {y_{27} (t - 6)} & {y_{28} (t)} & {y_{28} (t - 1)} & {y_{29} (t - 3)} & {} \\ {} & {} & {y_{30} (t)} & {y_{31} (t)} & {y_{31} (t - 1)} & {y_{32} (t)} & {} \\ {} & {} & {y_{33} (t)} & {y_{34} (t)} & {y_{34} (t - 1)} & {y_{35} (t - 2)} & {} \\ {} & {} & {y_{36} (t - 3)} & {y_{37} (t - 2)} & {y_{38} (t)} & {y_{39} (t - 4)} & \} \\ \end{array} $$
(8)
$$ \begin{array}{*{20}c} {{\varvec{C}} = } & \{ & {y_{21} (t - 2)} & {y_{22} (t - 6)} & {y_{23} (t - 2)} & {y_{40} (t - 2)} & {} \\ {} & {} & {y_{41} (t - 1)} & {y_{42} (t - 1)} & {y_{43} (t - 1)} & {y_{44} (t - 1)} & {} \\ {} & {} & {y_{45} (t - 2)} & {y_{45} (t - 3)} & {y_{46} (t)} & {y_{46} (t - 1)} & {} \\ {} & {} & {y_{47} (t)} & {y_{48} (t - 1)} & {y_{49} (t)} & {y_{49} (t - 1)} & {} \\ {} & {} & {y_{50} (t - 6)} & {y_{51} (t)} & {y_{52} (t - 2)} & {y_{52} (t - 3)} & {} \\ {} & {} & {y_{53} (t - 3)} & {y_{53} (t - 4)} & {y_{53} (t - 5)} & {y_{54} (t - 3)} & {} \\ {} & {} & {y_{54} (t - 4)} & {y_{55} (t - 1)} & {y_{55} (t - 2)} & {y_{56} (t - 1)} & {} \\ {} & {} & {y_{57} (t - 7)} & {y_{58} (t)} & {y_{59} (t)} & {y_{60} (t)} & {} \\ {} & {} & {y_{61} (t - 5)} & {y_{62} (t - 3)} & {y_{63} (t - 3)} & {y_{64} (t - 3)} & {} \\ {} & {} & {y_{65} (t - 3)} & {y_{66} (t - 3)} & {y_{67} (t - 3)} & {y_{68} (t - 3)} & \} \\ \end{array} $$
(9)

Based on the sets shown in Eqs. (8) and (9), a weighted directed graph of the silicon content in the iron can be constructed, with the weights and directions of the directed graph being specified as follows.

  1. (1)

    Taking all elements in the set as nodes and the correlation coefficient between elements as the weight, the correlation coefficient matrix between the data series of each variable is 83 × 83 (since the number of variables studied is 83). However, the calculation method is simple and can be solved quickly using SPSS software; hence, it is not shown in this paper.

  2. (2)

    The effect of the silicon content on each direction of molten iron can be determined. From any element, the arrow always points to an element with less delay. If the delays of two elements are equal, the two elements are the starting and endpoint of each other.

  3. (3)

    Starting from any element, the path that can reach the molten iron silicon content at the current time through the directed graph can be considered as one path. The travel route can only move forward without turning back. This path reflects the information transfer direction and influence relationship among the characteristic parameters of blast furnace.

  4. (4)

    The indexes to determine the advantages and disadvantages of a path are the least cost (eliminating the correlation between variables) and longest path (providing more sample input redundancy), that is, the sum of path weights is the smallest on the premise of traversing as many nodes as possible.

GA and SA are intelligent algorithms used to solve complex programming problems. A GA and SA coupled blast furnace characterization parameter path optimization algorithm was designed to solve the multi-path coupling problem between variables. The proposed algorithm adopts the strong overall searchability of GA and the strong local searchability of SA to circumvent the disadvantages of insufficient stability of GA and the time-consuming overhead of SA. Based on the correlation between each index under different time delay sequences and the correlation with the molten iron silicon content under different time sequences, a weighted directed graph of the characterization parameters of the molten iron silicon content can be constructed. Several best influence paths in-line with the actual production situation of the blast furnace can be excavated using different control variables as the starting point, laying the foundation for the follow-up intelligent learning of the wavelet neural network algorithm.

The path optimization algorithm of the GA and SA coupling technique is described as follows.

  1. (1)

    Using any control variable as the starting point and the molten iron silicon content at time t as the endpoint, the initial population can be randomly generated.

  2. (2)

    Determine the fitness functions. Set the fitness function as shown in Eq. (10), where αi denotes the correlation between the ith and (i + 1)th parameter, k is a coefficient, and m denotes the number of individuals in the population.

  3. (3)

    The roulette method can be adopted. Based on individual fitness, some individuals with greater fitness are selected from the population with the probability of \(p^{\prime}_{i}\) to form a mating pool, as expressed in Eq. (11), where N is the population, and Fi is the fitness of the ith individual.

  4. (4)

    The adaptive probability function can cross, mutate, and update the mating pool. The adaptive probability function can be expressed as shown in Eq. (12), where f denotes the larger fitness of the two individuals crossing, favg denotes the average fitness value, fmax denotes the best fitness, and p1 and p2 are constants.

  5. (5)

    SA can be applied to the new individuals generated after crossover and mutation. The accepted individuals and their parents conduct elite selection to form their offspring population.

  6. (6)

    Repeat execution of Steps 2–5. Stop when the fitness function value no longer increases.

    $$ F = \sum\limits_{i = 1}^{m - 1} {\frac{{k\alpha_{i} }}{m - 1}} $$
    (10)
    $$ p^{\prime}_{\rm A} = \frac{{k/F_{i} }}{{\sum\limits_{i = 1}^{N} {k/F_{i} } }} $$
    (11)
    $$ p_{1} = \left\{ {\begin{array}{*{20}l} {\left( {f_{{\max }} - f} \right)/\left( {f_{{\max }} - f_{{{\text{avg}}}} } \right),{\text{ }}} \hfill & {f > f_{{{\text{avg}}}} } \hfill \\ {p_{2} ,} \hfill & {f \le f_{{{\text{avg}}}} } \hfill \\ \end{array} } \right. $$
    (12)

This study used the wavelet neural network algorithm to learn the sample set and mine the quantitative relationship between sample inputs and outputs. Though the wavelet and back propagation (BP) neural network algorithms are multilayer feedforward networks, the kernel functions are wavelet and sigmoid functions, respectively. Wavelet analysis can better determine the local characteristics of the signal by transforming the wavelet basis function. The corresponding weight and activation threshold from the input to the hidden layer can be replaced by the scale expansion and time translation factors of the wavelet function. Moreover, the gradient correction method can modify the weight and wavelet basis function parameters of the network, giving the wavelet neural network stronger nonlinear approximation abilities.

The training steps of the wavelet neural network algorithm are as follows:

  1. (1)

    Randomly initialize the scaling factor, translation factor, and network connection weight of the wavelet function, and set the network learning rate.

  2. (2)

    Divide the samples into training and test samples. The training samples are used for network training, and the test samples to test the network prediction accuracy.

  3. (3)

    Input the training samples into the network, to calculate the prediction output and the error between the network and expected output.

  4. (4)

    The network weight and wavelet function parameters can then be corrected based on the error, making the network predicted value close to the expected value in the future.

  5. (5)

    Determine whether the algorithm ends. If not, return to 3.

Thus, starting from each control variable, that is, the elements in set B, the optimal path to reach the molten iron silicon content can be found using the improved GA and SA. The corresponding nodes of elements in set C can be retrieved by integrating the optimal path results. The characterization parameters before the node are used as the prediction input of the state variables [24]. The prediction value of elements in C can be obtained using the wavelet neural network and is substituted into the prediction input data of the molten iron silicon content to realize dynamic prediction [25]. The dynamic and continuous prediction model of molten iron silicon content based on path optimization is shown in Fig. 5.

Fig. 5
figure 5

Dynamic prediction model of molten iron silicon content based on path optimization

The dynamic and continuous prediction model of molten iron silicon content has the following advantages:

  1. (1)

    Supplement the data set under low delay. During the early stage of blast furnace operation, the state variables with a small time delay with the molten iron silicon content have not yet been obtained. Therefore, the corresponding state variables are predicted through the known control variables that supplement the input data of the prediction model to realize a dynamic prediction model.

  2. (2)

    Dynamic precise control. Based on the operational mechanism of the blast furnace and actual operation of workers, this model aims to realize the dynamic control and early prevention of furnace temperature. The workers can predict changes in corresponding state parameters by assuming the input control parameters to predict changes in the furnace temperature. Thus, the goal of a stable and abundant furnace temperature and conditions can be achieved.

5 Results and discussion

5.1 Optimal path for silicon content prediction in hot metal

The primary goal of the research is to find the optimal prediction path for the molten iron silicon content. To verify the excellent properties of the improved objective algorithm (GA and SA) in path finding, we performed simultaneous search operations using the three algorithms and compared their performance in terms of iteration times and fitness functions. The smaller the number of iterations and the larger the fitness function, the better the performance. This metric is equivalent to the metric for determining the merit of a path.

By analyzing the elements in set B, a longer path can be considered using the elements in the subset B as the starting point, as follows:

$$ \begin{array}{*{20}c} {{\varvec{B^{\prime}}} = } & \{ & {y_{1} (t - 7)} & {y_{2} (t - 7)} & {y_{3} (t - 7)} & {y_{4} (t - 7)} \\ {} & {} & {y_{5} (t - 7)} & {y_{6} (t - 7)} & {y_{7} (t - 7)} & {y_{8} (t - 7)} \\ {} & {} & {y_{9} (t - 7)} & {y_{10} (t - 7)} & {y_{11} (t - 7)} & {y_{12} (t - 7)} \\ {} & {} & {y_{13} (t - 7)} & {y_{14} (t - 7)} & {y_{15} (t - 7)} & {y_{16} (t - 7)} \\ {} & {} & {y_{20} (t - 7)} & {y_{19} (t - 6)} & {y_{27} (t - 6)} & \} \\ \end{array} $$
(13)

In Table 4, the above 19 elements were used as the path starting points, and the three algorithms were applied simultaneously to set A. The number of iterations and the value of the fitness functions for each optimal path were obtained for comparative analysis. Based on the data summarized in Table 4, the following results were obtained:

  1. (1)

    Comparative analysis of the number of iterations. The GA achieved a stable value of fitness function at an average of 68 iterations in determining the optimal path, while the SA algorithm required 126 iterations. Based on the relationship between the number of iterations and the running time of the algorithm, the SA algorithm took twice as long to run as GA. The average number of iterations of the GA and SA algorithm was close to that of GA, suggesting that the target algorithm inherited the advantage of the rapid convergence of GA.

  2. (2)

    Comparative analysis of fitness values. The average fitness function value of the GA for searching the optimal path for the 19 different starting points in Table 4 was 0.24004, while that of the SA algorithm was 0.28407, with an improvement of 18.34%. The average fitness function value for the target algorithm was 0.30021, which is significantly better than those of the individual GA and SA algorithms. Thus, the target algorithm inherited the global rapid search capability of GA and the local optimal value-seeking capability of SA.

  3. (3)

    Overall. The GA and SA algorithm exhibits excellent path finding performance and quality, achieving a comparable number of iterations to the GA algorithm and optimal fitness values. Paths (1) y4(t − 7) → Y[Si](t), (2) y7(t − 7) → Y[Si](t), (3) y8(t − 7) → Y[Si](t), (4) y19(t − 6) → Y[Si](t), and (5) y22(t − 6) → Y[Si](t) have higher values of fitness function. Overall, the fitness value achieved using the GA and SA for path (2) was considerably better than that of the two comparison algorithms and had the highest fitness values of all paths.

Table 4 Comparison of three paths optimization algorithms

The molten iron silicon content prediction path aims to eliminate the correlation between variables and reduce the number of input indicators while considering the coupling synergy of multiple variables in terms of duration and structure. Thus, a cascade of continuous molten iron silicon content predictions is achieved with coupled state variable node studies to determine that this model works best.

The state variable permeability index (y41(t)) and theoretical combustion temperature (y46(t)) with “0” delay as the molten iron silicon content at the current moment were selected as evaluation indicators to verify this assumption. We compared the forecast accuracy of y41(t) and y46(t) on paths (1)–(5) and selected the path with the highest forecast accuracy for both state variables as the optimal path.

We used the wavelet neural network algorithm to learn the sample data on paths (1)–(5) to obtain comparison plots of the forecast results of the permeability index and theoretical combustion temperature. The actual values and error plots when forecasting the permeability index under the five paths are shown in Figs. 6 and 7, and those for the theoretical combustion temperature are shown in Figs. 8 and 9. The wavelet neural network structure used in this study was 5 − 30 − X. The five nodes in the input layer indicate the number of steps in the prediction sequence. The 30 and X nodes in the hidden and output layers indicate the number of state variables on the prediction path.

Fig. 6
figure 6

Comparison of prediction results of ventilating index under 5 paths

Fig. 7
figure 7

Comparison of prediction error of permeability index under 5 paths time series

Fig. 8
figure 8

Comparison of theoretical combustion temperature prediction results under 5 paths

Fig. 9
figure 9

Comparison of theoretical combustion temperature of prediction error of permeability index under 5 paths time series

Figure 6 represents the comparison of the prediction effects of the permeability state variables achieved under the five preferred paths. Since the starting point of path (2) is the raw material information and the lag time is the longest (7 h), the path can traverse more selective nodes from the starting point to the endpoint, so that the sample set is guaranteed to be rich in content while seeking the optimal selection. It is verified through simulation prediction tests that the sample set formed based on path (2) is the most superior and can make the prediction results closer to the actual values. In addition, Fig. 7 shows the distribution of the permeability prediction error under the five preferred paths, which shows that the prediction error of the algorithm using the sample set formed by path (2) is within [− 1, 1], the prediction error of path (3) is within [− 5, 5], and the prediction error of the remaining three preferred paths is within [− 20, 20]. Figures 8 and 9, which use the theoretical combustion temperature as the prediction node in the data representation, again confirm that the best prediction is achieved using path (2) to form the sample set.

Figures 6, 7, 8 and 9 show that the predicted and actual values are the closest to path (2), y7(t − 7) → Y[Si](t), for the permeability index and theoretical combustion temperature forecasts. Consequently, path (2) was used as the optimal path for the molten iron silicon content prediction.

5.2 Predicting molten iron silicon content

The sample set for prediction can be divided into three categories as follows:

  1. (1)

    Class 1 data use all the variables in Table 2 as a sample set, including instant and historical data collected during the blast furnace smelting process. To realize the advanced prediction of the molten iron silicon content, we pay more attention to the effective predictions of Y[Si](t + 1), Y[Si](t + 2), Y[Si](t + 3), Y[Si](t + 4), and Y[Si](t + 5). The input of the sample output time ahead or the synchronous sample input index when predicting Y[Si](t + 1) can be the data from the actual historical data aligned to moment t + 1 when the sample input indicators in Table 2 are control variables. For example, the kinetic energy of the blast (y28) control variable can be used directly as a sample input with its corresponding time series y28(t + 1). When the sample input indicator is a state variable, the sample input indicators in Table 2 at time t can only be selected. For example, the theoretical combustion temperature (y46) state variable can only be used as a sample input at time t with y46(t), not with its temporal counterpart y46(t + 1).

  2. (2)

    Class 2 data are immediate and historical data based on the control variables selected after optimal filtering and the state variables that pass through the path. The selection constraints for the indicators are the same as those of Class 1 data. Certain state variables cannot be applied directly, and the number of variables is only reduced by path finding.

  3. (3)

    Class 3 data are immediate and historical data for control variables selected based on optimal path filtering and predicted values for state variables. The selection of indicators for Class 3 data would not have the constraints of Class 1 and Class 2 data. The state variables corresponding to the time series Y[Si](t + 1), Y[Si](t + 2), Y[Si](t + 3), and Y[Si](t + 4) are obtained via prediction, then used as inputs, so that the prediction process could achieve dynamic self-renewal of the sample set. The wavelet neural network algorithm intelligently learns the above three classes of sample sets to construct multi-step prediction models for the molten iron silicon content Y[Si](t), Y[Si](t + 1), Y[Si](t + 2), Y[Si](t + 3), Y[Si](t + 4), and Y[Si](t + 5). A comparison of the prognosis results and analysis is shown in Figs. 1015.

Fig. 10
figure 10

Comparison of real-time prediction effect

Fig. 11
figure 11

Comparison of step 1 ahead prediction effect

Fig. 12
figure 12

Comparison of step 2 ahead prediction effect

Fig. 13
figure 13

Comparison of step 3 ahead prediction effect

Fig. 14
figure 14

Comparison of step 4 ahead prediction effect

Fig. 15
figure 15

Comparison of step 5 ahead prediction effect

When comparing the real-time forecast results, since Class 2 and 3 sample sets are similar at this stage, the forecast accuracy is also equivalent. The forecast using Class 1 is the worst. Firstly, it considers too many redundant variables, resulting in a small proportion of effective information. Secondly, the learning is poor because the parameter selection of the wavelet neural network algorithm depends on the few useful sample data for path optimization.

When comparing the 1- to 5-step ahead prediction results, the prediction accuracy using the Class 3 sample set always shows a considerable advantage. The forecast accuracy of Class 1 and 2 sample sets is low, with a maximum error boundary. We conclude that some of the control variables in the historical data play a role in frame support, that is, the empirical control of operations such as adjusting the fabric and improving the blast can change the molten iron silicon content trends. However, accurate research, consideration, and control cannot be achieved from an accuracy perspective.

The average error values were analyzed to determine the validity of the proposed method. Table 5 confirms the advantages and extension value of the proposed step-by-step silicon content prediction technique. The mean absolute error and coefficient of variation increase, and the prediction hit rate decreases on the sequence of real-time forecasts in molten iron silicon content to 5-step ahead forecasts in all three classes. The results reflect the dependence of forecast accuracy on the sample set validity, with a strong correlation between the big data collected during the blast furnace smelting process and the molten iron silicon content. Using Class 1 and 2 data, the average absolute error and hit rate in the predicted molten iron silicon content from 1- to 5-step ahead were in the range of [0.241, 0.254] and [17.5%, 23.0%], respectively. Although the hit rate of real-time prediction was above 94%, it was ineffective in providing a reference for the overcasting of the molten iron silicon content regulation. Class 3 data achieved over 87% of the 5-step ahead forecasts, with 91.16% of the forecasts hitting 1-step ahead. Class 3 data were a dynamic sample set that achieved good prediction accuracy; however, the accuracy gradually decreased with accumulating prediction errors in the state variables, with silicon content as the endpoint.

Table 5 Comparison of step-by-step prediction effect of molten iron silicon content in blast furnace based on learning sample differences

In summary, the wavelet neural network algorithm is an intelligent algorithm suitable for learning sample sets with temporal correlation. The prediction accuracy of the same intelligent algorithm varies considerably depending on the sample sets category. y7(t − 7) → Y[Si](t) was used as the optimal path for the prediction of molten iron silicon content. The sample set was dynamically updated to achieve over 87% molten iron silicon content prediction accuracy 5-step ahead.

6 Conclusions

  1. 1.

    The optimal selection of sample indicators, based on the principle of maximum correlation and time alignment, was a prerequisite for ahead-of-time, continuous, and accurate prediction of the molten iron silicon content. In addition, it contributed to creating a good data ecological environment for using intelligent algorithms.

  2. 2.

    Adopting the GA and SA coupling algorithm, we optimized the molten iron silicon content prediction path and reduced the sample input indicator, improving the effectiveness of the sample set information. Moreover, it lowered the complexity of the intelligent data learning algorithm, directly enhancing the learning efficiency of the wavelet neural network algorithm.

  3. 3.

    Based on the optimization of the molten iron silicon content forecasting path, a self-learning forecasting model based on control variables is constructed with control variables as sample inputs and lagging state variables in the path as sample outputs. The model is the driving force to drive the dynamic update of the sample set and the guarantee to achieve 1, 2, 3, 4 and 5-step ahead prediction of the silicon content of molten iron.

  4. 4.

    The proposed cascade model with coupled state variable nodes achieved a forecasting accuracy of 91.16% in 1-step ahead predictions and a hit rate of 87.41% in 5-step ahead predictions.

  5. 5.

    The model formed a paradigm comprising data information self-processing, prediction path self-optimizing, and state variable node information self-updating. Moreover, the step-by-step and continuous prediction of molten iron silicon content can help develop intelligent control systems for blast furnaces.