Introduction

The Bayer process was studied and patented by Karl Josef Bayer and is still the most economic procedure for producing 1 ton of alumina from 1.9–3.6 t of bauxite ores (Hind et al. 1999). The process consists of three steps (Bahrami et al. 2012a; Sidrak and research 2001):

  • Bauxite is extracted from the mine and mixed with a hot stream of sodium hydroxide before entering the dissolution tank with digester slurry.

  • Precipitation of cooled supersaturated sodium aluminate liquor

  • Calcination of the aluminum hydroxide to Al2O3 and classification of the deposited crystals based on their size

The most important step for the production of alumina in the Bayer process is the precipitation stage. The precipitation step determines the final production efficiency of aluminate liquor. Super-saturation of aluminum oxide (Muhr et al. 1997), temperature (Liu et al. 2020), seed concentration (Paspaliaris et al. 1999a), impurities (Dorin and Frazer 1988; Ostap 1986; Rosenberg 2017; Vogrin et al. 2020; Wellington et al. 2007), and mixing rate affect (Bahrami et al. 2012b; Misra 2016) the precipitation process (Totten and MacKenzie 2003a).

The main step in the Bayer process for producing alumina trihydrate is precipitation. This technique involves rapidly forming a frugally soluble solid phase from a liquid solution phase (Seecharran 2010). The performance of the precipitation step strongly affects the produced aluminum’s quality, productivity, and selectivity. The appropriate control of this stage is essential to achieving maximum production efficiency, minimum operating cost, optimal particle size distribution, and high product purity (Geoff Bearne 2017). Significant parameters in the precipitation step include super-saturation (Ilievski and Livk 2006; Veesler et al. 1994; Yu et al. 2020), temperature (Huang et al. 2019), caustic ratio (Satpathyc 2014), stirring rate (Ilievski and Livk 2006; Zhou et al. 2018), seed concentration (Zhang et al. 2006), residence time, solids content, the presence of any impurities (Hui-bin Yang 2020; Smeulders et al. 2001; Zhang et al. 2018, 2006, 2020), and additives (Zeng J 2007; Liu et al. 2018; Lü et al. 2010; Sahu and Meyrick 2015; Paulaime et al. 2003; Yin et al. 2006; Zhang et al. 2009; Zeng et al. 2008) that decelerate the hydrate growth/agglomeration (Sonthalia et al. 2013).

In the precipitating solution, super-saturation degree is the most important parameter, which is affected by temperature, concentration, and caustic ratio and affects the rates of various processes, such as nucleation and growth. The solubility of alumina in the precipitation step strongly depends on both the tank and ambient temperatures (Bahrami et al. 2012a). In this regard, as temperatures rise, super-saturation and precipitation efficiency decrease. It should be noted that reducing the solution’s viscosity reduces the temperature’s effect and increases efficiency. Super-saturation and precipitation rates decrease when the caustic ratio increases, resulting in a reduction in particle size and morphological irregularities. Another important and limiting parameter in the precipitation process is the residence time of aluminum hydroxide seed precipitation, which varies approximately between 48 and 72 h. As the residence time increases, the super-saturation degree decreases and the precipitation efficiency increases (Seecharran 2010).

Zhang et al. (2009) and Sahu and Meyrick (2015) studied the effect of the caustic ratio on precipitation efficiency. The results show that increasing the caustic ratio reduced the precipitation rate.

Yu et al. (2020) measured the influence of super-saturation on the precipitation rate. Due to the increase in driving force for the migration of aluminum from the soluble phase to the crystal surface, the higher super-saturation led to an increase in the precipitation rate. Furthermore, Veesler et al. (1994), Ilievski et al. (2006) Barata et al. (1996), and Zhang et al. (2009) investigated the super-saturation parameter and found comparable results.

Huang et al. (2019) studied the temperature effect on the efficiency of the precipitation rate. The results show that with increasing temperatures, the precipitation rate decreases. Also, Zhang et al. (2009) and Yu et al. (2020) reported similar results. However, Sahu et al. (2015) obtained results that contradicted those of other researchers. They claimed that increasing the temperature would increase the precipitation rate.

Many studies have been conducted to investigate the effect of the stirring rate on the precipitation rate. As the stirring rate increases, the precipitation efficiency increases to an optimal point and then decreases (Ilievski and Livk 2006; Veesler et al. 1994; Zhang et al. 2009; Zhou et al. 2018).

Zhang et al. (2006) studied the effect of seed concentration on the precipitation rate. The results showed that as the seed concentration increased, so did the available surface area of ions on the seed surface. Bahrami et al. (Seecharran 2010) detected a similar trend in the effect of seed concentration on precipitation efficiency.

The effect of different additives (i.e., methanol, mono-hydroxyl alcohol, ether crone, ethylene diamine-tetra-acetic acid (EDTA), urea, and tetracarbon) on precipitation rate was also investigated. The positive effect of these additives on the precipitation efficiency in the Bayer process is evident (Satpathyc 2014; Zhang et al. 2007; LÜ et al. 2010; Yin et al. 2006; Ying Zhang et al. 2009; Zeng et al. 2008). Table 1 illustrates the effect of different parameters on the precipitation efficiency in the Bayer process.

Table 1 Effects of different parameters on the precipitation efficiency

Various methods, such as theoretical, experimental, semi-experimental, numerical, artificial neural networks (ANN), and machine learning approaches, can be used to model the precipitation efficiency in the Bayer process. For this purpose, experimental data on the parameters affecting the precipitation step were obtained from the Iran alumina factory. Several configurations were evaluated in the development of the machine learning algorithms, and the network performance was checked by changing the number of hidden layers, the number of neurons, the training algorithm, and the variance to obtain the best network for predicting precipitation.

According to some authors, ANN has been used to estimate alumina recovery in the Bayer process. Multiple linear regression analysis (MLRA) was used by Đurić et al. (2012) to model alumina recovery during the leaching process. They investigated the effect of bauxite composition, sodium aluminate solution composition, and caustic module composition on the degree of alumina recovery before and after the leaching process. In another study, radial basis function (RBF) and multilayer perceptron (MLP) as neural networks and the multiple linear regression (MLR) method were used to predict the alumina recovery efficiency (A.R.E.), the amount of produced red mud (A.P.R.), the red mud settling rate (R.S.R.), and bound soda losses (B.S.L.) based on the lime-to-bauxite ratio and chemical analyses in the Bayer process (Mahmoudian et al. Spring 2016). MLP, RBF, and MLR networks were used to estimate bound soda losses in Iran Alumina Complex solid residue (red mud) using regression and neural networks (AAN) (Ghaemi et al. 2018). \(\mathrm{Chelgani}\) et al. (2009) investigated the relationship between alumina leaching recovery and the chemical modules of bauxite using regression and ANN methods. Since the modeling of the precipitation stage in the Bayer process has not been done so far with machine learning approaches, an attempt was made to observe the performance of available operating parameters affecting the precipitation efficiency with two RBF and SVM networks.

In the present study, the modeling of precipitation efficiency in the Bayer process for alumina production in the Iran Alumina factory was investigated using data collected during the period 2020–2021 (521 days). The precipitation efficiency is predicted using regression and machine learning algorithms. To achieve this goal, radial basis function (RBF) and support vector machine (SVM) networks were used. The results of the two networks were compared in terms of the correlation coefficient (R2), mean square error (MSE), absolute average deviation (AAD), and absolute average relative deviation (AARD). Sensitivity analysis was applied to investigate the effect of input parameters on precipitation efficiency. The prediction performance is further demonstrated through model generalization inside the training data domain.

Practical implications and data set

The Hall–Heroult process converted alumina into the primary raw material for aluminum production. Alumina is typically extracted from bauxite using the Bayer process. Depending on the ore’s quality, one ton of alumina typically requires between 1.9 and 3.6 tons of bauxite. Digesting, clarifying, precipitating, and crystallizing are the main steps in the Bayer process (Paspaliaris et al. 1999b).

The precipitation step is the most crucial step in the Bayer process for producing alumina. The precipitation process is primarily influenced by the degree of aluminum oxide super-saturation in the input solution, variations in temperature during precipitation, the specific surface area of the bud, impurities in the liquor solution, and the mixing intensity in the precipitation tanks (Totten and MacKenzie 2003b). The production conditions influence the chemical quality and physical properties of alumina products, such as particle size, wear resistance, fracture, and shear properties. Understanding the mechanisms of the precipitation process, which include nucleation, growth, and agglomeration, is essential due to the conflict between the production of a high-quality product and a large quantity of the product.

The phenomena of nucleation, growth, and agglomeration necessitate that the solution be supersaturated with the dissolved substance. There are two distinct categories of nucleation: primary and secondary. Primary nucleation is accomplished in a supersaturated solution devoid of crystals of the soluble substance and is referred to as “nucleation.” Obviously, primary nucleation is also classified as homogeneous or heterogeneous. In homogeneous type, primary nucleation occurs in a solution devoid of solid particles, whereas in heterogeneous type, primary nucleation is caused by exogenous solid particles. Secondary nucleation involves the formation of nuclei in the vicinity and the presence of soluble material particles. By adding the bud to the solution, secondary nucleation is induced, making this form of nucleation the primary cause of nucleation during the precipitation process (Ilievski and Livk 2006). After nucleation in the precipitation process, particle growth occurs. Shape, behavior, size, purity, and tensile and wear resistance are the particle qualities that influence the growth stage. There are three distinct stages of crystal growth. The first stage, called volumetric diffusion, involves the growth units diffusing the fluid mass to reach the particles’ surfaces. The second stage, called surface diffusion, involves the growth units diffusing the particles’ faces. The third stage, called junction diffusion, involves the growth units diffusing the kink sites, where the energy is the lowest. The diffusion phase controls the rate in systems when the solubility of the dissolved substance is very high in the solvent. The surface reaction phase controls the growth process in low-solubility solutions. Agglomeration is a crucial step in the Bayer process that raises the original particle size. During this stage, the fine crystals added to the solution and the fine nuclei formed are connected. The solution must be supersaturated, which is one of the requirements for agglomeration (Bahrami et al. 2012b). According to the stated concepts, it can be concluded that the precipitation step is the most critical step in the aluminum hydroxide production process.

In order to model the precipitation efficiency with machine learning algorithms, the precipitation unit daily data set of the Iran Alumina Factory has been used from 2020 to 2021 (1560 data points from 521 days). As mentioned before, the precipitation unit efficiency is affected by various parameters. Due to the importance of the sodium oxide concentration (\({\mathrm{Na}}_{2}{\mathrm{O}}_{\mathrm{c}}\)), aluminum oxide concentration (\({\mathrm{Al}}_{2}{\mathrm{O}}_{3}\)), solid content, residence time, ambient/tank temperatures, their significant effect on the precipitation efficiency, and limitation of collected data from Iran Alumina Factory were used as input parameters for the network. It is obvious that the precipitation efficiency considers the output parameter of the network.

Various definitions have been proposed for precipitation efficiency, but the simplest definition is based on precipitation rate per volume according to Eq. (1) (Bahrami et al. 2012a):

$$\eta =\frac{{\alpha }_{\mathrm{final}}-{\alpha }_{\mathrm{primary}}}{{\alpha }_{\mathrm{final}}}\times 100$$
(1)

α (%) is the alumina module and is generally used to express efficiency as follows (Bahrami et al. 2012a):

$$\mathrm{\alpha }=1.645\times C/A$$
(2)

where \(C\) and \(A\) represent the \({\mathrm{Na}}_{2}{\mathrm{O}}_{\mathrm{c}}\) concentration and \({\mathrm{Al}}_{2}{\mathrm{O}}_{3}\) concentration in the solution phase (gr/lit), respectively.

The ranges of the data set are given in Table 2. To limit the range of a set of values, the data set must be normalized before being submitted to the network and regression models. As a result, the data were normalized using the following equations to adapt them to the transfer or activation function and place the input and output values in the same order between 0 and 1. (Ganguly 2003). The normalized values of the data are also given in Table 2.

$${x}_{\mathrm{normalized}}=s.x+o$$
(3)
$$s=\frac{{d}_{\mathrm{max}}-{d}_{\mathrm{min}}}{{x}_{\mathrm{max}}-{x}_{\mathrm{min}}}$$
(4)
$$o=\frac{{x}_{\mathrm{max}}{d}_{\mathrm{min}}-{x}_{\mathrm{min}}{d}_{\mathrm{max}}}{{x}_{\mathrm{max}}-{x}_{\mathrm{min}}}$$
(5)

Here, \({{\varvec{x}}}_{\mathbf{m}\mathbf{a}\mathbf{x}}\) and \({{\varvec{x}}}_{\mathbf{m}\mathbf{i}\mathbf{n}}\) are the minimum and maximum of data sets, respectively. \({{\varvec{d}}}_{\mathbf{m}\mathbf{a}\mathbf{x}}\) and \({{\varvec{d}}}_{\mathbf{m}\mathbf{i}\mathbf{n}}\) are the maximum and minimum desired values of the output range, respectively (\({{\varvec{d}}}_{\mathbf{m}\mathbf{a}\mathbf{x}}=1,\boldsymbol{ }{{\varvec{d}}}_{\mathbf{m}\mathbf{i}\mathbf{n}}=0\)). \({\varvec{s}}\) and \({\varvec{o}}\) are the scaling and offset parameters, respectively. Also, Eq. (3) can be used to return the network response from the normalized range to the original range (inverse parameter scaling). The normalized operating data were fed as inputs for training RBF and SVM, whereas normalized precipitation efficiency was kept as an output.

Table 2 Data set’s range and normalized value

Network model implementation

The artificial neural network (ANN) is a modern computational method for machine learning, knowledge demonstration, and the use of learned knowledge to predict output responses from complex systems. The main idea of ANN is based on the biological neural system method for data processing and information learning (JingTao and Tan 2001; Yao et al. 1999). The key component of this idea is the creation of new structures for data processing systems. The structure is made up of a large number of interconnected processing elements called neurons that operate together to model the system. Radial basis function (RBF), support vector machine (SVM), and multilayer perceptron (MLP) networks are the most common and practical structures of machine learning approaches. Figure 1 shows the work cycle algorithms of RBF and SVM networks.

Fig. 1
figure 1

Work cycle algorithms a RBF and b SVM networks

The goal of this study is to use RBF and SVM to predict precipitation efficiency in the Bayer process. Four performance metrics were computed for each series to compare the results between the stated networks: the regression coefficient (R2), mean square error (MSE), absolute average deviation (AAD), and absolute average relative deviation (AARD). The regression coefficient (R2) is stated by (Heidari et al. 2016):

$${R}^{2}=\frac{\frac{{\sum }_{i}\left({x}_{i}-\overline{x }\right)\left({d}_{i}-\overline{d }\right)}{N}}{\sqrt{\frac{{\sum }_{i}{\left({d}_{i}-\overline{d }\right)}^{2}}{N}}\sqrt{\frac{{\sum }_{i}{\left({x}_{i}-\overline{x }\right)}^{2}}{N}}}$$
(6)

The following equations define the mean square error (MSE), absolute average deviation (AAD), and absolute average relative deviation (AARD), respectively (Baş and Boyacı 2007; Heidari et al. 2016):

$$\mathrm{MSE}=\frac{1}{N}\sum_{i=1}^{N}{\left({y}_{i}-{y}_{di}\right)}^{2}$$
(7)
$$\mathrm{AAD}=\frac{1}{N}\sum_{i=1}^{N}\left(\left|{y}_{i}-{y}_{di}\right|\right)$$
(8)
$$\mathrm{AARD}=\left(\left(\sum_{i=1}^{N}\left|{y}_{i}-{y}_{di}\right|/{y}_{i}\right)/N\right)\times 100$$
(9)

The experimental and computed responses are \({y}_{i}\) and \({y}_{di}\), respectively, and N is the number of experimental data. (N = 1560 data points). Due to high-temperature changes during the year, the data were divided into two parts, and the modeling for the first half of 2020 and 2021 (1023 data points) and the second half of 2020 (537 data points) was done separately.

Radial basis function (RBF)

The radial basis function was used in 1988 due to its generalizability and simple structure, which prevented extended and unnecessary calculations. RBF is made up of three precursor layers that receive the input vector in the first layer and propagate the results to the middle layer to generate the network output in the last layer. In RBF, the general form of the approximation function is presented as follows (Vt and Shin, 1994):

$$G(x)=\sum_{i=1}^{M}{w}_{i}\varphi (\Vert x-{c}_{i}\Vert )$$
(10)
$$\varphi (s)={e}^{-\frac{{s}^{2}}{2{\sigma }^{2}}}$$
(11)

Weights (\({w}_{i}\)) can be obtained using the least squares method by having or estimating \({c}_{i}\) and \(\sigma\) for each intermediate neuron.

Support vector machine (SVM)

Support vector machine (SVM) is a highly supervised machine learning technique that was originally developed for organization problems and then extended to regression tasks in 1995. In order to minimize error, SVM uses structural risk minimization (SRM) principles that work better than experimental risk minimization (ERM) principles (Andras 2002). By selecting a function, SVM estimates the target’s true value as close to the reference value as possible. Then, it measures the overall pattern and the maximum permissible deviation of the targets from the estimated values for all training data. Finally, the input parameters are transferred to the multidimensional region using a nonlinear mapping function of the Gaussian nucleus, in which a linear model is estimated according to Eq. (13) (Byvatov et al. 2003):

$$f\left(x\right)=\sum_{i=1}^{l}\left({a}_{i}^{*}-{a}_{i}\right).\varphi \left({x}_{i}\right).\varphi \left({x}_{j}\right)+b$$
(12)

where \(\varphi \left({x}_{i,j}\right)\) is the internal multiplications of the vectors in the multidimensional region, which can be represented by the kernel function according to Eq. (14) (Byvatov et al. 2003).

$$f\left(x\right)=\sum_{i=1}^{l}\left({a}_{i}^{*}-{a}_{i}\right).K\left({x}_{i},{x}_{j}\right)+b$$
(13)
$$K\left({x}_{i},{x}_{j}\right)=\varphi \left({x}_{i}\right).\varphi \left({x}_{j}\right)$$
(14)

where \(l\) represents the support vectors number, \({a}_{i}^{*}\), \({a}_{i}\), and \(b\) are the Lagrange coefficients and bias, respectively, and \(K\left({x}_{i},{x}_{j}\right)\) is the kernel function.

Results and discussion

Machine learning approaches are a set of interconnected mathematical neurons that create a model of complex functional relationships. With the weights and biases specified during training, this model should be able to compute output values from input values derived from internal calculations.

Application of RBF

Designing RBF networks depends on adjusting the radial function’s parameters to provide the best possible approximation error for training data. Determining the number of radial, center, width, or variance functions and the weight coefficients for these functions is the key problem in designing such networks. The spread factor is used to measure the dispersion of data. The spread factor narrows the available options, whereas a higher number provides more flexibility. The centers of the basic functions need not be fully adapted to training samples, but they should be distributed uniformly through the feature vector space. To put it another way, RBF networks conceptually provide a local approximation of the main function in the neighborhood of each center, and by combining these local approximations, a general approximation of the considered function is supplied throughout the entire feature vector space.

Typically, the RBF network learning process consists of two steps: (i) Learning the centers and widths of radial functions in the hidden layer; (ii) Learning the weights connecting the middle layer to the output layer. Neurons in an RBF network are often chosen through trial and error. The learning method is initialized with a large number of neurons in the hidden layer, and then the number of neurons is gradually decreased. As the number of neurons in the network is reduced, the training method converges to the desired error, which may be determined by testing the network with its untrained inputs. Table 3 depicts the design of the RBF network and the results of the statistical tests (R2, MSE, AAD, AARD, and run time) that compare the actual and predicted precipitation efficiency for the training and test data, regardless of the number of neurons and spread factor.

Table 3 Design of the RBF network and the results of the statistical tests (R2, MSE, AAD, AARD, and run time), regardless of the number of neurons and spread factor

The spread factor, or variance, and the optimum number of neurons are the parameters gained by trial and error in RBF training. 85% and 15% of the total data were used for training and testing, respectively. Table 4 shows the optimal RBF network layout and the results of statistical tests (R2, MSE, AAD, AARD, and run time) comparing actual and predicted precipitation efficiency for training and testing data. As a result, the effect of these two parameters on RBF network optimization can be observed.

Table 4 Comparison of RBF network performance in different spread factor with 100 and 53 neurons and different neurons with spread 0.1 and 0.2 for first half and second half of the years, respectively

Then, the RBF network with a spread factor of 0.1 and 100 neurons for the first half of 2020 and 2021 and a spread factor of 0.2 and 53 neurons for the second half of 2020 was trained for the inputs and target data of the precipitation process. The results of statistical tests between the experimental and predicted precipitation efficiency for training and testing data are shown in Fig. 2. It should be noted that the proposed ANN model’s parameters, such as neuron values, weights, and biases, are accessible via an Excel file provided as supplementary material.

Fig. 2
figure 2

RBF network regression performance for precipitation efficiency: a training and b testing data for the first half of 2020 and 2021; c training and d testing data for the second half of 2020

Application of SVM

The support vector machine uses a method called the kernel trick. This method contains functions that convert the input space of low dimension to the space of higher dimension. This transformation turns a non-separable problem into a separable problem. These are referred to as kernel functions. Kernel functions are more useful in nonlinear separation problems. These functions transform some extremely complex data and then find a process that can be used to separate the data based on some user-defined labels.

Typical methods for RBF network design are unnecessary for the SVM network. The numbers of radial functions utilized, as well as the centers of those functions, determine the feature space dimensions in this network automatically. The complexity of the problem is independent of the dimensions of the data, and the number of hidden layers and weights are chosen automatically.

In order to model the system with the support vector machine network and change the network settings, such as selecting the kernel function, initial guessing of alpha values, data standardization, and kernel function scale, appropriate expressions can be applied to the network in the form of the input argument of the SVM function. As mentioned earlier, a support vector machine is used for an optimization problem. In addition, modeling software has the ability to find the optimal parameters for this network. First, the network is run with the following command to determine the optimal values. Table 5 shows the optimal values of the SVM network for two data sets.

Table 5 Optimal values of SVM network parameters for two data sets

So, the selected values can be used for optimal network design according to Table 5. The SVM network has different solvers. To find the correct solver with minimum MSE and maximum R2, the SVM network was performed with different types of solvers.

As shown in Table 6, the ISDA solver has the best performance among the other solvers. Therefore, using this solver and the optimal values in Table 5, the network regression diagram for precipitation efficiency is shown in Fig. 3.

Table 6 Comparison of SVM network performance with different solvers for two data sets
Fig. 3
figure 3

SVM network regression performance for precipitation efficiency a training and b testing data for the first half of 2020 and 2021, c training and d testing data for the second half of 2020 with the ISDA solver

Therefore, the SVM network was first obtained with optimal values and trained with a Gaussian function. Then, regression was performed between network output data and target values (experimental), and the regression coefficient was 0.99914 and 1 for the two data sets, respectively.

Sensitivity analysis

As can be seen from the previous sections, the RBF network has the best performance compared to the SVM network. In addition, the RBF convergence was guaranteed due to the same results in each run as opposed to the MLP network and the lack of need to examine the network convergence, unlike the SVM network. The longer run time of the RBF network than the SVM network was ignored, and it was selected as the optimal network. Finally, this network was used for sensitivity analysis.

Because ANN is a black-box model, it cannot provide evidence about what is going on during the process. In most circumstances, determining which parameter has the highest impact on process performance is required. A well-trained network can analyze the relevancy importance or factor (r) of each input parameter. As the relevancy factor increases, the effect of input parameters increases. The relevance factor is calculated from Eq. (15) (Heidari et al. 2016; Mhurchú and Foley 2006):

$$\mathrm{r}=\frac{{\sum }_{i=1}^{N}({a}_{z,i}-{a}_{z,\mathrm{average}})({Y}_{i}-{Y}_{\mathrm{average}})}{\sqrt{{\sum }_{i=1}^{N}{({a}_{z,i}-{a}_{z,\mathrm{average}})}^{2}\times {\sum }_{i=1}^{N}{({Y}_{i}-{Y}_{\mathrm{average}})}^{2}}}$$
(15)

where \(a\) and \(Y\) are the input and output parameters, respectively.

This ratio provides information about a process that is typically obtained through white-box modeling and is useful for optimization and control. In order to maintain high requirements in modeling, it is important for the modeler to provide a confidence assessment. To do so, it is necessary to first conduct an analysis of the level of uncertainty associated with the outcomes of the model (uncertainty analysis) and then determines the relative importance of the various inputs in generating that uncertainty. The second of these problems is tackled by this technique called sensitivity analysis (uncertainty analysis is typically a prerequisite), which ranks the inputs’ intensities and relevance in determining the output’s variability.

Figure 4 demonstrates that the solid content, concentration of Na2Oc, and concentration of Al2O3 are all in the same range (0.130–0.145). This implies that the influence of these parameters on precipitation efficiency is very similar. Ambient temperature was predicted to have a greater effect on precipitation efficiency than other inputs because of the wide range of variation in ambient temperature data available from industry. As shown in Fig. 4, the relative factor of ambient temperature has the highest value (0.18). It follows that the relative factor is profoundly affected by the magnitude of the changes that can be made to a quantity. This is also why factors like tank temperature and residence time have a relatively minimal impact.

Fig. 4
figure 4

Relative factor of operating parameters on precipitation efficiency. The error bars represent the standard deviation for 100 runs

Model generalization

After choosing the best network with the optimal number of neurons and spread factor, it can be used to predict precipitation efficiency for a variety of inputs within the training data domain. In 3D plots, the precipitation efficiency is plotted against operating parameters. The generalization performances of the RBF network, as shown in Fig. 5, illustrate no oscillations, confirming the network’s appropriate prediction capability. Figures 5a–h, represent the effects of various input operating parameters on precipitation efficiency.

Fig. 5
figure 5

Model generalization of optimal network, effect of a concentration of Al2O3 and Na2Oc; b tank temperature and concentration of Al2O3; c residence time and tank temperature; d solid content and ambient temperature on precipitation efficiency for the first half of 2020 and 2021, while other factor are kept constant at 129.6 gr/lit Na2OC concentration, 130.10 gr/lit Al2O3 concentration, 57 °C tank temperature, 13 °C ambient temperature, 5.50 h residence time and 379 gr/lit solid content. effect of e solid content and residence time; f ambient and tank temperatures; g concentration of Al2O3 and Na2Oc; h tank temperature and concentration of Al2O3 on precipitation efficiency for the second half of 2020, while other factors are kept constant at 124 gr/lit Na2OC concentration, 121.4 gr/lit Al2O3 concentration, 56.3 °C tank temperature, 17.5 °C ambient temperature, 5.42 h residence time and 321 gr/lit solid content

The data gathered for this research originates from Jajarm’s alumina hydroxide precipitation unit and analyzes the parameters that influence the efficiency of precipitation-based production. Production efficiency is directly affected by the parameters of temperature, solid content, residence time, aluminum oxide concentration, and sodium oxide concentration; however, some of these variables have an overlapping influence on each other, as is seen in the presented results.

According to the 3D plots, the ideal temperature range for the precipitation process is between 55 °C and 65 °C, where the precipitation effectiveness is suitable. In accordance with Eq. (16), an increase in temperature increases the growth rate, and in accordance with Eq. (17)), it decreases super-saturation. A temperature range has been proposed to improve efficiency based on the interaction between these two variables (Ying Zhang 2009).

$$-\left(\frac{1}{S}\right)*\left(\frac{d\sigma }{dt}\right)=k{\sigma }^{n}$$
(16)
$$\sigma =\frac{A-{A}_{\mathrm{eq}}}{{A}_{\mathrm{eq}}}$$
(17a)
$${A}_{\mathrm{eq}}=C*\mathrm{exp}\left(6.2106-\frac{2486.7}{T+273.15}+\frac{1.0875}{T+273.15\times C}\right)$$
(17b)

where \(\sigma\) is the degree of super-saturation, \(k\) is the growth rate constant, \(n\) is the degree of reaction, \(S\) is the total surface area of the particles, \(T\) is the temperature (\(^\circ{\rm C}\)), \(C\) is the caustic concentration (\(\mathrm{gr}/\mathrm{lit}\)), and \({A}_{eq}\) is the equilibrium solubility of aluminum (\(\mathrm{gr}/\mathrm{lit}\)).

Due to the inclusion of Al2O3 in the calculation of alumina module, \(\alpha\) (Eq. (2)), the concentration of Al2O3 has a direct effect on precipitation efficiency, as shown by the results of the Al2O3 concentration level. Increasing the amount of Al2O3 causes \(\alpha\) to decrease at the inlet and increase at the outlet of the tank (super-saturation to increase), which in turn improves precipitation efficiency. The direct effect of Na2Oc on \(\alpha\) follows the same pattern as the prior experiment (concentration of Al2O3). When Na2Oc is reduced, \(\alpha\) is decreased at the tank’s entrance (super-saturation is raised), and precipitation efficiency improves.

The precipitation process is based on mass transfer from the solution phase to the solid phase, and as stated earlier, this process consists of the time-consuming stages of nucleation, growth, and agglomeration. It takes between 4 and 5.5 h, depending on the temperature, the degree of super-saturation, and the concentration. The most efficient precipitation rates are achieved during this timeframe.

According to the liquor input to the precipitation tanks and the mass transfer of Al2O3 from the solution phase to the solid phase during the precipitation process, the solid content in the solution is low, the degree of super-saturation of the liquor solution is high, and as a result, precipitation efficiency is improved.

The super-saturation rate and precipitation efficiency increase as the Na2Oc concentration rises, and the Al2O3 concentration falls at the reactor’s inlet. Similarly, as the temperature difference increases, less Al2O3 remains in the liquor solution, leading to faster mass transfer. As a result, as tank and ambient temperatures rise, so does precipitation efficiency. Mass transfer and precipitation efficiency improve with a higher residence time, which gives more opportunity to the nucleation, growth, and agglomeration steps. As the solid content of the liquor solution decreases, more Al2O3 dissolves in the solution and consequently increases its separation rate. As a result, lowering the solid content improves precipitation efficiency.

Optimization of the precipitation efficiency

Precipitation efficiency must be maximized in the Bayer process. These input parameters, however, do not respond the same within the operating condition domain, as indicated by 3D plots in Fig. 5. Raising the tank temperature, for example, increases the precipitation efficiency, whereas increasing the ambient temperature decreases the precipitation efficiency. In the Bayer process, there are two methods for determining the optimal operating parameters for precipitation efficiency: (i) minimizing the concentration of Na2Oc, and (ii) limiting the total consumed energy in the reactor.

The optimization technique for the aforementioned process in an industrial precipitation reactor is based on minimizing the concentration of Na2Oc. According to the sensitivity analysis diagram, Na2Oc has a significant effect on precipitation efficiency, and we can control this parameter in the Bayer process. The trained network, which was used to estimate precipitation efficiency in an industrial reactor, may be securely employed to predict the optimal values of input parameters in the Bayer process to get the best outcomes. When the network’s input is in the range of the minimum to maximum operating conditions, we can trust the anticipated output data by employing the trained network.

For this purpose, a new data set containing various network input parameters was produced as the main and available parameters that affect reactor operating conditions (concentration of Na2Oc, concentration of Al2O3, tank and ambient temperatures, residence time, and solid content). All of the data were within the range of the reactor’s actual working conditions. Five of the six parameters in the generated data set were held constant, while the sixth parameter was adjusted throughout a variety of actual operating conditions. This was done for each of the six parameters, and a total of 1800 data points were collected. The network was then employed to estimate precipitation efficiency using this data set as input.

The 1800 data points used in the optimization of precipitation efficiency are proposed points. These 1800 data points fall between the minimum and maximum of industrial data. Since industrial data are reliable, selecting 1800 points at random for each input is also trustable. Finally, it is possible to assert that the 1800 data points between the minimum and maximum ranges of reliable industrial data were generated at random. In addition, the machine learning-based model has made accurate predictions for this range of industrial data. Therefore, 1800 data points or any number of data points selected from industrial data within the interval of minimum and maximum inputs are reliable.

Precipitation efficiency is defined as the concentration of Na2OC and Al2O3 in the entered feed, as previously stated. This quantity must not exceed a set limit and must be adjusted in accordance with the required specification. Therefore, based on the available data of the Jajarm Refinery Company, the minimum and maximum calculated precipitation efficiency should be 40% and 49%, respectively. The best parameters in the Bayer process were explored within the data set with the lowest permissible value (40%) of precipitation efficiency to minimize total production cost. As given in Table 7, the process’s optimum conditions were chosen as follows: concentration of Na2OC = 107.3 g/lit; concentration of Al2O3 = 128.34 g/lit; tank temperature = 55.93 °C; ambient temperature = −1.5 °C; residence time = 5.22 h; and solid content = 254.22 g/lit for the first half of 2020 and 2021; and concentration of Na2OC = 107.9 g/lit; concentration of Al2O3 = 128.81 g/lit; tank temperature = 56.31 °C; ambient temperature = −3 °C; residence time = 5.13 h; and solid content = 275.43 g/lit for the second half of 2020.

Table 7 Precipitation efficiency under various operational conditions with a precipitation efficiency of 40% and the minimum Na2Oc concentration (the optimum point shown in bold)

Since the sensitivity analysis demonstrated that the ambient temperature had a significant impact on the precipitation efficiency, it was considered one of the network inputs, despite the fact that the controller had no control over it. Two cases with constant ambient temperature are investigated to provide a set of best operating conditions for various fixed ambient temperature values: 1. the case of the first half of 2020 and 2021 with the average ambient temperature, 2. the case of the second half of 2020 with the average ambient temperature. Then, the RBF network is used to determine the optimal available values for the remaining parameters based on these two cases. Table 8 displays the precipitation efficiency under optimum operating conditions and a constant ambient temperature for the first half of 2020 and 2021 and the second half of 2020.

Table 8 Precipitation efficiency under optimum operating conditions and a constant ambient temperature for the first half of 2020 and 2021 and the second half of 2020 (the optimum point shown in bold)

Conclusion

In this paper, the efficiency of precipitation as a function of operating parameters was explored and modeled using machine learning algorithms. The model input parameters were concentration of Na2Oc, concentration of Al2O3, tank temperature, ambient temperature, residence time, and solid content. The results show that the machine learning approaches may be employed safely in the Bayer process for modeling precipitation efficiency. The findings of the modeling revealed that the estimated outcomes and industrial data for the training and testing data sets are in good agreement. The general results of the study are summarized below:

  • In order to find an appropriate algorithm and predict the precipitation efficiency, the structures of RBF and SVM networks were examined.

  • Based on the trial-and-error algorithm, the RBF network with a spread factor of 0.1 and 0.2 and a number of neurons of 100 and 53 for the first half of 2020 and 2021 and the second half of 2020, respectively, was chosen as the best RBF structure to predict precipitation efficiency within the training data domain.

  • The average relative error in predicting the precipitation efficiency of training points was less than 0.001%.

  • ANN’s model generalization within the domain of training data also exhibits superior prediction ability.

  • The evaluated model can be used to determine the optimal operating conditions for the Bayer process based on the minimum concentration of Na2Oc by generating new datasets covering a broad range of possible operating conditions.

  • The comparison findings (R2, MSE, AAD, AARD, and run time) demonstrated the network’s capacity to predict precipitation efficiency accurately over a wide range of operating parameters.