Introduction

The need to improve the efficiency of engineering systems in various sectors such as chemical, mechanical, electronic and marine industries is rapidly increasing. There has been a great deal of research in these fields over the past years published as journal or conference papers [1,2,3,4]. These research spans from investigations studying heat transfer rate in human body [5, 6] to those looking for ways of increasing heat transfer coefficient in heat exchangers and industrial heat recovery steam generators [7,8,9,10] and fuel cells [11]. The use of nanofluids which are mixtures of a basic fluid with metallic or non-metallic solid particles at nanoscale has attracted the attention of many scientists during these years [12]. For instance, Choi [13] introduced the use of nanoparticles with base fluids to tackle the problem of low thermal conductivity of fluids. In this investigation, the need for more energy-efficient techniques was addressed by using copper nanoparticles. The results revealed considerable reduction in pumping power in heat exchangers. Thereafter, much research has been done that emphasizes the importance of using nanofluids to improve thermal performance of systems. Nanofluids are able to modify heat transfer of devices with various dimensions [14, 15]. Some studies focused on the use of nanofluids in conventional channels with macroscale dimensions [16,17,18] while others studied utilization of nanofluids in microchannels [19,20,21,22]. The effect of different nanoparticles types and various base fluids was studied as well [23,24,25].

It is important to know the thermophysical properties of nanofluids in order to model their heat transfer in different types of mediums such as porous or conventional [26,27,28,29,30,31]. Thus, a lot of research has focused on prediction of thermal conductivity of nanofluids. In a review article by Ahmadi et al. [32], it was shown that an increase in concentration and temperature of nanoparticle results in higher thermal conductivity of nanofluids. Utilization of hybrid nanofluids was suggested to improve the thermophysical properties of the nanofluid as well. Yildiz et al. [33] studied experimental and theoretical thermal conductivity model on thermal performance of a hybrid nanofluid. Al2O3–SiO2/water hybrid nanofluid was used in their study. It was revealed that the use of hybrid nanoparticles can lead to the same heat transfer rate at lower particle volume fractions. In an experimental study by Shamaei et al. [34], it was shown that the increment in temperature and solid phase fraction of nanofluid can considerably increase its thermal conductivity. Different volume fractions and temperatures of double-walled carbon nanotubes (DWCNTs) suspended in ethylene glycol (EG) were considered. Cacua et al. [35] studied thermal conductivity of TiO2/water nanofluid experimentally. During measurements of thermal conductivity, agitation of nanofluid was investigated, and it was shown that this can have considerable impact on the results since a great amount of nanoparticles can adhere to experimental apparatus. In addition, it was shown that thermal conductivity of the nanofluid increases markedly as the temperature rises.

The use of intelligence methods in determining thermal conductivity of nanofluids has become prevalent in recent years [36,37,38,39]. Artificial neural network (ANN) and support vector machine (SVM) are two common methods for modeling engineering problems [40]. Thermal conductivity of a nanofluid is dependent to various parameters like concentration and dimensions of nanoparticles as well as its temperature [41]. Group method of data handling (GMDH) and multilayer perceptron ANNs are broadly applied for estimating and forecasting the properties of nanofluids [41, 42]. In a study by Komeilibirjandi et al. [43], mathematical correlation and GMDH were investigated to predict nanofluids’ thermal conductivity which had CuO nanoparticles. It was shown that the R-squared and the average absolute relative deviation for the artificial neural network method are more appropriate for prediction of thermal conductivity of nanofluids with CuO nanoparticles. Vafaei et al. [44] conducted an investigation of thermal conductivity of MgO-MWCNTs/EG hybrid nanofluid. Initially, thirty-six data set from experimental tests was considered. The temperature of the hybrid nanofluid was in the range of 25–50 °C, and six different nanoparticle volume fractions from 0.05 to 0.6% were considered. They studied four various artificial neural networks from 6 to 12 neurons in hidden layer. The results showed that the model of 12 neurons in hidden layer has the most favorable result with the maximum derivation of 0.8%. Different factors including the training algorithm of ANNs, their type and number of hidden neurons influence the reliability of the proposed models. Depending on the nature of the problem, these parameters can be obtained.

One of the nanoparticles which is widely employed in heat transfer fluids is silica (SiO2). Nanofluids with SiO2 are employed in different thermal devices such as solar collectors, heat exchangers and heat pipes. Favorable stability of these particles in different base fluids is one of its main advantages. In this study, thermal conductivity of nanofluids with SiO2 particles is modeled by employing different ANN-based approaches. The selected methods for the modeling are GMDH and multilayer perceptron with two different training algorithms. The most important novelties of the present research are consideration of various base fluids and effect of size on the output of the model. Moreover, the performance of different ANN-based approaches is compared and investigated in terms of MSE and R-squared values. Finally, a correlation, based on GMDH, is proposed to determine the thermal conductivity of the considered nanofluids.

Methods

In this study, GMDH in addition to a conventional type of ANN with two training algorithms is applied for modeling the thermal conductivity of nanofluids with SiO2 nanoparticles. These algorithms are selected for modeling since they have shown reliable outputs in several researches in this field. Mathematically, GMDH algorithm is based on the Volterra functional series in the form of bivariate quadratic polynomial as represented in Eq. (1) [45, 46].

$${y}_{{\rm i}}=f\left({x}_{{\rm i}},{x}_{{\rm j}}\right)={a}_{0}+{a}_{1}{x}_{{\rm i}}+{a}_{2}{x}_{{\rm j}}+{a}_{3}{x}_{{\rm i}}{x}_{{\rm j}}+{a}_{4}{x}_{{\rm i}}^{2}+{a}_{5}{x}_{{\rm j}}^{2}$$
(1)

In this equation, the Volterra series is changed into a set of chain recursive equations; therefore, the Volterra series is regenerated by the algebraic repositioning of each of the recursive equations. Complex systems that include m input and an output variable can be decomposed to \({{c}_{{\rm m}}}^{2}=\frac{m(m-1)}{2}\) simple partial system with two inputs and one output. However, all the outputs are considered the same and equal to that of the complex system. To combine two partial systems into a single system and to form a new system that has variables from both former systems, it is sufficient to remodel the output of models with n inputs. GMDH algorithm performs modeling by applying two rules. The first one is to obtain a model in which almost all the variables of the system are represented. The second requirement is to obtain a model whose output error is lower than that of the other models that are calculated in the previous steps.

GMDH neural network is a self-organizing and one-directional network that has different layers in which each layer has several neurons. The neurons’ structures are similar. Weights (w) are set in each neurons as specific and constant values on the basis of singular value decomposition (SVD) and solving normal equation (SNE) techniques. The prominent feature seen in this type of network is that the neurons of the previous layer (m) are responsible for generation of \({{c}_{{\rm m}}}^{2}\) new neurons. Among the generated neurons, some of them must be eliminated to prevent divergence of the network. These neurons are called dead neurons. This algorithm is described in more details in Refs. [47, 48].

In addition to GMDH multilayer perceptron (MLP), ANN is applied for modeling. The procedure of this algorithm has been explained in details in several articles [41, 42]. Various factors such as number of hidden layers and neurons in addition to the utilized training function affect the performance of these models. In the current study, two training algorithms including Levenberg–Marquardt and scaled conjugate gradient are used. Since the problem is not very complex, just one hidden layer is applied in the models. Different numbers of neurons from 3 to 10 are tested to determine the best one on the basis of mean square error (MSE) value. The appropriateness of the models and their accuracy are assessed on the basis of R-squared and MSE which are defined as [43]:

$${R}^{2}=1-\frac{\sum_{i=1}^{i=n}{\left({y}_{{\rm i}}^{{\rm experimental}}-{y}_{{\rm i}}^{{\rm predicted}}\right)}^{2}}{\sum_{i=1}^{i=n}{\left({y}_{\text{i}}^{{\rm experimental}}-\overline{{{y}^{{\rm experimental}}}}\right)}^{2}}$$
$$MSE=\frac{1}{n} \sum\limits_{i=1}^{i=n}{\left({y}_{{\rm i}}^{{\rm experimental}}-{y}_{{\rm i}}^{{\rm predicted}}\right)}^{2}.$$

Results and discussion

As indicated in the previous section, GMDH is one of the methods employed for thermal conductivity modeling. The data for modeling are obtained from various experimental studies which have investigated the thermal conductivity of nanofluids with silica particles and different base fluids such as water, ethylene glycol, glycerol and mixture of water and ethylene glycol [49,50,51]. The ranges of variables are shown in Table 1. For this method, the data sets were divided into two groups known as test and training. Here, 30% of data sets were randomly utilized for testing the trained networks. The obtained function for the thermal conductivity on the basis of the considered inputs is provided in Appendix, where \({x}_{1}\), \({x}_{2}\), \({x}_{3}\) and \({x}_{4}\) are base fluid thermal conductivity (W m−1 K−1) at 25 °C, mean size of particles (nm), volume fraction of silica nanoparticles in base fluid (%) and temperature (°C), respectively. In Fig. 1, the values of thermal conductivity in actual condition (measured in experimental studies) are compared with outputs of the applied model. The value of \({R}^{2}\) for the case of applying GMDH is 0.9997.

Table 1 Variable ranges in the considered experimental studies
Fig. 1
figure 1

Comparison of the model outputs and measured values for GMDH-based model

The value of \({R}^{2}\) must be between 0 and 1. Closer values of \({R}^{2}\) to the unity demonstrate higher model reliability and lower differences between the actual values and the quantities obtained by the regression. In addition to this parameter, using relative error can be useful to assess the precision of the model. The maximum absolute value of this factor for the model proposed by using GMDH is approximately 1.75%, as shown in Fig. 2.

Fig. 2
figure 2

Relative error of the predicted data for GMDH-based model

Similar to numerical and computational approaches [52,53,54], different parameters influence the output of model. Training algorithm is one the most important ones. Various training algorithms have been employed in multilayer perceptron ANNs. In this study, scaled conjugate gradient (SCG) and Levenberg–Marquardt (LM) have been tested. In addition to the training algorithm, ANN structure such as number of neurons influences the model performance. In this study, according to the complexity degree of the problem, one hidden layer is considered. Furthermore, different numbers of neurons in range of 3 and 10 were tested to find the most accurate one. The criterion used for comparing the models is MSE value. In Table 2, MSE values for different numbers of neurons for the case of using SCG algorithm are represented. The minimum value of MSE, indicating the highest reliability, is obtained for 8 neurons in the hidden layer.

Table 2 MSE values for different numbers of neurons for SCG-based model

Since using 8 neurons results in the most favorable forecast, the outputs of the model are compared for this case. As illustrated in Fig. 3, \({R}^{2}\) is equal to 0.9991. Similar to GMDH-based model, the model is reliable based on its \({R}^{2}\) value since it is very close to 1. The relative error of the forecasted values is represented in Fig. 4. In the case of using multilayer perceptron with SCG training algorithm, the highest absolute relative error is approximately 4%.

Fig. 3
figure 3

Comparison of the model outputs and measured values for SCG-based model

Fig. 4
figure 4

Relative error of the predicted data for SCG-based model

Similar to the case of using SCG training algorithm, various numbers of hidden neurons are checked for LM algorithm. In Table 3, MSE values for different numbers of neurons are represented. In the case of employing LM for network training, the minimum value of MSE is obtained by applying 9 neurons in the hidden layer which is equal to 0.0000078. In comparison with the cases of utilizing SCG algorithm, MSE values of LM algorithms are lower which reveals more accurate performance of LM in training the network.

Table 3 MSE values for different numbers of neurons for LM-based model

In Fig. 5, the determined data by LM method are compared with the measured thermal conductivities in the previous experimental studies. As shown in this figure, \({R}^{2}\) is 0.9998, which is very close to 1. In Fig. 6, relative error of the model that used LM algorithm for training is shown. In this condition, the maximum absolute relative error is approximately 3%, which is higher than the corresponding value of the GMDH-based model.

Fig. 5
figure 5

Comparison of the model outputs and measured values for LM-based model

Fig. 6
figure 6

Relative error of the predicted data for LM-based model

In Fig. 7, MSE values of the different models are represented. As represented in this figure, the highest value of MSE belongs to the multilayer perceptron ANN with SCG training algorithm. Moreover, it can be noticed that using LM algorithm for training leads to the lowest MSE quantity. Despite the higher absolute relative error of the model in the case of using LM training algorithm compared with the GMDH approach, its MSE value is lower. This is due to lower deviation in the majority of predicted values in the case of applying LM.

Fig. 7
figure 7

MSE values of different approaches

Conclusions

Nanofluids containing silica particles are employable in thermal engineering since they have acceptable stability and favorable thermal properties. In the current article, MLP and GMDH neural networks are applied for thermal conductivity modeling. For the cases of applying MLP, two training algorithms including Levenberg–Marquardt and scaled conjugate gradient are evaluated. The main findings of the current study are summarized as follows:

  • All of the trained ANNs are able to properly model thermal conductivity of the nanofluids.

  • The accuracy of the proposed models is dependent on the training algorithm, utilized approach as well as network structure.

  • The minimum value of MSE, which is approximately 0.0000078, is observed in case of applying ANN with Levenberg–Marquardt training algorithm.

  • The values of \({R}^{2}\) in the cases of applying LM, SCG and GMDH are 0.9998, 0.9991 and 0.9997, respectively.

  • Using 8 and 9 neurons in the hidden layer of trained networks with SCG and LM, respectively, results in obtaining the most accurate models.

  • In all of the cases, the maximum relative deviations of the models does not exceed 4%, which reveals their acceptable reliability.