1 Introduction

Trichlorofluoromethane, R11 was supposed to be the first widely used refrigerant as low-pressure centrifugal chillers. R-11 is a non-corrosive, non-toxic, and non-flammable refrigerant. Its low operating pressures and relatively high compressor displacement require the use of a centrifugal compressor. Selection of refrigerant based on initial and operating costs, system design, size, safety, reliability, serviceability etc. is important for applications such as microprocessor cooling, cooling of high power electronic equipment, compact heat exchangers, and even compact fuel cells. Even though its use has been discontinued in recent days due to its high ozone depleting potential but enormous amount of experimental data is available in literature that help to understand the mechanism of flow boiling in mini-channels. In the present work, a part of this data has been used to assess the applicability of SVR to predict the heat transfer coefficient in mini-channels for both the subcooled as well as the nucleate boiling regions. Future work would entail the use of data on eco-friendly refrigerants to develop more robust soft computing models. Based on the hydraulic diameter, the flow channels have been grouped into the conventional macro-channels, micro-channels and mini-channels as follows [1,2,3,4]:

  • Conventional channels: Dh ≥ 3 mm

  • Mini-channels: 3 mm ≥ Dh > 200 μm

  • Micro-channels: 200 μm ≥ Dh > 10 μm

Flow boiling heat transfer in mini-channels has attracted the interest of many researchers. The use of mini and micro-channels with suitable fluid could provide high heat transfer; reduce the cost of material and save space. It has been found from the literature that the flow boiling is governed by both nucleate boiling and forced convective heat transfer mechanisms. The heat transfer coefficient in nucleate boiling regime is mainly dependent upon heat flux and mass velocity and vapor quality has negligible effect on heat transfer coefficient while in the forced convection boiling dominant regime, the heat transfer coefficient has been found to be dependent on mass velocity and vapor quality, and is less sensitive to heat flux [5]. Furthermore, the forced convection boiling region is usually related to the annular flow pattern, and the nucleate boiling region with the bubbly and slug flow patterns [6, 7]. So, mass velocity, vapor quality, and heat flux, and flow pattern help in recognizing the dominant heat transfer mechanism in a micro-channel/mini-channel.

A good design of heat transfer equipment requires accurate prediction of flow boiling heat transfer coefficient, as this parameter is responsible for the surface area and thereby the weight and cost of the equipment. A number of correlations in the literature can be found for the prediction of flow boiling heat transfer coefficient [8,9,10,11]. But these correlations have some limitations for their range of applicability under different operating conditions. Olayiwola and Ghiaasiaan [12, 13] have also applied and tried to predict the flow boiling heat transfer coefficient with the several existing correlations using the dataset available in the literature [14,15,16]. They have found that the performance of each correlation is different for a different dataset, and none of the correlations could provide an accurate prediction of all the three datasets. In the recent past, support vector machines (SVMs) have been adapted for regression. Support vector regression (SVR), a data-driven modeling technique is able to map causal factors and consequent outcomes from the observed patterns (experimental data), without deep knowledge of the complex physical process and this modeling technique is now becoming popular among engineers. Moreover, it offers a lot of advantages over the traditional existing techniques like artificial neural networks (ANN) in the sense that SVM gives a unique, optimal and global solution for the quadratic programming problem, and only two parameters need to be selected: upper bound and kernel parameter. This is because of the structural risk minimization (SRM) principle of SVM which provides good generalization performance. Two more advantages of SVMs are that they have a simple geometric interpretation and give a sparse solution. SVMs find use in many applications such as in predicting the thermal-hydraulic performance of compact heat exchangers [17], prediction of circulation rate in thermosiphon reboilers [18], streamflow forecasting [19], landslide prediction [20], time series prediction [21], pharmaceutical data analysis [22], prediction of heavy metal removal efficiency from waste water [23, 24] etc. Its excellent prediction performance to many real world problems has motivated the authors to apply SVR for the modeling of both saturated and subcooled boiling of R11 in mini-channels. To the best of the authors’ knowledge, there is no published literature available on the application of support vector regression (SVR) for modeling of flow boiling processes in mini-channels. It is for the first time that SVR has been applied to predict the heat transfer coefficient of R11 in mini-channels.

In this study, SVR-based models have been developed and validated for both subcooled and saturated boiling with the experimental data available from the literature of boiling flow using the refrigerant R-11 [14]. Furthermore, the results obtained from the developed SVR-based models have been compared with those obtained from several existing correlations. The SVR-based models have shown a high degree of agreement with the experimental results.

2 Abridged theory of SVR-based modeling

SVM theory in detail can be well understood from various works in the literature [25,26,27,28]. SVR-based models do not suffer from over-fitting problems i.e. high accuracy for training dataset and low for test dataset (unseen) as they are based on the SRM principle which always guarantees a unique, global and optimal solution. SVR-based models do not rely on the amount of data. Even with a fewer amount of data, they can give a good prediction. However, the only disadvantage of using SVR technique is that it requires optimization of the model and kernel parameters to give excellent prediction with higher accuracy i.e. SVR-based parameter needs to be optimized via some optimization techniques like grid search methodology with 10-fold cross-validation, genetic algorithm (GA), particle swarm optimization (PSO), differential evolution (DE) etc.

In ε-SVR regression, a function is fitted so as to correctly predict the outputs yi for a new set of independent input variables, xi. Training dataset for a typical regression problem is as follows:

$$ \left\{\left({\mathrm{x}}_1,{\mathrm{y}}_1\right)\dots \left({\mathrm{x}}_1,{\mathrm{y}}_1\right)\right\}\subset \mathrm{X}\times \mathfrak{R} $$
(1)

where x denotes the space of the input patterns.

The regression function in the feature space can be estimated as:

$$ f\left(\mathrm{x},w\right)=\left(w\bullet \varnothing \left(\mathrm{x}\right)+b\right) $$
(2)

where, f is termed as feature function and w●ϕ(x) as the dot product in the high dimensional feature space, F. SVR first maps the input data into high dimensional feature space with a nonlinear mapping function.

The regularized risk function is minimized to compute the coefficients as given below:

$$ \mathrm{R}(f)=\frac{1}{2}\left\Vert {w}^2\right\Vert +C\frac{1}{N}{\sum}_{i=1}^N{L}_{\epsilon}\left(y,f\left(\mathrm{x},w\right)\right) $$
(3)

where, the first term of this equation is the model flatness and the second term represents the empirical or training error and is calculated by ε-insensitive loss function as proposed by Vapnik [29] given as:

$$ {L}_{\varepsilon}\Big(y,f\left(\mathrm{x},w\right)=\left\{\begin{array}{c}\kern1em 0\kern3.5em if\ \left|y-f\left(\mathrm{x},w\right)\right|\le \varepsilon \\ {}\left|y-f\left(\mathrm{x},w\right)\right|-\varepsilon \kern1.5em otherwise\end{array}\right. $$
(4)

The regularization parameter C is a trade-off between the empirical or training error and flatness of the model. Slack variables ξ and ξ* are introduced to measure the deviation larger than ε as demonstrated in Fig. 1 and the expression for SVR can be mathematically given using Eq. (5):

$$ {\displaystyle \begin{array}{cc}\operatorname{Minimize}& \frac{1}{2}\left\Vert {w}^2\right\Vert +C{\sum}_{i=1}^N{\xi}_i+{\xi}_i^{\ast}\\ {}\mathrm{Subject}\ \mathrm{to}& \left({y}_i-w\bullet \varnothing \left(\mathrm{x}\right)-b\le \varepsilon +{\xi}_i\right)\\ {}& \left(w\bullet \varnothing \left(\mathrm{x}\right)+b-{y}_i\right)\le \varepsilon +{\xi}_i^{\ast}\\ {}& {\xi}_i,{\xi}_i^{\ast}\ge 0\end{array}} $$
(5)
Fig. 1
figure 1

The representation of SVM for regression problem using ε-insensitive loss function

Transforming Eq. (5) into the dual form using Lagrange multipliers, λ and λ* and the final form is given as:

$$ f\left(x,{\lambda}_i,{\lambda}_i^{\ast}\right)={\sum}_{i=1}^{N_{sv}}\left({\lambda}_i-{\lambda}_i^{\ast}\right)\left(\varnothing \left({\mathrm{x}}_i\right)\bullet \varnothing \left({\mathrm{x}}_j\right)\right)+b $$
(6)

The solution of Eq. (6) gives the values of the Lagrange multipliers, λ and λ*. But, only those input vectors, xi with non-zero coefficients of λ and λ* are the support vectors (SVs).

As the dimensions in the input space increase, the computation in the feature space becomes cumbersome which is called as the curse of dimensionality. So, to avoid this, kernel functions are employed [30]. Kernel functions transform the input space data into a linear representation in the high dimensional feature space, thereby facilitating all the computations to be carried out implicitly in the input space instead of in the feature space [31, 32]. This is called the kernel trick and the kernel function has the following form:

$$ K\left({\mathrm{x}}_i,{\mathrm{x}}_j\right)=\left(\varnothing \left({\mathrm{x}}_i\right)\bullet \varnothing \left({\mathrm{x}}_j\right)\right) $$
(7)

Moreover, a kernel function must satisfy the Mercer’s condition i.e. it should be symmetric, and positive semi-definite [33]. The most commonly used kernel function is the Gaussian radial basis function (RBF) defined below:

$$ K\left({\mathrm{x}}_i,{\mathrm{x}}_j\right)=\mathit{\exp}\left(-\frac{{\left\Vert {\mathrm{x}}_i-{\mathrm{x}}_j\right\Vert}^2}{2{\sigma}^2}\right) $$
(8)

where, σ denotes the width of the RBF and

$$ \gamma =\frac{1}{2{\sigma}^2} $$
(9)

Thus, the basic SVR function carries the following form:

$$ f\left(\mathrm{x},{\lambda}_i,{\lambda}_i^{\ast}\right)={\sum}_{i=1}^{N_{sv}}\left({\lambda}_i-{\lambda}_i^{\ast}\right)K\left({\mathrm{x}}_i,{\mathrm{x}}_j\right)+b $$
(10)

Karush-Kuhn-Tucker (KKT) conditions are used to compute the parameter, b, where the product between dual variables and constraints vanishes at the optimal solution.

3 Correlations for the flow boiling heat transfer

For evaluating the heat transfer coefficient in micro and mini-channels, eight different correlations given in literature have been selected. Table 1 presents these selected correlations along with their respective range of applicability.

Table 1 Flow boiling heat transfer correlations

3.1 SVR-based modeling procedure

The steps involved in the development of the SVR- based model are [43, 44]:

  • Collection of data as independent and dependent variables

  • Data preprocessing (normalization and scaling of data)

  • Dividing the whole dataset into training and test dataset using simple random sampling technique (SRS)

  • Selection of the appropriate kernel

  • Optimizing the model parameters (C, ε) and kernel parameter (γ) with grid search methodology and k-fold cross-validation.

  • Model training with the optimum value of the model and kernel parameters (C,εγ).

  • Evaluation of model performance in terms of statistical parameters such as average absolute relative error (AARE), root mean square error (RMSE), standard deviation (SD), mean relative error (MRE), leave-one-out cross-validation on training dataset (\( {Q}_{Loo}^2 \)) and leave-one-out cross-validation on test dataset (\( {Q}_{ext}^2 \)).

4 Results and discussions

The flow boiling heat transfer coefficient for R-11 has been examined in a smooth copper tube with an inner diameter of 1.95 mm by Bao et al. [14]. They have performed the experiment over a wide range of parameters with heat flux ranging from 5 to 200 kW/m2; mass flux from 50 to 1800 kg/m2 s, vapor qualities from 0 to 0.9, and system pressures from 0.2 to 0.5 MPa. Experiments performed by Bao et al. [14] have covered the whole saturated and subcooled boiling flow regime.

In this work, a comparative study of the existing correlations for flow boiling heat transfer in micro/mini-channels has been done. Furthermore, SVR-based models have been developed for predicting the flow boiling heat transfer coefficient in mini-channels for saturated and subcooled boiling, using the same dataset available from the published literature [14].

An SVR-implementation known as “ε-SVR” in the software LIBSVM library [45] on MATLAB platform has been used to develop the SVR-based models. A comparison of the performance of the SVR-based models with the existing correlations has been carried out based on statistical parameters such as average absolute relative error (AARE), correlation coefficient (R) etc. Moreover, the SVR-based models have been used to predict the effect of various independent variables on the flow boiling heat transfer coefficient and comparison has been made with experimental data.

4.1 Support vector regression-based model for subcooled flow boiling heat transfer

The whole dataset of 324 samples for subcooled boiling has been divided into the training dataset and the test dataset with simple random sampling (SRS) technique, as 80% (259 data points) and 20% (65 data points), respectively.

In the present work, the RBF kernel has been utilized as it requires only one parameter to be adjusted and has been found to give a good generalization performance than the other kernels [27]. Grid search methodology with 10-fold cross-validation has been used to obtain the optimal values of the SVR-based model and kernel parameters (C, ε, and γ) by varying these parameters in a wide range of: C [25, 215], γ [2,3,4,5,6,7,8,9,10,11,12,13,14,15, 22] and ε [2,3,4,5,6,7,8,9,10,11,12,13,14,15, 24] on the training dataset. In this process the whole dataset has been partitioned into 10 equal parts. Thereafter, 9 parts have been used to develop the model for a fixed set of these parameters (C, ε and γ) and each time the remaining one part has been utilized for validation (testing). The optimal parameters have been those which give the lowest mean squared error (MSE) [44, 46].

The value of regularization parameter C should not be too large as it can over-fit the data. On the other hand a small value of C increases the training error resulting in under-fitting and the model does not properly fit the learning data. A large value of epsilon ε encloses a small number of support vectors (SVs) and thus results in more flat estimates (a low accuracy but simple model). On the other hand a small value of ε has more data points lying outside the tube giving a large number of SVs. The kernel parameter gamma, γ in case of RBF kernel, is the inverse of standard deviation, which basically measures the similarity between two points. A small value of gamma means a large variance i.e. the chances of over-fitting increase. While a large value of gamma signifies a small variance resulting in under-fitting of the function [47]. Keeping these points in mind, the optimization procedure has been carried out. Table 2 gives the obtained optimal values of the SVR-based model parameters.

Table 2 Optimal parameters for the SVR-based model for predicting the subcooled flow boiling heat transfer coefficient

After obtaining the optimal values of the SVR parameters, training and test course curves for subcooled boiling have been constructed using training and test datasets as shown in Figs. 2a and b respectively.

Fig. 2
figure 2

a Training course curve for predicting the subcooled flow boiling heat transfer coefficient, h.b Test course curve for predicting the subcooled flow boiling heat transfer coefficient, h

Figure 3 shows that the predicted values of the subcooled flow boiling heat transfer coefficient, h for the training data as well as the test data lie close to the ideal fit line and all the predicted data points lie within a total error of ±4%. Table 3 exhibits the model evaluation parameters of the SVR-based model for both the training data as well as the test data during subcooled boiling. The obtained values of model evaluation parameters for the training data and test data reveal a high accuracy with excellent prediction performance of the SVR-based model.

Fig. 3
figure 3

SVR simulation for predicting the subcooled flow boiling heat transfer coefficient, h using optimal parameters

Table 3 SVR-based evaluation indices based on statistical parameters for subcooled boiling

4.1.1 Comparative analysis of SVR-based model with the existing correlations for subcooled flow boiling heat transfer

There are some common correlations available in the literature, which can be applied to both subcooled and saturated boiling like Gungor & Winterton [35], Liu & Winterton [39], Warrier [41], Shah [40], Lazark & Black [37] and Piasecka [42]. A comparison has been made of the prediction performances of the SVR-based model with the existing correlations for the subcooled flow. Figure 4 gives the superior prediction performance of the SVR-based model. Table 4 exhibits the relevant model evaluation parameters for the SVR-based model and the correlations on the test dataset. The obtained results in Table 4 and Fig. 4 reveal that most of the correlations did not predict the flow boiling heat transfer coefficient very well, as opposed to the SVR-based model, which may again be attributed to the fact that they have been developed based on the empirical risk minimization (ERM) principle while the SVR-based model has been developed on the SRM principle. The SRM principle of SVR optimizes the generalization accuracy over the empirical error as well as the capacity of the machines or the flatness of the model, while the ERM principle only minimizes the empirical error and does not take into account the model flatness. This results in overtraining i.e. high accuracy for training dataset and low for test dataset (unseen) giving poor generalization performance [25, 29]. The SVR-based model gives the highest prediction accuracy followed by Piasecka [42] and Gungor and Winterton [33].

Fig. 4
figure 4

Comparison of SVR-based with the various correlations for predicting the subcooled flow boiling heat transfer coefficient, h

Table 4 Comparison of SVR-based model with the existing subcooled correlations using test dataset

4.2 Support vector regression-based model for saturated flow boiling heat transfer

SVR-based model has also been developed for saturated boiling using the data from the literature [14]. Again, the whole dataset of saturated boiling with 443 samples has been partitioned into the training dataset and test dataset as 80% (354 data points) and 20% (89 data points), respectively. A similar procedure has been carried out for obtaining the optimal values of the SVR-based parameters (C, ε, and γ). The finally obtained optimal values of the model parameters are given in Table 5. It has been observed that with 354 training data points, 58 SVs have been obtained. The lesser the number of SVs, the more sparse is the solution obtained.

Table 5 Optimal parameters for the SVR-based model for predicting the saturated flow boiling heat transfer coefficient

After optimizing the SVR parameters, training and test course curves for saturated boiling have been constructed using training and test dataset as shown in Figs. 5a and b respectively.

Fig. 5
figure 5

a Training course curve for predicting the saturated flow boiling heat transfer coefficient, h.b Test course curve for predicting the saturated flow boiling heat transfer coefficient, h

The model output for the training and test dataset, between the experimental values of heat transfer coefficient h and those predicted by the SVR-based model, have been shown in Fig. 6. It is evident that both the predicted values for the training dataset as well as the test dataset lie close to the ideal fit line. Moreover, all the predicted data points lie within a total error of ±6%. Table 6 exhibits the model evaluation parameters of the SVR-based model for both the training data as well as the test data. The close proximity to each other of the model evaluation parameters between the training dataset and test dataset testify to the excellent prediction performance of the SVR-based model and also to its high generalizability.

Fig. 6
figure 6

SVR simulation for predicting the saturated flow boiling heat transfer coefficient, h using optimal parameters for training dataset and test dataset

Table 6 SVR-based model evaluation based on statistical parameters for saturated boiling

4.2.1 Comparison of the SVR-based model with the existing correlations for saturated flow boiling heat transfer

Prediction performance of the SVR-based model has been compared with the existing saturated boiling correlations over an ideal fit line shown in Fig. 7. Table 7 exhibits the relevant model evaluation parameters for the SVR-based model and the correlations on the test dataset. The obtained results in Table 7 and Fig. 7 reveal that most of the correlations under-predict the flow boiling heat transfer coefficient. However, Gungor & Winterton [35], in particular, predicted the experimental data with reasonable accuracy. The relatively poor performance of these correlations to predict the experimental data very well may be attributed to the fact that they are not general and have been developed for different flow conditions with different ranges. It can be easily surmised that the SVR-based model is an immensely improved model in comparison to the existing correlations.

Fig. 7
figure 7

Comparison of SVR-based with the various correlations for predicting the heat transfer coefficient, h during saturated boiling

Table 7 Comparison of the SVR-based model with the existing saturated boiling correlations over test dataset

4.3 Parametric effects on flow boiling heat transfer coefficient

In the this section, the effect of the various independent variables viz. vapor quality (x), mass flux (G), system pressure (P), and heat flux (q), on flow boiling heat transfer coefficient (h) predicted by the SVR-based models are discussed in the light of the experimental results and the existing theory.

4.3.1 The effect of mass flux and vapor quality on the flow boiling heat transfer coefficient

Figure 8 shows the experimentally determined heat transfer coefficient plotted against quality as a function of mass flux, at constant heat flux and system pressure. Quality, x within 0 ≤ x ≤ 1 corresponds to the saturated boiling while x < 0 falls under the subcooled boiling region. Vapor quality, x for saturated region varies between 0.001 and 0.84, while heat transfer coefficient ranges from 4.35 up to 15.5 kW/m2.K. It is clear from the Fig. 8 that the heat transfer coefficient is independent of the mass flux and the vapor quality. This can be attributed to the fact that the heat transfer is dominated by nucleate boiling. Further, it has also been observed that the predicted curve follows the same trend as that of the experimental results.

Fig. 8
figure 8

A plot of experimental and predicted heat transfer coefficient, h of R11 versus vapor quality, x at various mass flux (422 kPa inlet pressure; 58 kW/m2 heat flux)

4.3.2 The effect of heat flux on the flow boiling heat transfer coefficient

Figure 9 shows the experimental and predicted trend of heat transfer coefficient at a mass flux of 446.4 kg/m2.s at different values of heat flux. It has been observed that the heat flux, q, has a significant effect on heat transfer coefficient i.e. heat transfer coefficient increases with increasing the heat flux. The higher the heat flux, the higher the heat transfer coefficient [48]. The prediction performance of the SVR-based models has been found to be in good agreement with the experimental results as is evident from Fig. 9.

Fig. 9
figure 9

A plot of the experimental and the predicted heat transfer coefficient, h of R11 versus vapor quality, x for different heat fluxes (460 kPa inlet pressure; 446 kg/m2.s mass flux)

4.3.3 The effect of system pressure on the flow boiling heat transfer coefficient

The obtained experimental results in Fig. 10 have revealed that the heat transfer coefficient increases with increasing pressure. An increase in pressure causes variation in fluid properties such as decreasing surface tension and liquid-to-gas density (ρlg) which aids in the increase of the number of active nucleation sites [49]. It is, therefore, evident that the heat transfer coefficient in the nucleate boiling region is mainly affected by heat flux and pressure and is less sensitive to mass flux and vapor quality. It can be observed from Fig. 10 that the SVR-based models have faithfully predicted the experimental trends.

Fig. 10
figure 10

A plot of the experimental and the predicted heat transfer coefficient, h of R11 versus vapor quality, x, at different pressures (heat flux is 120 kW/m2 and the mass flux is 446 kg/m2.s)

Thus, it can be confidently stated that the developed SVR-based models accurately depict the effect of the independent parameters like mass flux, vapor quality, heat flux and pressure on the flow boiling heat transfer coefficient of R11. This accuracy of the SVR-based models is attributed to SRM. That optimizes the generalization accuracy around the empirical error and the model flatness or the capacity of SVM.

5 Conclusions

In this paper, the application of SVR for the prediction of the flow boiling heat transfer coefficient of R11 in mini-channels has been presented. It has been found that the SVR prediction results are consistent with the experimental data. The simulation results have been compared with the well-established correlations found in the literature. It has been seen that the SVR-based models are far superior in performance to these correlations. Among the existing correlations, the predictions of heat transfer coefficient by Gungor and Winterton [35] and Piasecka [42] are the most accurate. The SVR-based models were also tested for analyzing the effects of mass flux, vapor quality, heat flux and pressure on the flow boiling heat transfer coefficient in the subcooled boiling and the saturated boiling regions. It was found that the SVR-based model faithfully and accurately followed the same trend as depicted by the experimental data with respect to all the independent parameters such as mass flux, vapor quality, heat flux, and pressure, with high accuracy. This high accuracy can be attributed to the structural risk minimization (SRM) principle on which SVR is based.