1 Introduction

Over the past decades, ferrite-pearlite steels with their advantages of low cost and simple manufacturing have been widely used for various industries [1,2,3,4,5,6,7,8,9,10,11,12]. The tensile properties of ferrite-pearlite steels are mainly affected by a number of alloying elements and microstructural factors [13,14,15,16,17,18,19,20,21]. For ferrite-pearlite steels with carbon content below the eutectoid composition, there are various microstructural factors such as phase fraction, ferrite grain size, interlamellar spacing, and cementite thickness that depend on alloying elements and heat treatment. In addition, the alloying elements that were incorporated to the ferrite-pearlite steels affect the tensile properties thorough solid solution and precipitation strengthening. Because the relationship between alloying elements, microstructural factors, and tensile properties is complex, more systematic studies are necessitated to understand this relationship.

Recently, artificial neural network (ANN) techniques that facilitate the learning of input–output parameter relationship for complex problems have been applied to predict and analyze various phenomena occurring in materials. The main feature of ANNs is that, unlike other analysis methods such as physical models and linear regression models, these do not require a specific equation form and only need sufficient input–output parameter data to solve a problem. ANNs with the capacity for defining the relationship between independent and dependent parameters have been used to elucidate unclear problems and have been successfully applied to numerous applications in materials science [22,23,24,25]. ANN models have been reported to design alloys [26] and to predict microstructures [27], hardness [28], and tensile properties [27] according to chemical compositions and heat treatment conditions in carbon steels with a ferrite-pearlite structure. However, the relative importance of alloying elements and microstructural factors in determining the tensile properties of ferrite-pearlite steels has not been studied yet.

The primary objectives of the present study are: (i) to predict the tensile properties of ferrite-pearlite steels using various alloying elements and microstructural factors, (ii) to determine the influence of alloying elements and microstructural factors on tensile properties individually as well in combinations of two or more input parameters, and (iii) to quantitatively estimate the importance of input parameters on tensile properties by calculating the index of relative importance.

2 Materials and Methodology

2.1 Model Establishment

The experimental data used for this study were collected from published literature [14] on ferrite-pearlite steels with equiaxed grains and a pearlite fraction between 55 and 95%. The data sets consisted of the content of alloying elements such as carbon (C), manganese (Mn), silicon (Si), aluminum (Al), niobium (Nb), vanadium (V), and titanium (Ti) and microstructural factors such as the pearlite fraction (\(V_{P}\)), ferrite grain size (\(d_{F}\)), interlamellar spacing (\(I_{S}\)), and cementite thickness (\(t_{CM}\)) with the desired yield strength (YS) and tensile strength (TS). The microstructure of ferrite-pearlite steels was observed using an optical microscope and a field emission scanning electron microscope (FE-SEM; model JSM6700F, JEOL, Japan), then the pearlite fraction, ferrite grain size, and interlamellar spacing were quantitatively measured, as shown in Fig. 1. The cementite thickness was calculated using equation [29].

Fig. 1
figure 1

a Optical and b SEM micrographs showing the microstructural factors (pearlite fraction, ferrite grain size, interlamellar spacing, and cementite thickness) of ferrite-pearlite steels (data set 54)

Sub-size plate-type tensile specimens with a gage length of 25.4 mm were taken along a rolling direction. The tensile tests were performed at room temperature and a strain rate of \(3.3 \times 10^{ - 3} /sec\) using a 10 ton universal testing machine (model; UT-100E, MTDI, Korea) according to the ASTM E8 standard testing method. The input parameters considered for the ANN model development were the C, Mn, Si, Al, Nb, V, and Ti contents, and the pearlite fraction, ferrite grain size, interlamellar spacing and cementite thickness; the output parameters were the yield and tensile strength. The details of the input and output parameters used in the study are presented in Table 1. All the parematers were converted between 0.1 and 0.9. The conversion process is expressed as follows:

$$x_{n} = \left( {\left( {x - x_{min} } \right) \times 0.8} \right)/\left( {x_{max} - x_{min} } \right) + 0.1$$
(1)

where \(x_{n}\) is the converted value of \(x\); \(x_{max}\) and \(x_{min}\) are the maximum and minimum value of \(x\), respectively, in the entire data sets. Once the best-trained network was found, all the transformed data sets were put back into their original values using the following equation:

$$x = \left( {x_{n} - 0.1} \right)\left( {x_{max} - x_{min} } \right)/0.8 + x_{min}$$
(2)
Table 1 Details of the input and output parameters of training and test data used for ANN model

2.2 Modeling Procedure

In this study, the ANN model was trained using a backpropagation algorithm and the sigmoid function was used as an activation function [30, 31]. The ANN model training program and the graphical user interface design of the present ANN model were written in C and Java, respectively. The ANN model consisted of 11 neurons (C, Mn, Si, Al, Nb, V, and Ti contents and pearlite fraction, ferrite grain size, interlamellar spacing, and cementite thickness) in the input layer and two neurons (yield and tensile strength) in the output layer. The neural network training consisted of adjusting the weights associated with each connection between the neurons until the computed output parameters for each data set of input parameter were as close as possible to the experimental output parameter values. To determine the optimum architecture and to find the confidence of the ANN model, the data sets were split into training and testing data sets. The available 57 data sets were divided into 42 training data sets and 14 testing data sets.

The architecture of the ANN model consists of hidden layers and neurons in the hidden layers, as well as momentum term, learning rate, and number of iterations. The optimal parameters for the network were determined based on the average training error in the output parameter prediction (\(E_{tr}\)) of the trained data sets.

$$E_{tr} \left( y \right) = \frac{1}{N}\mathop \sum \limits_{i = 1}^{N} \left| {\left( {T_{i} \left( y \right) - O_{i} \left( y \right)} \right)} \right|$$
(3)

where \(E_{tr} \left( {\text{y}} \right)\) is average training error in the training prediction and testing data sets for output parameter \({\text{y}}\), \(N\) is the number of data sets, \(T_{i} \left( y \right)\) is the targeted output parameter, and \(O_{i} \left( y \right)\) is the calculated output parameter.

In this study, the momentum term, learning rate, and iterations of the ANN model were optimized as 0.8, 0.3, and 10,000 respectively at the initial state based on the \(E_{tr} \left( {\text{y}} \right)\). In order to find the suitable number of neurons and hidden layers, two basic structures of the neural networks are examined: one with single hidden layer and the other with two hidden layers. Each structure was train with different hidden neurons starting from 2 to 20. Figure 2a shows the variation in the average training error as a function of hidden layers and neurons for the yield and tensile strength. The minimum average training error or the output parameters was obtained using two hidden layers with 14 neurons in each layer. The two hidden layers with 14 neuron architecture was selected to optimize the other parameters. The number of iterations executed varied from 5000 to 100,000 as exhibited in Fig. 2b. As the number of iterations increased, the average training errors from the output parameters decreased, and after 30,000 iterations, the average training errors from the output parameters were almost constant. Hence the number of iterations was fixed at 30,000. Once the number of iterations was selected, the momentum term and learning rate coefficients were varied from 0.1 to 0.9 in steps of 0.1, respectively, as provided in Fig. 2c and d. The minimum training errors for the output parameters were generated in a momentum term of 0.8 and a learning rate of 0.6. Therefore, an optimum ANN model with 11–14-14–2 architecture that consisted of an 0.8 momentum term and a 0.6 learning rate with 30,000 iterations was selected in this study. The optimum ANN model was compared with an existing equation model, as follows [14]:

$${{Yield strength}}\; ({\text{MPa}}) = 15.4\left[ {V_{F}^{\frac{1}{3}} \cdot \left( {2.3 + 3.8 \cdot \left( {\% Mn} \right) + 1.13 \cdot d_{F}^{{\frac{ - 1}{2}}} } \right) + \left( {1 - V_{F}^{\frac{1}{3}} } \right) \cdot \left( {11.6 + 0.25 \cdot I_{S}^{{\frac{ - 1}{2}}} } \right) + 4.1 \cdot \left( {\% Si} \right)} \right]$$
(4)
$${{Tensile strength}}\; ({\text{MPa}}) = 15.4\left[ {V_{F}^{\frac{1}{3}} \cdot \left( {16.0 + 1.18 \cdot d_{F}^{{\frac{ - 1}{2}}} } \right) + \left( {1 - V_{F}^{\frac{1}{3}} } \right) \cdot \left( {46.7 + 0.23 \cdot I_{S}^{{\frac{ - 1}{2}}} } \right) + 6.3 \cdot \left( \% {{Si}} \right)} \right]$$
(5)

where \(V_{F}\) is the ferrite fraction (vol%), \(d_{F}\) is the ferrite grain size (μm), \(I_{S}\) is the interlamellar spacing (μm), \(\% Mn\) is the Mn content (wt%), and \(\%{Si}\) is the Si content (wt%).

Fig. 2
figure 2

Variation in average training error as a function of a hidden neurons with different hidden layers, b iteration, c momentum term, and d learning rate for yield strength and tensile strength

2.3 Index of Relative Importance (\({\mathbf{I}}_{{{\mathbf{RI}}}}\))

The index of relative importance (\(I_{RI}\)) is a vector quantity. The direction and amount of the index of relative importance indicate the significance of the input parameters in the yield and tensile strength. In this study, the instantaneous importance of the input parameters on the output parameters has been estimated. In the equation \({\text{Y}} = {\text{f}}\left( {X_{1} + X_{2} + X_{3} + X_{4} \cdots X_{11} } \right)\), \({\text{Y}}\) is the yield or tensile strength; \(X_{1}\) to \(X_{11}\), represent the content of C, Mn, Si, Al, Nb, V, and Ti, and the pearlite fraction, ferrite grain size, interlamellar spacing, and cementite thickness, respectively. The procedure involved the calculation of the index of relative importance as shown below.

  1. (a)

    The % band of \(i\) th input parameter \(= \left( {\left( {X_{i} } \right)Maximum - \left( {X_{i} } \right)Minimum} \right)/100\)

  2. (b)

    The 6% band of each input parameter was considered i.e., + 3% and  − 3%

  3. (c)

    As a result, input parameter \(X_{1}\) has two row matrices \(\left[ { + 3{\text{ \% }}X_{1} + X_{2} + X_{3} + X_{4} \cdots X_{11} } \right]\) and \(\left[ { - 3{\text{ \% }}X_{1} + X_{2} + X_{3} + X_{4} \cdots X_{11} } \right]\)

  4. (d)

    When these two row matrices were passed through the ANN model, \((Y_{1} )_{{ + 3\% X_{1} }}\) and \((Y_{1} )_{{ - 3\% X_{1} }}\) were predicted.

  5. (e)

    The difference, \(\Delta \left( {y_{1} } \right) = (Y_{1} )_{{ + 3\% X_{1} }} - (Y_{1} )_{{ - 3\% X_{1} }}\) was calculated.

  6. (f)

    The index of relative importance (\(I_{RI}\)) of input parameter \(X_{1}\) was calculated from the formula:

    $$I_{RI} = \Delta \left( {y_{1} } \right)/\left( {\left( {y_{1} } \right)Maximum - \left( {y_{1} } \right)Minimum} \right)$$
    (6)

The process (from step 1 to step 6) was repeated to calculate the index of relative importance for the remaining input parameters, and the process for data set 36 is presented in Table 2. While one input parameter was varied by a \(\pm 3{\text{ \% }}\) offset, the other input parameters were kept constant. After adding the \(\pm 3{\text{ \% }}\) variation to all the input parameters, ten combinations of input parameter data sets were created. These data sets were fed to the ANN model to predict the respective output parameters and thereby to calculate the index of relative importance for each input parameter.

Table 2 Procedure of estimating the index of relative importance (\(I_{RI}\)) of input parameters on yield and tensile strength of data set 36

3 Results and Discussion

The performance of the ANN model can be estimated by calculating the percent error for the output parameter of the testing data set. Figure 3 exhibits the comparison of predictions and percent error in the yield and tensile strength using the ANN model and equation model. For the percent error of the yield and tensile strength (Fig. 3b and d), the ANN model for training data sets has a percent error of 0.1% and 0.0%, respectively. However, the equation model for training data sets has percent error of 13.0% and 9.6% on the yield and tensile strength, respectively. Even in the yield and tensile strength predictions of testing data sets, percent errors of the ANN model are lower than the equation model with percent errors of 24.3% and 20.7%, respectively, at 11.0% and 9.7%. These results indicate that the ANN model has better agreement with experimental data sets than the equation model. The ANN model that completed the validation can be used to determine the relationships between input parameters and output parameters.

Fig. 3
figure 3

Performance of the ANN model and the existing equation model [14] for the prediction of the a yield strength and b tensile strength, and the percent error in the c yield strength and d tensile strength for 42 training and 14 testing data sets

In contrast, most of the carbon in ferrite-pearlite steels is precipitated as carbide and cementite of pearlite [12, 13]. Figure 4 presents the predicted yield and tensile strength of data set 36 as a function of an input parameter except C, keeping other input parameters. The variations in the yield and tensile strength changed more with Mn, Si, and V contents and ferrite grain size compared to the other input parameters. Increasing the Mn, Si, and V contents increases the yield and tensile strength, whereas increasing the ferrite grain size decreases the yield and tensile strength. It is well known that Mn and Si are effective elements for solid solution strengthening, and Al, Nb, Ti, and V are stronger elements for precipitation strengthening by forming precipitates such as carbide and nitride combined with C and nitrogen (N) [14,15,16,17,18, 32,33,34,35,36,37]. Furthermore, it has been reported that the precipitates decrease ferrite grain size and increase ferrite fraction by decreasing austenite grain size due to the pinning effect [32,33,34,35,36,37]. In the case of microstructural factors, the variations in the yield and tensile strength from each input parameter were mostly similar. The fine interlamellar spacing together fine ferrite grain size commonly increase the yield and tensile strength according to the well-known Hall–Petch relationship [38]. The results in this study indicates that microstructural factors of ferrite-pearlite steels affect the yield and tensile strength by interacting with each other.

Fig. 4
figure 4

Predicted yield strength and tensile strength of data set 36 as a function of a Mn, b Si, c Al, d Nb, e Ti, f V contents, g pearlite fraction, h ferrite grain size, i interlamellar spacing and j cementite thickness, keeping other input parameters

Based on the result in Fig. 4, which shows that Mn, Si, and V contents and ferrite grain size substantially change the predicted yield and tensile strength of data set 36, Figs. 5 and 6 exhibit the predicted yield and tensile strength as a function of V content and ferrite grain size for different Mn and Si contents. The contour plots enable the selection of the condition for the desired yield and tensile strength by visual inspection. The range of Si, Mn and V contents relates to the amount to the usual ferrite-pearlite steels. Each point in Figs. 5 and 6 indicates one ferrite-pearlite steel system. The predicted yield strength tends to increase almost linearly with increasing the content of Mn, Si and V and decreasing ferrite grain size (Fig. 5). At 1.54 wt% Mn and above 0.67 wt% Si, the regions of maximum predicted yield strength are observed for the fine ferrite grain size and high V content. In contrast, the predicted tensile strength increases based only on Mn and Si contents and shows an unclear tendency for V content and ferrite grain size (Fig. 6). The regions of maximum predicted tensile strength are observed at 1.54 wt. Mn and above 0.67 wt% Si. These predicted yield and tensile strength maps will provide an insight into the absolute amounts of alloying elements and microstructural factors needed to make the desired ferrite-pearlite steel that has the required yield and tensile strength.

Fig. 5
figure 5

Contour of predicted yield strength as a function of V content and ferrite grain size at different Si and Mn contents

Fig. 6
figure 6

Contour of predicted tensile strength as a function of V content and ferrite grain size at different Si and Mn contents

There are numbers of reports regarding the effect of various alloying elements and microstructural factors on the yield and tensile strength of ferrite-pearlite steel [3,4,5,6,7,8,9,10,11,12,13,14]. In this study, to identify the quantitative influence of various alloying elements and microstructural factors, the index of relative importance was calculated. Figure 7 presents the average index of relative importance of the input parameters (alloying elements and microstructural factors) on the output parameters (yield and tensile strength). For the average index of relative importance for yield strength (Fig. 7a), Mn, Si, and V had a positive effect, while the ferrite grain size and interlamellar spacing had a negative effect, and the other input parameters were relatively insignificant. In particular, the V and ferrite grain size had a great effect than other input parameters. The yield strength is generally known to increase when the slip of dislocation is prevented by many factors such as stress field, grain boundary, precipitates, and dislocation [39]. The precipitates and grain boundary are effective inhibitors to the slip of dislocation. Since Al, Nb, and Ti prefer to reduce grain size and V relatively contributes to precipitation strengthening element and, it is reasonable that the yield strength of ferrite-pearlite steels is greatly influenced by the V content and ferrite grain size.

Fig. 7
figure 7

The average indexes of relative importance of input parameters on the a yield strength and b tensile strength of ferrite-pearlite steels

Figure 7b presents the average index of relative importance of alloying elements and microstructural factors with regard to tensile strength. The Si, V, and pearlite fraction had a positive influence, while the interlamellar spacing had a negative influence, and the influence of other input parameters was minor. In particular, Si and pearlite fraction had the greatest impact. It is well known that tensile strength is related to work hardening after yielding. Precipitates with coherency boundaries with matrix and pearlite are effective at work hardening [40,41,42]. Furthermore, Si enhances the work hardening of the pearlite and precipitates because the solute Si stabilizes the pearlite and precipitates by diffusing to the interface of the cementite/ferrite and precipitate/ferrite [43,44,45]. Moreover, the fine interlamellar spacing of pearlite enhances work hardening [40]. Accordingly, it is valid that tensile strength is greatly affected by Si and V contents and pearlite fraction that are related to increasing work hardening.

4 Conclusions

Based on the present artificial neural network (ANN) analysis for modeling the tensile properties of ferrite-pearlite steels with equiaxed grains, the following conclusions can be drawn.

  1. 1.

    An ANN model has been developed to predict the yield and tensile strength of ferrite-pearlite steels in terms of alloying elements (C, Mn, Si, Al, Nb, Ti, and V) and microstructural factors such as pearlite fraction, ferrite grain size, interlamellar spacing, and cementite thickness.

  2. 2.

    The yield and tensile strength predictions made using the ANN model with unseen data were in good agreement the experimental values, as compared with the calculated properties of the existing model, and the ANN model showed experimentally reliable trends by analyzing the complex nonlinear relationships.

  3. 3.

    The effect of alloying elements and microstructural factors on tensile properties of ferrite-pearlite steels can be quantitatively estimated with the help of the index of relative importance based on the ANN model to understand the complicated effects of input parameters on output parameters.