Keywords

1 Introduction

Austempered ductile iron (ADI), sparked a lot of interest among design engineers due to its high tensile strength with high fatigue strength, good wear resistance and ductility [1]. These desirable properties in ADI are achieved as a result of addition of a delicate proportion of several chemicals (alloying elements) in the ductile iron, and an isothermal heat treatment process called austempering. Since the hardness and ductility in cast-iron are inversely related, its strength and ductility, as desired for a specific application, can be widely varied by carefully selecting the proportion of chemical compositions and by varying the austempering conditions [2, 3]. Because of these characteristics, ADI is widely used in various engineering applications, e.g., manufacturing, railways, and automobiles. Some of the parts produced with ADI are gears, wheel hubs, crankshafts, connecting rods, earth moving machineries and railroads [4,5,6].

ADI belongs to the family of ductile cast irons whose weight percentage of carbon (C) and silicon (Si) is relatively high as compared to other alloying elements. Ductile cast iron is obtained by magnesium (Mg) treatment immediately prior to pouring casting, which produces spheroidal or nodular graphite in ductile iron [7,8,9]. In order to enhance austempering heat treatment and to achieve desirable mechanical properties, elements such as copper (Cu), nickel (Ni), molybdenum (Mo), manganese (Mn), are usually added to ductile iron [2, 8]. During the austempering process, the chemicals interact in a complex and highly nonlinear manner that influences the microstructure which in turn determines the hardness of the ADI. A non-destructive hardness test of a material is commonly used in industry and research because it provides an easy, reliable and inexpensive method to determine basic properties, such as, hardness. Vickers hardness test is one of such tests in which the Vickers hardness number (VHN) of a material can be determined [10]. Due to the complexity involved in the austempering process, any analytical technique to estimate the number and the proportion of the chemicals as well as the austempering temperature and process duration is currently not available.

Recently, artificial neural networks (ANNs) have evolved as a powerful modeling technique for such complex processes. ANNs have been successfully applied to model complex processes in several fields of science and engineering [11, 12], e.g., in modeling solar cells, fuel cells and photovoltaic arrays [13,14,15,16,17,18]. There has also been a great interest in the materials science community in ANN-based modeling [19,20,21,22]. However, only a few studies of ANNs to predict hardness of ADI have been reported [23, 24]. PourAsiabi et al. [24] have proposed a multilayer perceptron (MLP)-based modeling technique to estimate VHN of ADI with eight specimens. However, they have considered only two (out of seven) chemical components, i.e., Cu and Mo, in their modeling to estimate the VHN and have reported a mean absolute error (MAPE) of 0.71%. Although a tiny proportion of other chemical elements can contribute significantly towards hardness and tensile strength of ADI [2, 8], they have neglected other elements in their model.

In the present paper, we carried out an in-depth study to predict the VHN in ADI using the experimental data reported in [24]. We proposed three MLP-based modeling schemes to predict VHN by taking two, three and seven chemical compositions in the ADI. The major contribution of this paper is as follows. In Model-1, by carefully selecting the MLP parameters, we could achieve a MAPE of 0.33% that is superior to 0.71% that reported in [24]. In Model-2 we introduced an additional important chemical composition, i.e., Mg, and shown that it can achieve a maximum MAPE of 0.27%. Next we considered seven chemical constituents in Model-3 and shown that it can achieve a MAPE of 0.22% in predicting VHN. It is noteworthy that our proposed models can predict the VHN, even beyond the range of the training parameters, accurately.

2 Austempering Process of ADI

The austempering process of the ductile iron is commonly identified as a three-step heat treatment process shown in Fig. 1 [1, 2]. In Step-1 (A-B-C), the casting is heated to and held at austenitizing temperature Tγ (815–925 ℃) for approximately 2 h. This step is carried out to make the structure of the casting fully austenitic (γ). In Step-2 (C-D), rapid cooling (quenching) is performed from austenitizing temperature to austempering temperature TA (ranging between 260–400 ℃). Rapid quenching is required to avoid austenite transformation and formation of pearlite. In Step-3 (D-E), the austempering phase, the casting is held at a constant (isothermal) temperature in a salt bath over a period of 1–2 h. Two reactions take place during this isothermal heating depending on the duration of heating. In the first reaction, the austenite (γ) decomposes into acicular ferrite (α) and carbon-rich austenite \( (\upgamma_{\text{HC}} ) \). This microstructure is called ausferrite. If the casting is held at the austempering temperature for too long, a second reaction takes place, during which the carbon-rich austenite decomposes further into ferrite (α) and carbide. This second reaction makes the material brittle, which is an undesirable outcome and hence is avoided. The desirable mechanical properties (e.g., tensile strength and ductility) are obtained in ADI after the completion of the first reaction but before the onset of the second reaction. Finally, the casting is air cooled to room temperature TRoom (E-F) [2, 3].

Fig. 1.
figure 1

The ADI austempering process.

3 The ANN and Modeling Scheme

Here, we briefly describe the ANN and the modeling scheme used to predict the VHN of ADI.

3.1 Multilayer Perceptron (MLP)

The MLP is a feed-forward ANN with an input, one or more hidden layers, and one output layer. Each layer contains one or more nonlinear processing units called ‘neuron’, or ‘node’. Each node of a lower layer is connected to all the nodes of an upper layer through links called weights. The backpropagation (BP) algorithm is the most popular learning technique used to train the MLPs [11]. Let Ni, i = 1, 2,…, L, denote the number of nodes in layer i, in which i = 1 denotes the input layer and i = L denotes the output layer. During training process, an input vector, X = [x1 x2… xN1] and a target output vector, D = [d1 d2… dNL], are applied to the MLP. The MLP produces an output vector, Y = [y1 y2… yNL] depending on the weight values of different layers and the nonlinear activation function in each node.

The sum of output square error at the kth instant is given by

$$ E(k) = \sum\limits_{i = 1}^{{N_{L} }} {[d_{i} (k) - y_{i} (k)} ]^{2} $$
(1)

The BP algorithm attempts to minimize the cost function E(k) recursively by updating the weights of the network. The learning and the momentum rates used in the BP algorithm are denoted by α and β, respectively, and their values lie between 0 and 1.

3.2 The ANN-Based Modeling Scheme

Schematic diagrams of the ANN-based modeling of the ADI process are shown in Fig. 2, as a system identification process. Based on the number of chemical compositions used, we propose three MLP-based models to predict the VHN. Model-1 is proposed using Cu and Mo compositions along with austempering temperature and time in order to determine the influence of these compositions on VHN. The purpose of this model, in addition to verifying the effect of Cu and Mo, is to compare its performance with that of [24]. The influence of Mg, although a tiny proportion (about 0.05%), is quite significant in determining the microstructure and the mechanical properties of the ductile iron destined for ADI [7,8,9]. Therefore, in Model-2 we added Mg to Model-1 in order to predict VHN at different austempering temperature and time duration. The overall ADI process that affects the ausferrite microstructure, and in turn, its ductility and hardness, finally depends on all the major chemical compositions [2, 8]. Therefore, in order to predict the VHN, in Model-3 we considered additional four major chemical components, i.e., C, Si, Mn and Ni to Model-2.

Fig. 2.
figure 2

The three modeling schemes.

During the training phase, the input pattern along with its corresponding desired VHN value (the measured value) is applied to the MLP-based model. The output of the model is computed after processing the input pattern in the nodes of different layers of the MLP. Thereafter, the model output is compared with the desired output to generate an error. The resulting error is used to update the weights of the model using the BP algorithm. This process is continued until the MSEtrain settles to a small value. Thereafter, the trained model is tested for prediction of VHN.

4 Experimental Setup and the Dataset

In the current study, we consider eight specimens with different chemical compositions that have undergone the austempering process with different austempering temperatures and time durations. The experimental data are taken from the article by PourAsiabi et al. [24]. The chemical compositions of the eight specimens (in wt%) are given in Table 1. The specimens were austenitized and then were rapidly quenched and soaked in a salt bath at 260, 290, or 320 ℃ for different time durations, i.e., 30, 60, 90 or 120 min. In this way, twelve samples were created from each specimen, giving rise to a total of 96 samples. The VHN was measured for each sample under 10 kg loading. More details of the experimental setup and procedure can be found in [24].

Table 1. Chemical Composition of eight specimens in wt%. From PourAsiabi et al. [24].

5 Simulation Setup

All the input variables are normalized between 0 and 1, as per the requirement of the MLP-based modeling. Out of 96 data samples, we used 75% data (72 samples) as training set and the remaining 25% data (24 samples) are used for testing purpose. The learning parameter and momentum factor used in the BP algorithm are gradually decreased with the increase of the iteration, in order to have a faster learning. A bias unit with an output value of +1.0 is added to all layers except the output layer of the MLP. For each model, we carried out several experiments in order to achieve the best architecture and learning parameters. The hidden layer nodes were varied from 5 to 9, and the learning parameters were varied from 0.1 to 0.8. At the end, the best architecture and learning parameters were obtained. The MLP architecture of {5-7-1} in Model-1 implies that it is 3-layer MLP with 5, 7, and 1 node in the input, hidden and output layers (including the bias unit), respectively. Training continued for 10,000 iterations. The MSE at the end of training is found to be lower than −40 dB. This low value of the MLP indicates that the MLP-based models have been well-trained. The architecture of the MLP in Model-2 and Model-3 are given by {6-7-1} and {10-7-1}, respectively. Both the initial learning and momentum rates for the three models are selected as 0.5.

6 Results and Discussion

The scatter plot between the measured and the model estimated VHN values for the complete (96 samples) dataset is shown in Fig. 3a for Model-1. The values of CC, MAPE, and MSE are also shown in this Fig. It can be seen that the CC for both datasets are close to 1.0. In comparison, the CC and MAPE of PourAsiabi et al. [24] are 0.9912 and 0.71 respectively. Figure 3b and c shows scatter plots for Model-2 and Model-3 respectively. In Model-2, the CC values for the test set and the complete set are found to be 0.9987 and 0.9992, respectively, thus indicating the effectiveness of modeling. Low values of MAPE indicated high accuracy of estimation of VHN. In Model-3, the CC values for the test set and the complete set are found to be 0.9993 and 0.9995, respectively. The MAPE values for the two sets are found to be 0.33 and 0.22, respectively. These values are superior to the other two models. This is expected, because the VHN is dependent upon the interactions of all the alloying elements during the austempering process. Thus, Model-3 is a complete one that can capture the ADI process accurately.

Fig. 3.
figure 3

Scatter plot of measured and estimated VHNs for Model-1, Model-2 and Model-3.

7 Prediction of VHN with MLP Models

Prediction performances of Model-2 at different values of temperature and at four specific austempering time durations, i.e., 30, 60, 90 and 120 min are shown in Fig. 4, for the eight specimens. The red symbols indicate the measured values and the blue symbols denote the predicted values of VHN. It can be seen that the model is able to predict VHN accurately at different values of austempering temperature ranging from 250 to 330 ℃. Prediction performances of Model-3 at different values of austempering time duration at three specific austempering temperature, i.e., 260, 290, or 320 ℃ are shown in Fig. 5, for the eight specimens. The model is able to predict VHN accurately for austempering time duration ranging from 20 to 130 min. This aspect of the MLP-based model has not been reported in [24] or elsewhere.

Fig. 4.
figure 4

Model-2 performance. Prediction of VHN at different austempering temperature duration of 30, 60, 90 and 120 min. (Color figure online)

Fig. 5.
figure 5

Model-3 performance. Prediction of VHN at different austempering time duration for austempering temperature of 260, 290 and 320 ℃. (Color figure online)

8 Conclusion

We proposed ANN-based techniques for modeling austempering process of ADI to predict VHN. These models can effectively predict VHN at a specific austempering temperature, time duration and, wt% of the seven chemical compositions. We have shown that the correlation coefficient between the measured and estimated VHN values is close to 1.0, thus, indicating high estimation accuracy. The performance of Model-1 is found to be superior to the similar model proposed in [24]. Unlike the other critical chemical constituents that are neglected in [24], the proposed Model-2 and Model-3, respectively, takes three and seven chemical constituents. The mean absolute errors are found to be as low as 0.27% and 0.22% in Model-2 and Model-3, respectively. These models can predict VHN accurately at any value (within a specified range) of austempering temperature and time duration, even beyond the range of training samples. Similar predictions were also achieved for different proportion of chemical compositions (in wt%) at a specific austempering temperature and time duration. One of the major limitations of the current work is the limited amount of available data. Here, we have considered only 96 data samples. However, in a future work, we intend increase dataset by considering other dataset available in the literature. However, these modeling techniques can help in saving time and cost of designing ADI for a specified VHN required for any application.