Introduction

In the manufacturing industry, radiographic testing (RT) is an important non-destructive testing technique for finding defects present inside a material. Radiographic images used in the classification of weld defects are of great significance for weld quality management, reliability, and large-scale equipment safety (Zahran and Al-Nuaimy 2002; Silva and Mery 2007a). Additionally, the results of weld inspection provide useful information for identifying potential problems in the fabrication process, which is necessary for improving welding operations (Liao 2009; Zapata et al. 2012). Therefore, improving the efficiency and accuracy of weld defect classification is an important technical problem.

Fig. 1
figure 1

Process of automated RT system

In current weld quality management, many efforts have been devoted for developing an automated inspection system to improve the accuracy and efficiency of RT (Silva and Mery 2007b; Liao 2003). An automated RT system usually includes the following four steps (Liao 2009; Shen et al. 2010), as shown in Fig. 1. (1) Image preprocessing mainly includes noise reduction and image enhancement. Noise reduction smoothes the RT images, and image enhancement improves the contrast and highlights the defects and details in RT images. Image enhancement usually includes contrast enhancement and edge enhancement methods (Gu et al. 2013; Mu et al. 2013) such as the histogram transformation method (Zahran et al. 2013), gray level transformation method (Gao and Hu 2014), and fuzzy enhancement method (Movafeghi 2015). (2) Defect segmentation finds and isolates weld defect regions from the rest of the radiographic image, providing the foundation of feature extraction and defect classification in automated RT systems (Nacereddine et al. 2006, 2013). (3) Feature extraction obtains a set of features—generally including edge-based and region-based features—that can describe the characteristics of the weld defects (Shen and Gao 2010). (4) Defect classification, i.e., defect pattern classification, which classifies defects according to the extracted defect features, is the focus of this paper.

It should be noted, there are many other methods relevant to weld quality management, such as thermal image analysis, ultrasonic testing (UT), and pulsed thermography (PT). For example, Sreedhar et al. (2012) proposed an online weld-monitoring system based on thermal image analysis via infrared cameras for tungsten inert gas (TIG) welding. Salchak et al. (2016) proposed a method for classifying possible defects with corresponding dimension limits based on UT. Du et al. (2008) developed a UT-based weld defect classification method by integrating a discriminant tracing algorithm and a probabilistic neural network. Zhu et al. (2017) developed an improved feature extraction algorithm for eddy current PT to realize automatic defect identification. RT, UT, and PT each have limitations, e.g., defects lying in certain planes cannot be detected by some of the methods. Because RT is widely used in current practical applications, especially in the turbine manufacturing industry, this paper focuses on RT and its application.

In recent decades, many studies have been performed on the classification of weld defects (Silva and Mery 2007a, b), and several methods have been applied to the classification of defects: fuzzy systems, artificial neural networks (ANNs), and support vector machines (SVMs) (Zhang et al. 2005). For example, Liao (2003) proposed a fuzzy expert system approach for defect classification, for which the most important step is acquisition of fuzzy rules. Zapata et al. (2010) proposed an adaptive-network—based fuzzy inference system for classification of weld defects. Fuzzy systems have a tradeoff between accuracy and interpretability, i.e., the accuracy can be improved but at the cost of interpretability. Wang and Liao (2002) proposed a method based on a multi-layer perceptron (MLP)-ANN with 12 features for defect classification. Lim et al. (2007) selected an optimal set of features from 25 features using a statistical approach and then used an MLP-ANN to classify the defects. ANNs can achieve good performance with a large number of training samples, but they can make judgment errors with only a small number of training samples. Zhang et al. (2005) utilized asymmetrical SVMs to classify weld defects. Li (2009) employed an SVM to classify a local defect embedded in a homogeneous copper-clad laminate surface. A direct multiclass SVM (DMSVM) for classifying defects has also been proposed (Shen et al. 2010). The SVM-based methods have good generalization, but their performance depends on the selected defect samples and defect features. In addition, there are some methods that integrate different classification theories. For example, You et al. (2015) developed a laser weld defect classification method by integrating a feed-forward neural network and an SVM. Fan et al. (2016) proposed a solder bump recognition method integrating fuzzy theory and an SVM. Mu et al. (2013) proposed a weld defect classification method based on principal component analysis (PCA) and an SVM, which is more effective than traditional SVM methods. Generally, these methods belong to machine learning/data mining techniques, and classification accuracy is greater in such techniques. Ultimately, all of the aforementioned methods cannot give the probability that a weld defect would belong to a certain type owing to the small differences between some types of weld defects.

Dempster–Shafer (DS) evidence theory (Pohl and Genderen 1998), which can address the uncertainty associated with classification, is currently being introduced to the field of pattern recognition. For example, Hong et al. (2012) proposed a method for fault diagnosis based on a modular neural network and DS theory. Liu et al. (2011) introduced an approach for glass defect identification based on DS evidence theory. Gao et al. (2012) applied DS evidence theory to fuse different features in image subcategory classification. DS evidence theory has also been applied to fault condition diagnosis (Pan et al. 2011), which can combine multiple features to decide the fault condition type. In fact, the classification of weld defects is a process of multi-information fusion, i.e., DS theory can be applied to fuse the different weld defect features (henceforth called evidence) for classification.

However, there are several problems in applying DS to weld defect classification. The first problem is the calculation of weighting factors for each feature when using features as the evidence, i.e., each feature of a defect has a different contribution for classification, whereas traditional methods assign features equal weighting. To handle this problem, the analytic hierarchy process (AHP) is used in this work. AHP (Saaty 1980, 1990) is a decision method that can split the elements related to the decision into different hierarchies such as goals, principles, and programs (Maruthur et al. 2013). Furthermore, AHP combines qualitative analysis with quantitative analysis, so it can effectively solve complex decision problems whose objects and criteria are difficult to quantify (Hafizan et al. 2016). In order to promote the scientific nature of weight prediction and avoid the contradiction between weight prediction and actuality due to subjectivity, AHP can be used to calculate the weight of features (WF) before applying a DS-based method. The second problem is the standard value of feature (SVF) identification for elements in the frame of discernment. The value of each feature (e.g., length, aspect ratio) for a certain type of defect varies, so it should be decided before using the DS method. The third problem is the calculation of the combined rules for multi-evidence cases. This issue is crucial to the success of applying DS methods; in contrast, traditional methods do not utilize weighting factors for features, so they cannot be directly applied to the classification of weld defects.

To overcome the aforementioned problems, a novel method based on AHP and DS evidence theory is proposed for weld defect classification. First, a method based on AHP is proposed to calculate the WF for weld defects. Then, an improved method based on DS evidence theory that includes calculation of the SVF based on frequency histogram analysis (FHA) and an improved Dempster’s rule of combination based on the WF is presented. Finally, a case study involving the classification of steam turbine weld defects is provided to illustrate the proposed techniques. This method enables application of DS theory to weld defect classification, and it classifies weld defect types with better performance than traditional methods.

The paper is organized as follows. “Feature selection” section describes the features that are used in this paper. “Calculation of WF based on AHP” section discusses the WF based on AHP. “Improved DS method for weld defect classification” section introduces the improved DS method for weld defect classification. “Weld defect classification case study” section describes the weld defect classification case study. “Results” section presents the results of the case study. “Discussion” section discusses how the proposed method compares to other classification methods. Finally, “Conclusion” section presents the conclusions on the proposed method and gives suggestions for future research.

Feature selection

As highlighted in Fig. 2, there are generally five types of weld defects: porosity (PO), slag inclusion (SL), lack of penetration (LP), lack of fusion (LF), and crack (CR). The features of weld defects can be categorized into two classes: edge-based features and region-based features. Edge-based features are principally centered on measured geometric properties, and region-based features are focused on the intensity characteristics. As shown in the white boxes in Fig. 2, the weld defect area should be segmented before feature extraction and feature selection. In this paper, seven defect features were selected from previous works (Shen et al. 2010; Mu et al. 2013), as shown in Table 1. These features can be used for defect classification based on DS theory, i.e., the features can be described as evidence and fused to determine the class of a weld defect.

Fig. 2
figure 2

Defect samples: a porosity (PO), b slag inclusion (SL), c lack of penetration (LP), d lack of fusion (LF), and e crack (CR)

Table 1 Defect features

Calculation of WF based on AHP

In traditional classification methods, features are usually weighted equally, and the obtained feature values are directly used for defect classification. However, in the process of defect classification, the role of each feature varies with the defect type, i.e., different features have different weights in defect classification. If the same weight is used for each feature, the classification accuracy may be reduced. In contrast, assigning different weights to different features according to the defect type improves the classification accuracy. Therefore, the WF should be determined before applying the DS-based method. In this paper, a method based on AHP for calculating the WF is proposed. The specific process of this method is as follows:

  1. (1)

    The evaluation index \(w_{i} \left( {i=1, \ldots , 7} \right) \), which represents the weight value of weld defect features, is determined. For example, \(w_{1} ,w_{2} ,\,\ldots ,w_{7}\) can be defined such that they represent the weight values of \(L,\hbox {Ar},\, \ldots ,\hbox {K}_{\mathrm{u}}\), respectively, as shown in Table 1.

  2. (2)

    The comparison matrix A is constructed as follows:

    $$\begin{aligned} A=\left[ {{\begin{array}{cccc} {\frac{w_1 }{w_1 }}&{}\quad {\frac{w_1 }{w_2 }}&{}\quad \cdots &{}\quad {\frac{w_1 }{w_n }} \\ {\frac{w_2 }{w_1 }}&{}\quad {\frac{w_2 }{w_2 }}&{}\quad \cdots &{}\quad {\frac{w_2 }{w_n }} \\ \cdots &{}\quad \cdots &{}\quad \cdots &{}\quad \cdots \\ {\frac{w_n }{w_1 }}&{}\quad {\frac{w_n }{w_2 }}&{}\quad \cdots &{}\quad {\frac{w_n }{w_n }} \\ \end{array} }} \right] =\left[ {{\begin{array}{cccc} {a_{11} }&{}\quad {a_{12} }&{}\quad \cdots &{}\quad {a_{1n} } \\ {a_{21} }&{}\quad {a_{22} }&{}\quad \cdots &{}\quad {a_{2n} } \\ \cdots &{}\quad \cdots &{}\quad \cdots &{}\quad \cdots \\ {a_{n1} }&{}\quad {a_{n2} }&{}\quad \cdots &{}\quad {a_{nn} } \\ \end{array} }} \right] , \end{aligned}$$
    (1)

    where \(a_{ij} =w_{i} /w_{j}\) represents the pair-wise comparison of index \(w_{\mathrm{i}}\) relative to index \(w_{\mathrm{j}}\). For simplicity, the value of \(a_{ij} \) was defined directly in this study, rather than from \(w_{i}\), and \(a_{ij}\) was set based on a 1-to-9 scale of relative importance, which is also called the Saaty scale. Odd numbers (1, 3, 5, 7, 9) represent equal importance, moderate importance, strong importance, very strong importance, and extreme importance, respectively, and even numbers (2, 4, 6, 8) represent intermediate values between adjacent judgments (Hafizan et al. 2016).

  3. (3)

    In order to test the consistency of markings in matrix A, the random consistency ratio (CR) is calculated as

    $$\begin{aligned} CR=\frac{CI}{RI}. \end{aligned}$$
    (2)

In the formula, the coincident indicator (CI) is

$$\begin{aligned} CI=\frac{\lambda _{\mathrm{max}} -n}{n-1}, \end{aligned}$$
(3)

where \(\lambda _{\mathrm{max}}\) is the maximum eigenvalue of A and RI is a reference value, i.e., an indicator of the impact of category i on the reference area. The CR ranges from zero to one, and it can be used to determine the consistency of the judgment entries. It is widely agreed that the comparison matrix, A, is acceptable if CR \(< 0.1\). Otherwise, the comparison matrix should be revised.

  1. (4)

    If CR < 0.1, according to Eqs. (2) and (3), the matrix weight values, \(w_{i}\), are

    $$\begin{aligned} w_i =\frac{v_{i}}{\mathop {\sum }\nolimits _{i=1}^{n} {v_{i}}}, \end{aligned}$$
    (4)

where \(v_{i} =\root n \of {\mathop {\prod }\limits _{j}^{n}} {a_{ij}}\). Therefore, the list of weights is

$$\begin{aligned} W_{\mathrm{index}} =\left\{ {w_1 ,w_2 ,\ldots ,w_{\mathrm{n}} ,} \right\} . \end{aligned}$$
(5)

Thus, \(W_{\mathrm{index}}\) can be used as the WF during weld defect classification.

Selecting features and determining relationships between features are key steps when using AHP. If the selected features are not reasonable, or the relationships between features are not correct, as shown in Eq. (1), the results of AHP will be inaccurate. Thus, to ensure the rationality of the hierarchy and the accuracy of defect classification, the selected features and the relationship between features should be reasonable and correct.

Improved DS method for weld defect classification

Brief introduction of DS evidence theory

DS theory is an important method in uncertainty reasoning first introduced by Dempster (1968) and later extended by Shafer (1976); it plays vital roles in the representation and fusion of uncertain information and the study of random sets (Han et al. 2014). DS theory is widely used in fields such as information fusion, pattern recognition, and decision analysis. DS theory is defined as follows.

If \(\Theta \) represents a finite set of elements, which is commonly known as the frame of discernment, the power set \(2^{\Theta }\) is the set of all subsets of \(\Theta \), including itself and the null set \(\varphi \). The basic probability assignment (BPA), m(A), also known as the mass function, is assigned to an individual in a subset of the power set. The BPA, m(A), is defined on the bounded interval [0,1], and it satisfies

$$\begin{aligned} \left\{ {{\begin{array}{l} {m(\varphi )=0,} \\ \mathop {\sum }\limits _{A\subseteq \Theta } {m(A)=1}. \\ \end{array}}} \right. \end{aligned}$$
(6)

The belief function (Bel) is a function of each subset A of \(\Theta \), and Bel(A) is a measure of one’s belief that proposition A is correct. It is defined as follows:

$$\begin{aligned} Bel(A)=\mathop {\sum }\limits _{B\subset A} {m(B)} (\forall A\subset \Theta ). \end{aligned}$$
(7)

Dempster’s rule of combination, which provides an approach through which several belief functions can be determined in the same \(\Theta \), is defined as

$$\begin{aligned} m_{joint} (C)= & {} \left( {m_1 \oplus m_{2}} \right) (C)\nonumber \\= & {} \left\{ {{\begin{array}{ll} {\frac{1}{1-K}\sum _{A_1 \cap A_2 =C} {m_1 (A_1 )m_2 (A_2 )},}&{} \quad {\forall C\subset \Theta ,C\ne \varphi ,} \\ 0, &{}\quad C=\varphi , \\ \end{array}}} \right. \nonumber \\ \end{aligned}$$
(8)

where K is the normalization factor,

$$\begin{aligned} K=\sum _{A_1 \cap A_2 =\varphi } {m_1 (A_1 )m_2 (A_2 )} < 1, \end{aligned}$$
(9)

which denotes the degree of conflict between \(A_{1}\) and \(A_{2}\), with 0 representing no conflict and 1 representing total conflict. If K is too large, Dempster’s rule of combination will fail. With Eqs. (8) and (9), the fusion vector R is

$$\begin{aligned} R=\left[ {r_{1} ,r_{2} ,\ldots , r_{N} } \right] , \end{aligned}$$
(10)

where \(r_{i} (i=1,2,\ldots ,N)\) represents the output of each class in the frame of discernment, \(\Theta \). To assist in numeric decision-making for weld defect classification, the “decision rule” is formulated as follows: if \(r_{i} >\eta \, (i=1,2,\ldots ,N)\) and \(r_{i}\) is the maximum, then target S is of class \(X_{i}\) . In this study, the threshold \(\eta \) is set as 0.50.

The following example serves to describe the detailed steps of the calculation of R. Supposing a frame of discernment \(\Theta = \{A_{1},\, A_{2}, {\theta }\}\) where each proposition in \(\Theta \) symbolically represents two weld defects classes (i.e., \(A_{1}\) represents PO and \(A_{2}\) represents SL), and \(\theta \) represents the uncertainty of a defect belonging to \(A_{i}\), three BPAs \((m_{1}, m_{2}\), and \(m_{3})\) are considered in this frame of discernment:

$$\begin{aligned} m_{1}< A_{1,} A_{2,} \theta>= & {} m_{1}< 0.8,\, 0.1, \,0.1>, \\ m_{2}< A_{1,} A_{2,} \theta>= & {} m_{2}< 0.6,\, 0.3, \,0.1>, \\ m_{3}< A_{1,} A_{2,} \theta>= & {} m_{3} < 0.75,\, 0.2, \,0.05 >. \end{aligned}$$

Then, the new BPA, \(m_{\mathrm{joint}}\), is calculated based on \(m_{1}\) and \(m_{2}\) with Eqs. (8) and (9):

$$\begin{aligned} K= & {} \mathop {\sum }\limits _{A_1 \cap A_2 =\varphi } {m_{1} (A_1 )m_2 (A_2 )} \\= & {} m_{1} (A_1 )m_2 (A_2 )+m_1 (A_2 )m_2 (A_1 )=0.24+0.06=0.3, \\ m_{\mathrm{joint}} (A_1 )= & {} \frac{m_1 (A_{1} )m_2 (A_1 )+m_1 (A_1 )m_2 (\theta ) +m_{1} (\theta ) m_2 (A_1 )}{1-K}\\= & {} \frac{0.\hbox {48}+0.0\hbox {8}+0.0\hbox {6}}{1-0.3}=0.\hbox {8857}, \\ m_{\mathrm{joint}} (A_2 )= & {} \frac{m_{1} (A_{2})m_2 (A_2 )+m_1 (A_2 )m_2 (\theta ) +m_1 (\theta )m_2 (A_2 )}{1-K}\\= & {} \frac{0.0\hbox {3}+0.0\hbox {3}+0.0\hbox {1}}{1-0.3}=0.1, \\ m(\theta )= & {} \frac{m_1 (\theta )m_2 (\theta )}{1-K}=\frac{0.01}{1-0.3}=0.0143. \end{aligned}$$

In the same manner, \(m_{\mathrm{joint}}\) can be combined with \(m_{3}\) as:

$$\begin{aligned} K= & {} m_{\mathrm{joint}} (A_{1})m_3 (A_2 )+m_{\mathrm{joint}} (A_{2})m_{3} (A_{1})\\= & {} 0.11714+0.075=0.25214, \\ {m}^{'}_{\mathrm{joint}} (A_{1})= & {} \frac{m_{\mathrm{joint}} (A_{1})m_3 (A_{1})+m_{\mathrm{joint}} (A_1 )m_3 (\theta )+m(\theta )m_{3} (A_{1})}{1-K}\\= & {} 0.9618, \\ {m}^{'}_{\mathrm{joint}} (A_{2})= & {} 0.0373, \\ {m}^{'}(\theta )= & {} \frac{m(\theta )m_{3} (\theta )}{1-K} =\frac{0.0143\times 0.05}{1-0.25214}=0.0009. \end{aligned}$$

Here, the fusion vector, R, is defined as

$$\begin{aligned} R= & {} \left[ {r_{1},\, r_{2}, \, r_{3} } \right] =\left[ {m}^{'}_{\mathrm{joint}} (A_{1}),\, {m}^{'}_{\mathrm{joint}} (A_{2}),\;{m}^{'}(\theta )\right] \\= & {} \left[ 0.9618,\, 0.0373,\, 0.0009\right] . \end{aligned}$$

Finally, each \(r_{j} \left( {j= 1,\, 2, 3} \right) \) represents the output of each class in the frame of discernment \(\Theta =\left\{ {A_1 ,A_2 ,\theta }\right\} ,\) and \(r_{j}\) represents the belief committed exactly to \(A_{j}\). This example shows that the weld defect class is most likely class \(A_{1}\) (porosity) because \(r_{1}\) is the maximum value in the fusion vector, R.

SVF identification in frame of discernment

Given N classes of weld defects and M features, the weld defect class can be described as a standard class matrix, \(S_{0}\):

$$\begin{aligned} S_{0} =\left[ {{\begin{array}{l} {X_{1}} \\ {X_{2}} \\ \vdots \\ {X_{N}}\\ \end{array}}} \right] =\left[ {{\begin{array}{cccc} {x_{11}}&{}\quad {x_{12}}&{}\quad \cdots &{}\quad {x_{1M}} \\ {x_{21}}&{}\quad {x_{22}}&{}\quad \cdots &{}\quad {x_{2M}} \\ \vdots &{}\quad \vdots &{}\quad \cdots &{}\quad \vdots \\ {x_{N1}}&{}\quad {x_{N2}}&{}\quad \cdots &{}\quad {x_{NM}} \\ \end{array}}} \right] . \end{aligned}$$
(11)

In addition, the frame of discernment can be defined as \(\Theta = \{X_{1}, X_{2}, {\ldots }, X_{\mathrm{N}}\}\), where \(X_{i}\) represents the i-th class of weld defect.

When using the DS method, the first problem is deciding \(\Theta \) by determining the SVF, \(x_{ij} (j = 1, 2, {\ldots }, M)\), for \(X_{\mathrm{i}}\). Because the \(x_{ij}\) have different values, some will be too large, while others will be small, so it is necessary to develop a reliable method for obtaining a standard \(x_{ij}\). Generally, the value of feature \(x_{ij}\) approximately obeys the normal distribution, so a method was developed to calculate feature \(x_{ij}\) based on FHA (Dudewicz 1999). As shown in Fig. 3, FHA is a statistical tool for presenting numerous data in a form that clarifies the central tendency and the dispersion along the scale of measurement. Many feature samples, \(x_{ij}\), were used to construct the histograms, wherein each column represents the frequency of a certain value of \(x_{ij}\). Then, the standard deviation of \(x_{ij}\), which follows a normal probability distribution (NPD), is calculated and used as an estimate of \(\upsigma \). Finally, because ± 3\(\upsigma \) standard deviations includes 99.73% of the population, most \(x_{ij}\) are included for calculating the SVF, \(x_{ij}\), and the mean value of samples (MVS) can be calculated if the samples belong to ± 3\(\upsigma \) standard deviations; otherwise, if samples are beyond the ± 3\(\upsigma \) standard deviations, they should not be used for the calculation. Then, the MVS can be used as the SVF, \(x_{ij}\), and the frame of discernment \(\Theta \) can be obtained, which is more reliable for defect classification.

Fig. 3
figure 3

Frequency histograms analysis (FHA) for feature \({x}_{ij}\)

Improved Dempster’s rule of combination based on WF

As shown in “Brief introduction of DS evidence theory” section, the calculation of m is another crucial aspect that determines whether DS evidence theory will succeed. In this paper, the BPA, m, is calculated based on the WF of each feature and the frame of discernment \(\Theta \); namely, the main steps of calculating m are similar to the method proposed in previous work (Jiang et al. 2016).

Supposing that the target weld defect, S, contains the testing data that need to be classified, it can be defined as the vector

$$\begin{aligned} S=\left[ {s_1 ,s_2 ,\cdots ,s_j ,\cdots ,s_M } \right] , \end{aligned}$$
(12)

where \(s_{j}\) represents the j-th feature of target weld defect S.

Based on Eqs. (11) and (12), the similarity matrix, P, which indicates the similarity of S to \(S_{0}\), can be obtained as

$$\begin{aligned} P=\left[ {{\begin{array}{cccc} {p_{11} }&{}\quad {p_{12} }&{}\quad \cdots &{}\quad {p_{1M} } \\ {p_{21} }&{}\quad {p_{22} }&{}\quad \cdots &{}\quad {p_{2M} } \\ \vdots &{}\quad \vdots &{}\quad \ddots &{}\quad \vdots \\ {p_{N1} }&{}\quad {p_{N1} }&{}\quad \cdots &{}\quad {p_{NM} } \\ \end{array} }} \right] =\left[ {{\begin{array}{l} {P_1 } \\ {P_2 } \\ \vdots \\ {P_N } \\ \end{array} }} \right] , \end{aligned}$$
(13)

where

$$\begin{aligned} p_{ij} =\hbox { }1/d_{ij}, \end{aligned}$$
(14)

and

$$\begin{aligned} d_{ij} =\left| {s_j -x_{ij} } \right| {\begin{array}{ll} &{} {i=1,2,\ldots ,N;j=1,2,\ldots ,M} \\ \end{array}}. \end{aligned}$$
(15)

As shown in Eq. (13), a larger \(P_{i}\) indicates that the target weld defect, S, more likely belongs to the i-th class of weld defect. Conversely, a smaller \(P_{i}\) indicates that the target weld defect, S, less likely belongs to the i-th class of weld defect.

Fig. 4
figure 4

Procedure for weld defect classification based on the proposed method

To obtain the BPA, m, the normalizing P is calculated via

$$\begin{aligned} {p}^{'}_{ij} ={p_{ji}}\Bigg /{\mathop {\sum }\limits _{k=1}^N {p_{kj}}}, \end{aligned}$$
(16)

such that

$$\begin{aligned} {P}^{'}=\left[ {{\begin{array}{llll} {{p}^{'}_{11} }&{}\quad {{p}^{'}_{12} }&{}\quad \cdots &{}\quad {{p}^{'}_{1N} } \\ {{p}^{'}_{21} }&{}\quad {{p}^{'}_{22} }&{}\quad \cdots &{}\quad {{p}^{'}_{2N} } \\ \vdots &{}\quad \vdots &{}\quad \ddots &{}\quad \vdots \\ {{p}^{'}_{M1}}&{}\quad {{p}^{'}_{M2} }&{}\quad \cdots &{}\quad {{p}^{'}_{MN} } \\ \end{array}}} \right] =\left[ {{\begin{array}{l} {{P}^{'}_{m_1}} \\ {{P}^{'}_{m_2}} \\ \vdots \\ {{P}^{'}_{m_M } } \\ \end{array} }} \right] . \end{aligned}$$
(17)

Then, \(P^{\prime }\) can be defined as the BPA, m, of \(X_{j}\), which satisfies Eq. (6):

$$\begin{aligned} \left[ {{\begin{array}{cccc} {m_1 (X_1 )}&{}\quad {m_1 (X_2 )}&{}\quad \cdots &{}\quad {m_1 (X_N )} \\ {m_2 (X_1 )}&{}\quad {m_2 (X_2 )}&{}\quad \cdots &{}\quad {m_2 (X_N )} \\ \vdots &{}\quad \vdots &{}\quad \vdots &{}\quad \vdots \\ {m_M (X_1 )}&{}\quad {m_M (X_2 )}&{}\quad \cdots &{}\quad {m_M (X_N )} \\ \end{array} }} \right] =\left[ {{\begin{array}{l} {{P}^{'}_{m_1 } } \\ {{P}^{'}_{m_2 } } \\ \vdots \\ {{P}^{'}_{m_M } } \\ \end{array} }} \right] . \end{aligned}$$
(18)

Considering the WF, which was calculated in “Calculation of WF based on AHP” section, to account for the different WFs in defect classification, Dempster’s rule of combination was revised as follows:

$$\begin{aligned} m_{i}^{\prime } (X)=w_i \cdot m_i (X) \quad (i=1,2,\ldots M). \end{aligned}$$
(19)

Then, Dempster’s rule of combination from Eqs. (8) and (9) are revised as:

$$\begin{aligned} m_{joint} (C)= & {} \left( {m_1^{\prime } \oplus m_2^{\prime }} \right) (C)\nonumber \\= & {} \left\{ {{\begin{array}{ll} {\frac{1}{1-K}\mathop {\sum }\limits _{A_1 \cap A_2 =C} {m_1^{\prime } (A_1 )m_2^{\prime } (A_2 )} }&{}\quad {\forall C\subset \Theta ,C\ne \varphi } \\ 0&{}\quad {C=\varphi } \\ \end{array}}} \right. ,\nonumber \\ \end{aligned}$$
(20)

where

$$\begin{aligned} K=\mathop {\sum }\limits _{A_1 \cap A_2 =\varphi } {m_1^{\prime } (A_{1})m_2^{\prime } (A_{2})<1}. \end{aligned}$$
(21)

The fusion vector, R, can be calculated based on \(m_i^{\prime } (i=1,2,\ldots M)\), and the main steps of the method can be described as follows:

  1. (a)

    Combining \(m_{1}^{\prime } \) with \(m_{2}^{\prime } \) using Eqs. (20) and (21) yields \(m_{joint} \).

  2. (b)

    Combining \(m_{joint} \) with \(m_{3}^{\prime } \) using Eqs. (20) and (21) yields a new \(m_{joint} \).

  3. (c)

    Repeating step (b) for the remaining \(m_{i}^{\prime } (i=1,2,\ldots M)\) yields the final fusion vector R.

In summary, the main steps of the weld defects classification method based on AHP and DS evidence theory are shown in Fig. 4. Firstly, several techniques, such as image preprocessing, defect segmentation, and feature extraction, are performed sequentially to obtain features for defect classification. Second, as shown in the dashed-line box in Fig. 4, the procedure of the proposed method includes three steps:

  1. (a)

    The WF of the selected features is calculated based on AHP using Eqs. (1)–(5).

  2. (b)

    The standard class matrix, \(S_{0}\), is calculated based on FHA using Eq. (11).

  3. (c)

    The weld defect is classified using an improved Dempster’s method based on the \(S_{0}\) and WF of each feature. Specifically, the BPA, m, is calculated using Eqs. (12)–(18), and the fusion vector, R, is caluclated using Eqs. (19)–(21).

Weld defect classification case study

To illustrate the effectiveness of the method proposed in this paper, 172 samples covering 5 classes of weld defects were selected: 32 PO, 41 SL, 38 LP, 35 LF, and 26 CR. The radiographic images were provided by the Dongfang Turbine Co., Ltd, Sichuan, China, and the samples were produced from the digitized radiographic images using an X-ray film scanner (JD-RTD) that was developed in-house, as shown in Fig. 5.

Fig. 5
figure 5

Automatic radiographic testing system (JD-RTD)

Table 2 Standard class matrix, \(S_{0}\)

With these five classes of weld defect, the weld defects’ frame of discernment can be defined as

$$\begin{aligned} \Theta =\left\{ {X_{1} ,\;X_{2} ,\;X_{3} ,\;X_{4} ,\;X_{5}} \right\} , \end{aligned}$$
(22)

where \(X_{1} ,X_{2} ,X_{3} ,X_{4},\) and \(X_{5}\) represent PO, SL, LP, LF, and CR, respectively. First, as demonstrated in “Calculation of WF based on AHP” section, the comparison matrix, A, can be obtained via AHP:

$$\begin{aligned} A= & {} \left[ {{\begin{array}{cccc} {a_{11} }&{}\quad {a_{12} }&{}\quad \cdots &{}\quad {a_{1n} } \\ {a_{21} }&{}\quad {a_{22} }&{}\quad \cdots &{}\quad {a_{2n} } \\ \cdots &{}\quad \cdots &{}\quad \cdots &{}\quad \cdots \\ {a_{n1} }&{}\quad {a_{n2} }&{}\quad \cdots &{}\quad {a_{nn} } \\ \end{array} }} \right] \nonumber \\= & {} \left[ {{\begin{array}{c|ccccccc} &{} L &{}\quad {Ar} &{}\quad {Sp} &{}\quad {Re} &{}\quad {Rr} &{}\quad {Sk} &{}\quad {Ku} \\ \hline L &{}\quad 1 &{}\quad {\hbox {0.5}} &{}\quad \hbox {4} &{}\quad \hbox {3} &{}\quad {\hbox {0.33}} &{}\quad \hbox {5} &{}\quad \hbox {6} \\ {Ar} &{}\quad \hbox {2} &{}\quad 1 &{}\quad \hbox {5} &{}\quad \hbox {3} &{}\quad {\hbox {0.5}} &{}\quad \hbox {6} &{}\quad \hbox {7} \\ {Sp} &{}\quad {\hbox {0.25}} &{}\quad {\hbox {0.2}} &{}\quad 1 &{}\quad {\hbox {0.5}} &{}\quad {\hbox {0.17}} &{}\quad \hbox {2} &{}\quad \hbox {3} \\ {Re} &{}\quad {\hbox {0.5}} &{}\quad {\hbox {0.33}} &{}\quad \hbox {2} &{}\quad 1 &{}\quad {\hbox {0.2}} &{}\quad \hbox {3} &{}\quad \hbox {5} \\ {Rr} &{}\quad \hbox {3} &{}\quad \hbox {2} &{}\quad \hbox {6} &{}\quad \hbox {5} &{}\quad 1 &{}\quad \hbox {7} &{}\quad \hbox {9} \\ {Sk} &{}\quad {\hbox {0.2}} &{}\quad {\hbox {0.17}} &{}\quad {\hbox {0.5}} &{}\quad {\hbox {0.33}} &{}\quad {\hbox {0.14}} &{}\quad 1 &{}\quad \hbox {2} \\ {Ku} &{}\quad {\hbox {0.17}} &{}\quad {\hbox {0.14}} &{}\quad {\hbox {0.33}} &{}\quad {\hbox {0.2}} &{}\quad {\hbox {0.11}} &{}\quad {\hbox {0.5}} &{} 1 \\ \end{array} }} \right] .\nonumber \\ \end{aligned}$$
(23)

From Eqs. (2) and (3), \(CR=0.034 < 0.1\), which means that the consistency is acceptable. Therefore, the WF can be obtained via Eqs. (4) and (5):

$$\begin{aligned} W= & {} \left\{ {{\begin{array}{ccccccc} {w_1 }&{} {w_2 }&{} {w_3 }&{} {w_4 }&{} {w_5 }&{} {w_6 }&{} {w_7 } \\ \end{array}}} \right\} \nonumber \\= & {} \{{\begin{array}{ccccccc} {\hbox {0.17}}&{} {\hbox {0.24}}&{} {\hbox {0.06}}&{} {\hbox {0.1}}&{} {\hbox {0.37}}&{} {\hbox {0.04}}&{} {\hbox {0.03}} \\ \end{array} }\}. \end{aligned}$$
(24)

For the frame of discernment, \(\Theta \), all 172 samples were selected for calculation of the standard class matrix, \(S_{0}\), in the frame of discernment based on the method described in “SVF identification in frame of discernment” section; the results of which are shown in Table 2.

After obtaining the standard class matrix, \(S_{0}\), one weld defect sample is selected as the target weld defect, S, and the method described in “Improved Dempster’s rule of combination based on WF” section is applied to obtain the fusion vector \(R= [r_1 ,r_2 ,r_3 ,r_4 ,r_5 ],\) where each \(r_{j} \left( {j=1,2,\ldots ,5} \right) \) represents the output of each \(X_{i}\) in the frame of discernment \(\Theta =\left\{ {X_1 ,X_2 ,X_3 ,X_4 ,X_5 } \right\} \), respectively. For each case analysis, each correct recognition rate (CRR\((i) (i=1,2,3)\)) is determined by the maximum, second-maximum, and third-maximum of R, respectively.

The following example illustrates the calculation of the CRR. Four samples of defect type \(X_{1}\) (PO) are selected as the target weld defects, S, and the result, R, is shown in Table 3, in which the bold entries indicate the maximum values of \(R_{i}\).

From Table 3, taking sample no. 1 as S, the fusion vector is \(R= \left[ {r_{1} ,r_{2} ,r_{3} ,r_{4} ,r_{5}} \right] =\left[ {0.93987,\,0.00448,\,0.00206,\,}\right. \left. {0.03830,\,0.01532} \right] \); thus, \(r_1 =\,0.93987\) satisfies the decision rule, as \(r_1 >\,0.5\) and it is the maximum of the fusion vector, so the target weld defect, S, can be correctly classified as \(X_{1}\) (PO). Because there is only one target weld defect S for which \(r_{1}\) is the maximum, the CRR(1) for classification is:

$$\begin{aligned} \hbox {CRR}\left( 1\right) = 1/4 = 25\%. \end{aligned}$$

Considering sample no. 2 of S, \(r_{1} =0.19593\) and \(r_2 =0.69580\); this is a classification error in that S is classified as SL with the decision rule. Moreover, because \(r_2>r_{1}>r_{5}>r_{4} >r_{3} ,S\) is less likely to be a PO weld defect than an SL weld defect. That is, if the classification is decided by the second-maximum \(r_1 \), the result is correct. Therefore, the \(r_i \) can be arranged in descending order, representing the corresponding possibility that S is classified as \(X_{\mathrm{i}}\). Because \(r_{1}\) (0.19593) is the second-maximum,

$$\begin{aligned} \hbox {CRR}\left( 2 \right) = 1/4 + \hbox {CRR}\left( 1 \right) = 50\% . \end{aligned}$$

Considering sample no. 3 of S, CRR(3) can similarly be calculated:

$$\begin{aligned} \hbox {CRR}\left( 3 \right) = 1/4 + \hbox {CRR}\left( 2 \right) = 75\% . \end{aligned}$$

Results

The resulting classifications of produced by the proposed method are shown in Table 4, which includes 172 samples covering 5 classes of weld defects. For LF and CR weld defects, the proposed method has excellent performance with CRR(1) at 91.42 and 96.15%, respectively. Because LF and CR weld defects differ greatly from the other weld defect classes, they can easily be classified correctly by using the WF. Because SL and LP weld defects are respectively similar to PO and CR weld defects, the method has poor performance with CRR(1) at 68.29 and 73.68% for SL and LP weld defects, respectively. Ultimately, for PO, SL, and LP, the proposed method has good performance with CRR(3) at 100%. This indicates that the method proposed in this paper yields significant separability, so it is suitable for classification of weld defects.

Table 3 Examples of R
Table 4 Classification results of weld defects

Discussion

Table 5 shows the aggregate CRR of the proposed method for all test samples; additionally, the performance of the proposed method is compared to those of the traditional DS method [7, 27, DMSVM, PCA-SVM, and MLP-ANN. The proposed method is more accurate than the traditional DS method because the CRR(1) and CRR(2) values obtained using the former method are higher than those from the latter method. This result indicates that the technique combined with the calculation of WF based on AHP strongly influences the results of the proposed method, i.e., reasonable weights are assigned to different features through AHP and the proposed technique improves the classification accuracy. For example, as for the LF and CR weld defects, the features of W, Ar, and Rr are very important factors in weld defect classification. Therefore, the CRR is higher when the WF of W, Ar, and Rr are higher. Meanwhile, because FHA can capture the feature’s distribution, the \(S_{0}\) calculated based on FHA with few training samples can be more reliable for defect classification, improving the performance of the proposed method.

Table 5 Comparison of performance

Although the CRR(1) of the proposed method is only 81.98%, CRR(2) is larger than the CRRs of DMSVM, PCA-SVM and MLP-ANN. If the three largest values of \(r_{i}\) are used together to decide the classification of the weld defect, CRR(3) is 100%. Additionally, an engineer sometimes needs to instead be given the probability of a defect belonging to the different categories—this accounts for some weld defects that cannot be classified due to the similarity between different types of defects, e.g., between LP and CR or between PO and LF. Thus, the proposed method not only has good classification performance, but it can also address the uncertainty associated with weld defect classes, i.e., by calculating the combinations of mass functions, the proposed method determines the belief committed to a weld defect class. Additionally, because the proposed method is an information fusion technique, it can be applied to some pattern recognition problems, e.g., fault diagnosis, target recognition and disease diagnosis.

However, the classification method based on DS also has some shortcomings. The first is deciding the frame of discernment \(\Theta \), i.e., deciding the SVF in the frame of discernment. Generally, the SVF can greatly improve the separability of elements in the frame of discernment. The second one is calculating the BPA, m, which is important for achieving higher defect classification accuracy. Additionally, as shown in “Brief introduction of DS evidence theory” section, accurate determination of m is essential for DS evidence theory to succeed. Furthermore, each method—including DMSVM, PCA-SVM, MLP-ANN, and the proposed method—has its limitations, e.g., PO, SL, and LP could each only be classified by some of the methods. Using two or more complementary methods would be a good solution to this problem.

Conclusion

In this paper, a method for classifying weld defects based on AHP and DS evidence theory was proposed; this new method mainly includes three novel contributions: (1) calculation of the WF of a weld defect based on AHP, (2) determination of the SVF in the frame of discernment, and (3) an improved Dempster’s rule for combination. A case study of weld defect classification was provided to illustrate the method and test its effectiveness. The results show that the proposed method can fuse different features to effectively make classification decisions based on few training samples and that it has better performance than traditional DS methods. Additionally, although the CRR(1) of the proposed method is slightly smaller than those of traditional methods, such as DMSVM and PCA-SVM, its CRR(2) is significantly larger than the CRRs of traditional methods. Notably, the proposed method can address the uncertainty associated with weld defect classification, i.e., it can give the probability of a defect belonging to different defect types if there is little difference between the weld defects.

Future work will focus on (1) studying the method for weld defect feature extraction and (2) calculation of the mass function and combined rules of DS evidence theory. In this paper, only seven features were used for classification; the CRR can be significantly improved if additional suitable features can be selected. Additionally, the mass function, also called the BPA, is crucial for successfully applying DS evidence theory, so better combination rules could eliminate conflicts between various forms of evidence. In fact, the classification error in the proposed method mostly resulted from conflicts between weld features. Therefore, the mass function and combination rules should be investigated in future studies.