Introduction

There are higher demands on aero engines’ service life and thermal efficiency in the new aircraft specifications (Ref 1, 2). The high-pressure turbine blade is one of the core components of an aero engine (Ref 3), and it works in environments of high temperature and high pressure, high stress load, and strong airflow erosion. Through practice, it has been proved that thermal barrier coating technology is currently the most effective way to significantly increase the service temperature and life of high-temperature alloys (Ref 4,5,6,7), and the quality of coatings is one of the critical factors influencing performance of high-pressure turbine blade. Atmospheric plasma spraying (APS) is used widely to produce thermal barrier coatings, there are more than 50 process parameters that influence the coupling between plasma and powder by influencing particle temperature, velocity, molten state and particle distribution, which finally determine a coating’s microstructure and performance (Ref 8, 9).

In order to guide the regulation of the APS process, some researchers established online monitoring means, such as online particle temperature, velocity monitoring and online performance (thickness, residual stress and mechanical properties) evaluation (Ref 10,11,12,13). Zhang et al. (Ref 14) explored the temperature and velocity trends of particles in the APS spraying process, as well as their connection with the coating morphology, and designed a comprehensive computational scheme for the integral solution of the control equations of the whole process of spraying, which achieved the characterization of the energy changes of their spraying process by a mathematical model. Ye et al. (Ref 15) used Spray Watch 2i (Oseir Ltd., Tampere, Finland) to monitor the process of supersonic plasma spraying, and investigated the relationship between the temperature of the particles in the spraying process and the variation of the spraying distance, which provided some theoretical references for the control of the spraying process parameters.

The protective ability of the thermal barrier coating to the sprayed substrate is determined by the microstructure of the coating, the macroscopic coating distribution and the interface morphology of the coating coverage area. In reference to Lorenzo-Bañuelos et al. (Ref 16), it is suggested that in order to optimize the three-dimensional morphological characteristics of coatings and their basic performance, it is important to establish a correlation between the standard process parameters of spraying and the morphological characteristics of coatings in practical spraying operations. In the realm of microscale morphology in coatings, the Gaussian distribution model has been widely applied for simulating coating profiles and predicting thickness (Ref 17,18,19). Furthermore, Wu et al. (Ref 20) have underscored the relationship between Gaussian function parameters and factors such as the actual feedstock flow rate, spray angle, and spraying distance. Trifa et al. (Ref 21) investigated the thickness distribution and interfacial morphology of APS coating profiles on a substrate. They made measurements of the coating profiles using two inspection devices: a coordinate measuring machine and a laser profiler. Additionally, the coating profile was fitted using a Gaussian function, and key parameters such as the maximum height, width at half-maximum height, area of deposit, and horizontal offset of the deposit peak relative to the reference line were extracted. These parameters were employed to define the geometric shape of the deposition model. A mathematical model was then established to elucidate the relationship between process parameters and the impact on the coating. Subsequently, Trifa et al. (Ref 22) analyzed the coating profile topography using this model and suggested the use of a set of parameters to comprehensively describe the essential features of the coating topography. This approach offers valuable insights for the process quality diagnosis of APS coatings. Although Trifa et al.’s research on coating profiles is primarily based on one-dimensional morphology (fixed-point deposition), it is essential to note that parameters such as the maximum height, full width at half maximum, and area of the one-dimensional Gaussian fitting curve directly impact the thickness of the final thermal barrier coating. The horizontal offset of the peak reflects the actual plasma jet center’s deviation and the tilt degree of the molten particles, both of which directly influence the porosity of the final coating. Therefore, the study of coating profiles plays a crucial role in controlling the thermal insulation performance of the coating.

Machine learning lies at the core of the fourth industrial revolution, commonly referred to as Industry 4.0. In the realm of spraying research, Mauer et al. (Ref 23) shed light on the substantial advantages and heightened reliability stemming from the integration of machine learning tools in the current diagnosis and control of thermal spraying processes. Bobzin et al. (Ref 24) utilized Residual Neural Network (ResNet) and Support Vector Machine (SVM) for the efficient estimation of in-flight particle properties in APS. Zhu et al. (Ref 25) applied Convolutional Neural Network (CNN) to optimize APS process parameters, particularly beneficial for newcomers to the field. Liu et al. (Ref 26, 27) employed Artificial Neural Network (ANN) to predict High Velocity Oxygen Fuel (HVOF) sprayed coatings and developed two ANN models for improved process control. Kolluru et al. (Ref 28) utilized ANN to predict shear strength in spark plasma sintering (SPS), while Zhao et al. (Ref 29) used statistical tools to analyze the impact of process parameters on the porosity of YSZ coatings. In cold spray research, Wang et al. (Ref 30) employed feature selection and ANN to predict critical velocity. Canales et al. (Ref 31) utilized machine learning to describe the deposition window and predict efficiency in cold spraying, suggesting optimization of material properties. In Electrostatic Spray Deposition (ESD) research, Paturi et al. (Ref 32) used ANN and SVM to predict coating thickness, and a genetic algorithm (GA) to optimize ESD deposition parameters. The aforementioned studies offer valuable methods for controlling the spraying process and establishing a mapping model between process parameters and coating performance. However, these studies were unable to simultaneously establish a connection between these three components. Furthermore, model training requires a larger sample dataset to ensure its accuracy.

This paper employs laser three-dimensional scanning technology to achieve rapid detection of the one-dimensional morphology of plasma-sprayed thermal barrier coatings. It conducts a digital analysis of the one-dimensional morphology, extracting Gaussian peak features from the coating cross section. Based on PCA-KNN and EFA models, the paper accomplishes swift diagnosis of coating quality and establishes the intrinsic correlation among coating process parameters, spraying processes, and coating quality. This provides a novel approach for diagnosing the process quality of thermal barrier coatings.

Experimental Procedures

Sample Preparation

Compared with the whole spraying process of industrial production, one-dimensional morphology coating is formed by fixed-point spraying, and two-dimensional morphology coating is formed by reciprocating linear motion spraying in the horizontal direction of the spray gun. At this time, the vertical feed motion is superimposed to form three-dimensional morphology coating. Macroscopically, it is similar to the process of point moving into line, line moving into surface and surface moving into body. The two-dimensional and three-dimensional morphology cannot directly reflect the coupling and migration of plasma jet and particles in all directions due to the stacking of molten particles. The one-dimensional morphology formed by fixed-point spraying provides a new perspective to directly study the influence of process parameters on quality, which lays a foundation for the subsequent study of two-dimensional morphology and three-dimensional morphology.

The one-dimensional morphology coating was prepared by using nano-agglomerated 8YSZ powder (38-61 μm, Beijing Jinlunkuntian Special Machine Co., Ltd., Beijing, China), was sprayed by 9 MB torch (Beijing Zhenbang Aerospace Precision Machinery Co., Ltd., Beijing, China), and the spraying power was 45 kW. The physical center of the anode outlet of the spray gun is perpendicular to the physical center of the sample. The spraying time of each sample was 10 s. Since in the plasma spraying process, the deviation variable generally consists of plasma gun power (affected by current, voltage and H2 flow), powder feeding parameters, main gas flow (Ar flow) and nozzle usage time, the experimental parameters of the anomalous samples under the deviation process were selected based on these four perspectives. Concerning powder injection parameters, although the powder injection angle affects particle temperature and velocity, and the injection distance influences particle divergence width, research indicates that their impact on coating performance is significantly less than that of the carrier gas flow rate (Ref 33). In industrial batch production, once process parameters are established, they are less susceptible to influences from the working environment and nozzle manufacturing errors. Therefore, this paper does not consider the injection angle and injection distance as deviation variables but focuses solely on the carrier gas flow rate as the variable for deviation.

Table 1 presents the detailed parameters of the APS (atmospheric plasma spraying) process in the experiment. Fig. 1 displays samples from the one-dimensional morphology training set. As the spraying state may undergo slight changes with an increase in nozzle working time during spraying, standard process parameters are consistently employed in the creation of standard samples. A new nozzle is utilized to consecutively spray S-1 to S-20, capturing not only the subtle variations in the spraying state but also enhancing the diversity of standard samples. In the preparation of abnormal samples, the impact of nozzle working time growth is considerably less significant than the influence of nozzle health status and process parameters on the coating. Therefore, four nozzles with varying degrees of burnout are used for the preparation of UB-1 to UB-4, and four nozzles exceeding their service life are employed in the creation of UO-1 to UO-4. For other types of samples that deviate from the standard process parameters, because the range of process parameters can be set is wider than the health status of the nozzle, new nozzles are used in the preparation, and 3 parameters are set up above and below the standard value, and 6 samples are prepared for each type. Finally, a total of 46 training set samples were obtained.

Table 1 APS process parameters of training set samples
Fig. 1
figure 1

One-dimensional morphology samples of the training set

The coating morphology of the one-dimensional standard samples of fixed-point spraying under the standard process is almond-shaped, and the microscopic profile of the coating morphology under other deviation processes is greatly different from the profile under the standard process, so it is feasible to diagnose the spraying process by the microscopic profile characteristics of the coating.

Digital Representation

Due to the small quantity of the training set samples, it was difficult to extract image features in order to conduct process diagnosis by using image processing models commonly used in deep learning (such as convolutional neural networks, recurrent neural networks, etc.), so this design attempted to extract the features from the three-dimensional morphology of the coating, and to analyze the dataset by using machine learning algorithms applicable to small sample sizes.

There is a coupling between plasma and powder during the forming process of APS thermal barrier coatings, which leads to a Gaussian distribution on the bonding surface of the coating. Therefore, the analysis of Gaussian curves on the coating is essential for research on coating performance. The Gaussian function is as follows:

$$f\left( x \right) = Ae^{{ - \frac{{\left( {x - X_{c} } \right)^{2} }}{{2\omega^{2} }}}}$$
(1)

A, Xc and ω are all real numbers, and, A > 0. where A is the peak value of Gaussian curve, Xc is the corresponding horizontal coordinates, and ω is the standard deviation. Fig. 2 is a Gaussian curve parametric plot. In the plot, full width at half maximum FWHM numerically equal to 2.354 * ω, the Gaussian peak area is the area bounded by the curve and the horizontal line.

Fig. 2
figure 2

Gaussian curve characteristic parametric plot

In order to show the differences of the plasma spray process under different process parameters through one-dimensional morphology of the coatings, the Gaussian peak characteristic parameters were associated with the plasma spraying process to construct a characterization relationship, as shown in Table 2.

Table 2 Characterization relationship between Gaussian peak characteristic parameters and the plasma spraying process

The laser 3D profilometer DS1050 (Cognex Corporation, Natick, Massachusetts, America) was used to scan the one-dimensional morphology samples to obtain (X, Y, Z) 3D point cloud data, with each sample consisting of approximately 4500 coordinate points (Fig. 3a). In order to extract more precise 3D morphological data of the coating, surface interpolation was used to reconstruct the original surface in 3D. Based on the initial point cloud space of the samples, a meshed plane of dimensions 150 × 150 was regenerated, and the interpolated meshed surface data were obtained using the cubic interpolation technique in the 3D space (Fig. 3).

Fig. 3
figure 3

The process of coating data extraction

In order to obtain the main feature information of the coating, a new algorithm was designed, which takes the vertical line from the highest point in the coating profile to the sample base plane as the axis, and extracts the data of the cross section curve of the profile at any angle based on this axis. Fig. 4(a) and (b) show the schematic diagram of extracting the cross section data at any angle, and the flowchart of the algorithm, respectively. The principle of the algorithm is shown in Fig. 4. The main idea is to screen the actual three-dimensional shape data points whose distance from the section is less than the given error, and project them onto the section to obtain the two-dimensional data of the section.

Fig. 4
figure 4

Principle of the algorithm for extracting cross section data at any angle ((a) Schematic diagram for extracting cross section data at any angle, (b) the algorithm flowchart for extracting cross section data at any angle)

Figure 5 shows the parameter information of peak height (A), offset (Xc), full width at half maximum (FWHM) and Gaussian peak area (Area) after Gaussian fitting of standard samples and each type of abnormal samples at 0°-180° cross section. It is evident that samples produced by the burnout nozzle and old nozzle exhibit lower A values compared to the standard sample. Furthermore, other parameters for both nozzles show non-smooth transitions across different cross sections, displaying some irregularities. Samples manufactured with deviations in the powder gas flow rate parameter have a significant impact on both A values and FWHM values, thereby influencing the melting state of particles. Deviations in the main gas flow rate parameter and spray power parameter have a pronounced effect on the Xc value, indicating a substantial displacement between the morphological center and the spray center. In contrast, the standard sample maintains consistently high levels for A, FWHM, and Area, with a smaller Xc. The transitions across different cross sections are smoother in the standard sample. Consequently, the coating profile of the standard sample is notably superior to that of the abnormal samples.

Fig. 5
figure 5

The subgraphs (a)-(h) are the peak height (A), offset (Xc), full width at half maximum (FWHM) and Gaussian peak area (Area) of the 0°-180 ° section of the coating profile of the standard sample and various abnormal samples, respectively. Among them, the process parameter deviations of abnormal samples in subgraphs (a), (c), (e) and (g) are set below the standard range, and subgraphs (b), (d), (f) and (h) are set above the standard range.

Figure 6 shows the Gaussian fitting curve of the coating cross section of sample number “S-7” and the original curve. The 0°, 45° and 90° cross section curves indirectly reflect the coupling effect of plasma and powder in the horizontal, vertical and inclined directions, showing the combination of molten particles and the surface, which is highly representative in the coating forming. In order to simplify the characterization parameters of the coating, the A, Xc, FWHM and Area of the three typical sections are used as the 12-dimensional features of the coating after digitization.

Fig. 6
figure 6

No. “S-7” sample coating cross section Gaussian fitting curve and the original curve

Method of Analysis

Principal Component Analysis (PCA) and K-Fold

Due to the limited size of the training dataset and the high dimensionality of the data, employing certain machine learning classification algorithms may lead to overfitting, potentially encountering the “curse of dimensionality” (Ref 34). In such cases, the application of techniques such as Principal Component Analysis (PCA) and K-fold cross-validation becomes crucial for mitigating overfitting and ensuring the model generalizes well across different datasets (Ref 35, 36). PCA is a method that transforms the original data into a set of principal components, representing the most significant structures and variations within the data. This transformation effectively reduces the dimensionality of the data by eliminating redundant or uncorrelated features, thereby enhancing the model’s performance (Ref 37,38,39). Assuming the input data matrix is represented by X,

$$X = \left[ {\begin{array}{*{20}c} {x_{11} } & {x_{12} } & \cdots & {x_{1m} } \\ {x_{21} } & {x_{22} } & \cdots & {x_{2m} } \\ \vdots & \vdots & \ddots & \vdots \\ {x_{n1} } & {x_{n2} } & \cdots & {x_{nm} } \\ \end{array} } \right]$$
(2)

Normalize the matrix X to obtain the normalized matrix Xs:

$$X_{s} = \left[ {X - \left( {1,1, \ldots ,1} \right)^{{\text{T}}} x_{av} } \right] \cdot {\text{diag}}\left( {\frac{1}{{s_{1} }},\frac{1}{{s_{2} }}, \ldots ,\frac{1}{{s_{m} }}} \right)$$
(3)

In (3), \(x_{av} = \left( {x_{av,1} ,x_{av,2} , \ldots ,x_{av,m} } \right)\), and sj is the corresponding standard deviation.

The coefficient matrix R is calculated as:

$$R = \frac{1}{m}X_{s}^{{\text{T}}} X_{s}$$
(4)

Find the m eigenvalues of X and arrange the eigenvalues from largest to smallest as \(\lambda_{1} ,\lambda_{2} , \ldots ,\lambda_{n}\) to obtain the arranged eigenvalue Mσ matrix as:

$$M_{\sigma } = {\text{diag}}\left( {\lambda_{1} ,\lambda_{2} , \ldots ,\lambda_{n} } \right)$$
(5)

The associated eigenvectors can be determined from the eigenvalues. In practical scenarios, the selection of principal component scores is guided by the cumulative contribution to the variance.

The decomposition of Xs yields:

$$X_{s} = T \times P^{T} + E_{s} ,T = X \times P$$
(6)

In Eq 6, P is the principal molecular space loading matrix, T is the score matrix, Es is the residual matrix.

K-fold cross-validation is another essential strategy to address overfitting. It involves dividing the dataset into K subsets, training the model on K-1 subsets, and validating it on the remaining subset. This process is repeated K times, with each subset serving as the validation set exactly once (Ref 36). The overall performance is then averaged, providing a more robust evaluation of the model’s ability to generalize to diverse data.

K-Nearest Neighbor (KNN)

Due to an imbalance in the number of standard and anomalous samples within the training dataset, the application of classification algorithms such as Support Vector Machine (SVM), Decision Tree (DT), and K-means may encounter challenges. However, the K-Nearest Neighbor (KNN) algorithm, being proximity-based, exhibits relative robustness in the context of imbalanced datasets. This resilience stems from its reliance on the distances between samples for classification (Ref 40, 41).

In the classification process, KNN computes the Euclidean distance between a new instance and instances in the training dataset. Given two n-dimensional samples: \(x_{i} = \left[ {x_{i}^{1} ,x_{i}^{2} , \ldots ,x_{i}^{n} } \right]\) and \(x_{j} = \left[ {x_{j}^{1} ,x_{j}^{2} , \ldots ,x_{j}^{n} } \right]\), the Euclidean distance dij is defined as follows:

$$d_{ij} = \sqrt {\mathop \sum \limits_{k = 1}^{n} (x_{i}^{k} - x_{j}^{k} )^{2} }$$
(7)

Subsequently, it selects the closest k instances as neighbors and predicts the class based on their classifications (Ref 42). Given that KNN employs Euclidean distance as the distance metric, the data used for classification can either be the original 12-dimensional dataset or the 2-dimensional dataset resulting from PCA dimensionality reduction.

Exploratory Factor Analysis (EFA)

In order to analyze characteristic dimensions that deviate significantly between the anomalous samples and the standard samples on the coating morphology, Exploratory Factor Analysis (EFA) was applied. EFA is a multivariate statistical method aimed at expressing the complex relationships between combinations of underlying factors by decomposing the covariance matrix between variables (Ref 43, 44). The matrix form is given by:

$$X = AF + \varepsilon$$
(8)

In (Eq 8),

$$X = \left[ {\begin{array}{*{20}c} {x_{1} } \\ {x_{2} } \\ \vdots \\ {x_{p} } \\ \end{array} } \right],A = \left[ {\begin{array}{*{20}c} {a_{11} } & {a_{12} } & \cdots & {a_{1m} } \\ {a_{21} } & {a_{22} } & \cdots & {a_{2m} } \\ \vdots & \vdots & \ddots & \vdots \\ {a_{p1} } & {a_{p2} } & \cdots & {a_{pm} } \\ \end{array} } \right],F = \left[ {\begin{array}{*{20}c} {F_{1} } \\ {F_{2} } \\ \vdots \\ {F_{m} } \\ \end{array} } \right],\varepsilon = \left[ {\begin{array}{*{20}c} {\varepsilon_{1} } \\ {\varepsilon_{2} } \\ \vdots \\ {\varepsilon_{p} } \\ \end{array} } \right]$$
(9)

We denote F as the common factors or latent factors of X, A as the factor loading matrix, and ε as the unique factors of X. A = aij, where aij represents the factor loading. Mathematically, it can be proven that the factor loading aij is the correlation coefficient between the i-th variable and the j-th factor, reflecting the importance of the i-th variable on the j-th factor.

In the algorithmic principle of the method, the eigenvalues of the correlation matrix correspond to the ideally chosen coordinate axes, while the eigenvectors represent the orientation of the data in the rotated coordinate system. The eigenvalues reflect the variance of the data in the corresponding dimension, while larger eigenvalues imply that the corresponding eigenvector direction contains more information. By analyzing the factor loadings of factors with large eigenvalues, it is possible to calculate the extent to which the original data contributes to the underlying factors, thus better describing the variability in the data (Ref 45, 46). By combining the Gaussian feature covariate characterization relationship with the factor loadings generated by the EFA model, the factor analysis can be performed by combining data from different sample sets, e.g., standard samples and anomalous samples under a certain deviation process, to find out the feature dimensions of the deviation process that have the greatest potential impact on the original data, and then to derive the dimension that has the greatest potential impact of each deviation process on the physical changes of the process, and thus to build a model of the relationship among the parameters of the plasma spraying process, the process, and the coating’s three-dimensional topographic characteristics (shown in Fig. 7) in order to predict and optimize the coating’s morphological characteristics, as well as to achieve better process control and quality improvement.

Fig. 7
figure 7

Relationships between plasma spray process parameters and plasma spray process and the 3D morphological characteristics of coatings

Results and Discussion

The Preparation of the Testing Sample Set

In order to use fewer test samples to reflect the morphological characteristics of the standard and anomalous classes of coatings, and then verify the effectiveness of the diagnostic model, Fig. 8 shows the set of test samples, including 2 samples of standard class (TS-1-TS-2), 1 sample of burnt nozzle (TB-1), 1 sample of old nozzle exceeding the service life (TO-1), 2 samples of abnormal powder carrier gas flow (TF-1-TF-2), 2 samples of abnormal main gas flow (TA-1-TA-2), and 2 samples of abnormal plasma gun power (TP-1-TP-2), making 10 cases in total (For the specific process parameter settings of the test set, refer to the training set parameters in Table 1, taking the median of the upper and lower deviation ranges for each type of anomaly sample).

Fig. 8
figure 8

Test sample set

Classification Prediction Algorithm Validation

Firstly, to mitigate the impact of different scales within the dataset, all sample data is standardized using the z-score. In order to select appropriate principal components that effectively eliminate redundant information in multidimensional data and prevent model overfitting, the covariance matrix of the standardized dataset is computed. The eigenvalues are then arranged in descending order, and the cumulative variance contribution corresponding to each eigenvalue (principal component) is illustrated in Fig. 9. Choosing 2 to 3 principal components captures a substantial portion of the information (variance contribution). For a more concise extraction of meaningful information and ease of visualization, 2 principal components are selected.

Fig. 9
figure 9

Cumulative Variance Contribution of Principal Components (Eigenvalues of the Covariance Matrix)

Figure 10 shows the test set of the PCA and KNN, through the PCA method, with sample data from 12-dimensional features down to two-dimensional features. Through the visualization, as shown in Fig. 10(a), which reflects the distribution of the training samples and test samples of each type after the reduction to 2 dimensions, there is a certain clustering tendency in the two-dimensional data distribution of the standard class and anomalous class samples after the reduction of dimensionality. Therefore, there is not only a big difference in the microscopic morphology of coatings, but there is also a large amount of information even after the digitization and data dimensionality reduction, so it is feasible to digitalize the samples, and the PCA algorithm is well validated.

Fig. 10
figure 10

Test set PCA and KNN

Using the data after dimensionality reduction to validate the KNN algorithm, Fig. 10(b) shows the data visualization of the PCA-KNN test sample set. The species of the three training set samples closest to the test samples can be determined by calculating the Euclidean distance between the test samples and the training set samples, then prediction of the species of the test samples is available. After calculating, the accuracy of the KNN algorithm in predicting the test samples using the data after dimensionality reduction is 100%, and the accuracy of the prediction using the 12-dimensional original data before dimensionality reduction is 90%. Table 3 displays the actual categories, predicted results, and accuracy of the classification algorithm for each test sample.

Table 3 Actual categories of test samples and predicted results from the classification algorithm

Due to the relatively small sample size, potential overfitting may exist in this training set. The PCA-KNN model is tested using K-fold cross-validation, as illustrated in Fig. 11. For each fold, the confusion matrix is obtained, and performance metrics, including accuracy, precision, recall, and F1 score, are calculated. Specifically:

$${\text{Accuracy}} = \frac{{{\text{Number}}\;{\text{of}}\;{\text{correct}}\;{\text{predictions}}}}{{{\text{Total}}\;{\text{number}}\;{\text{of}}\;{\text{predictions}}}}$$
(10)
$${\text{Precision}} = \frac{{{\text{True}}\;{\text{positives}}}}{{{\text{True}}\;{\text{positives}} + {\text{False}}\;{\text{positives}}}}$$
(11)
$${\text{Recall}} = \frac{{{\text{True}}\;{\text{positives }}}}{{{\text{True}}\;{\text{positives}} + {\text{False}}\;{\text{negatives}}}}$$
(12)
$${\text{F1 - score}} = 2 \times \frac{{{\text{Precision}} \times {\text{Recall}}}}{{{\text{Precision}} + {\text{Recall}}}}$$
(13)
Fig. 11
figure 11

(a) to (e): PCA-KNN model is tested using F-fold cross-validation, with each fold producing a confusion matrix. (f): For each fold, the metrics of accuracy, precision, recall, and F1 score are calculated.

After conducting a 5-fold cross-validation, the PCA-KNN model demonstrated an average accuracy of 94.7%. Additionally, the other three relevant metrics exhibited high values, suggesting its capability to perform well across various data subsets. These results highlight the model’s robust generalization ability and feasibility for practical applications.

Process Feedback Method

It can be known from the principle of the EAF model in the “Method of analysis” section of this article, the eigenvalues of the covariance matrix represent a measure of the variance of the data in the corresponding dimension on the rotated coordinate system. The samples’ factor analysis of standard and deviation process (abnormal powder feed gas flow) is shown in Fig. 12, and in Fig. 12(a), it can be seen that larger eigenvalues correspond to eigenvectors in the direction of data with greater variability, and four factors can cover the majority of the amount of information of the original data. Combining the magnitude of the eigenvalues of the selected factors and the factor loadings generated by the EFA model (shown in Fig. 12b), it is possible to get the raw characteristics of the main effects of this deviant process.

Fig. 12
figure 12

Factor analysis of samples from standard and deviant processes (abnormal powder feed gas flow), ((a) Plot of factor number and eigenvalue relationship. (b) Heat map of EFA factor loadings)

The ranking of the influence of feature parameters under different process parameters based on the EFA model is shown in Fig. 13, and as shown, it analyzes all the factors of anomalous samples and standard samples under deviated processes to get the relationships between deviated process parameters and Gaussian peak feature parameters of the coating morphology, which can reveal the influence of different types of deviated process samples on the coating morphology and establish a correlation model between them (the radial data in Fig. 13 indicates the ranking of the contribution of the feature parameters).

Fig. 13
figure 13

Ranking of influence of characteristic parameters under different process parameters based on EFA model

In practical engineering applications, with the comprehensive prediction classification algorithm and the EFA model, before workpiece spraying, the fixed-point APS process can be used to produce test specimens, according to the PCA-KNN algorithm to determine whether the process meets the requirements of the standard process. If the requirements are met, the workpiece can be coated. Otherwise, it is necessary to obtain the original features with the largest difference between the test piece and the sample under the standard process, and combine the relationship model of process parameters-process-coating three-dimensional morphology characteristics established by EFA to adjust the relevant spraying process parameters until the new test piece is judged to meet the standard process.

Conclusions

Based on machine learning algorithms, this study proposes a process quality diagnosis method for thermal barrier coatings using macroscopic three-dimensional surface features obtained through atmospheric plasma spraying. The main conclusions are as follows:

  1. 1.

    After reconstructing the three-dimensional morphology of one-dimensional coating samples, the cross sectional morphology exhibits a Gaussian peak distribution. The study establishes a correlation between Gaussian peak feature parameters and process characteristics, enabling a digital analysis of feature parameters for standard and anomalous process samples, with extraction of typical feature data for analysis.

  2. 2.

    The effectiveness of the classification algorithm was assessed, with the PCA-KNN algorithm demonstrating high accuracy in the classification and diagnosis of process quality for coated samples. Following K-fold cross-validation, it achieved an average accuracy of 94.7%.

  3. 3.

    By standardizing the coating material, equipment, and developing a standard process, this study utilizes exploratory factor analysis to establish a correlation model between process parameters and coating characteristic morphology, enabling the diagnosis of process abnormalities. Furthermore, through additional data accumulation and secondary development, the model can be swiftly applied to different plasma spraying coatings, facilitating the rapid diagnosis of coating abnormalities in the mass production process.