1 Introduction

The manufacturing process has to expand its capability to meet the demand that arises from the increasing population and the preferences of individuals. Intricate, dynamic, and chaotic behaviours impose constraints on the manufacturing system [1]; hence, manufacturing high-quality products efficiently with the available resources is essential. One of the important objectives is manufacturing the product at the lowest possible cost [2].

The global landscape of manufacturing presents many challenges that require strategic planning and innovative solutions. These challenges include the adoption of advanced manufacturing technology to improve efficiency and competitiveness. They also include the growing importance of manufacturing high-value products to meet expanding market needs, making it essential to leverage advanced knowledge, information management, and artificial intelligence systems for sustainable growth [3]. Sustainable manufacturing practices and products are highly important in meeting environmental interests and assuring long-term viability. Moreover, the agility and adaptability of enterprise capabilities and supply chains are essential for responding to shifting market conditions [4].

Innovation in products, services, and processes is a driving force for staying ahead in a competitive market, while close collaboration between industry and research advances the implementation of modern techniques and encourages continuous improvement. Adopting a new manufacturing blueprint is essential for navigating the difficulties of the modern industrial landscape and attaining sustainable progress in a fast-growing global economy [5].

The traditional manufacturing system finds it difficult to meet the demands of contemporary industry. This is due to the inherent limits of the traditional system in manufacturing complex parts: the elegant design and intricate geometry of modern products exceed the capability of traditional manufacturing processes [6, 7]. Additive manufacturing emerges as a game-changing solution to this problem. It offers extraordinary adaptability and precision in manufacturing components that were impossible or challenging to produce through traditional processes, by stacking material to build up parts based on a digital design [8].

This technology transforms the manufacturing of complicated components across industries by not only overcoming the limitations of traditional manufacturing but also generating new opportunities for customized components and on-demand production. With the development of additive manufacturing, manufacturing capabilities are stepping into a new age marked by creativity and competence rather than complexity acting as a hurdle [9].

Additive manufacturing builds 3D objects from digital blueprints by layer-by-layer deposition. Fused deposition modeling (FDM) is one of the most affordable and widely available techniques among the additive manufacturing processes. The materials primarily used to manufacture intricate components are metal powders, polymers, and resins; FDM itself makes use of thermoplastic and thermoplastic-elastomer materials [10,11,12]. FDM is a convenient and economical way to manufacture prototypes and complex functional components quickly, which makes it a strong choice for small-scale industries, educational institutes, and researchers, and highly useful for investigating the possibilities of additive manufacturing. Installing an FDM facility requires a low investment, with minimal material waste, ease of use, and a simple setup [13,14,15]. FDM is a valuable technique in the field of additive manufacturing because it ensures a good level of adaptability and reliability despite its affordability, and it finds a wide range of applications across sectors [16, 17].

There are numerous process parameters in the FDM process, and the complex relations between them strongly influence the quality and functionality of the component. The filament extrusion temperature, deposition speed, layer height, and infill density play an important role in determining the final product [18, 19]. The overall quality and consistency of the printed products are also governed by ambient temperature and humidity. Bed-levelling calibration and the distance between the nozzle and the bed have an equal influence; a thorough knowledge of the FDM process and the ability to meticulously balance these many variables are needed to steer this complex set of parameters and obtain the desired final product [20, 21]. FDM users can fully realize the potential of this process by carefully adopting and optimizing these parameters, resulting in the manufacture of high-quality, functional parts that meet the requirements of many industries and applications [22, 23]. Hence, an understanding of these process parameters is essential for successful manufacturing.

Traditional modelling approaches are often challenged by the intricate interconnections between the abundance of parameters involved in FDM printing and the end product’s attributes. The complex and non-linear nature of these interconnections presents a major hurdle for manufacturers trying to streamline their operations and reliably produce high-quality parts. However, the development of machine learning algorithms provides a way out of this conundrum. These sophisticated algorithms can capture the complex interconnections between the different printing process parameters, material attributes, and the intended results by employing the power of data-driven learning.

Machine learning models are capable of revealing patterns and insights that would be challenging to identify with conventional mathematical relations by examining large datasets. This creates new possibilities for additive manufacturing in terms of consistency, quality, and efficiency by permitting businesses to fine-tune their FDM processes with unprecedented accuracy. Machine learning is a rapidly evolving area, and its combination with additive manufacturing has huge potential to help industries push the limits of 3D printing and manage the complexity of this ever-changing environment. Most of the available research focuses on developing machine learning models for predicting properties; however, few works have examined the effect of different hyperparameter optimization techniques on improving the metrics for better prediction. Hence, an initiative is taken through this research work to develop machine learning models, using the GridSearchCV and Optuna hyperparameter optimization techniques, to understand the behaviour of process parameters and predict the compression strength of additive-manufactured specimens.

2 Machine learning in manufacturing

The industrial sector is moving through a revolutionary period as an outcome of the advent of the digital age, and the implementation of machine learning algorithms is now crucial to growth and innovation.

These sophisticated computational tools have become the brains of the modern shop floor, creating a harmonious interplay between process optimization, quality control, and predictive maintenance through the combination of data and algorithms. The flexibility of machine learning has become a driving force in the pursuit of manufacturing excellence, from the accuracy of supervised learning models in foreseeing product attributes to the exploratory ability of unsupervised models in disclosing concealed patterns [24, 25].

The manufacturing, testing, and adaptability of the machine learning process are shown in Fig. 1. These data-driven algorithms have become the industry’s compass, opening new routes to effectiveness, agility, and resilience as manufacturers navigate the challenges of intense international competition and constantly shifting consumer demands. In this new era, the possibilities are vast, and the products of the future will be manufactured with the accuracy and insight that only machine learning can provide, made possible by the unification of human knowledge and machine intelligence [26, 27].

Fig. 1

Flow of manufacturing, testing, and machine learning

Machine learning algorithms are broadly grouped into linear and non-linear algorithms. Linear algorithms are founded on the assumption that there is a linear relationship between the input features and the target variable; to enable simpler interpretation and forecasting, they seek the linear function that best fits the data. Non-linear algorithms do not assume a linear relationship between the input features and the target variable and can model more complicated, non-linear patterns in the data.

ML models are capable of carrying out operations including dimensionality reduction, clustering, regression, and classification [28, 29]. In this research work, linear regression, a well-established technique from the linear-models category, has been used to serve as a baseline for prediction. In addition, a variety of non-linear models, namely support vector regression (SVR), decision trees, XGBoost, AdaBoost, and CatBoost, have been employed to capture the potentially intricate relationships between the influencing factors and the compression behaviour of additively manufactured PLA material.
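A minimal sketch of how these regressors might be instantiated in Python is given below. The libraries (scikit-learn, XGBoost, CatBoost) and the default settings are assumptions for illustration rather than the exact configuration used in this work.

```python
# Illustrative sketch (not the authors' exact code): instantiating the six
# regressors considered in this study.
from sklearn.linear_model import LinearRegression
from sklearn.svm import SVR
from sklearn.tree import DecisionTreeRegressor
from sklearn.ensemble import AdaBoostRegressor
from xgboost import XGBRegressor
from catboost import CatBoostRegressor

models = {
    "Linear Regression": LinearRegression(),
    "SVR": SVR(kernel="rbf"),
    "Decision Tree": DecisionTreeRegressor(random_state=42),
    "AdaBoost": AdaBoostRegressor(random_state=42),
    "XGBoost": XGBRegressor(random_state=42),
    "CatBoost": CatBoostRegressor(verbose=0, random_state=42),
}
```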

2.1 Linear regression

Linear regression is a fundamental statistical method used in machine learning and data science to predict a continuous outcome variable based on one or more predictor variables. It gets its name as it assumes a linear relationship between the input variables and the single output variable. It mathematically models the unknown or dependent variable and the known or independent variable as a linear equation. Linear regression models are relatively simple and provide an easy-to-interpret mathematical formula to generate predictions. Linear regression is an established statistical technique that is easily applied to software and computing. Many fields, including biology and the behavioural, environmental, and social sciences, employ linear regression to conduct preliminary data analysis and predict future trends. Many data science methods, such as machine learning and artificial intelligence, use linear regression to solve complex problems [30].
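For reference, with predictor variables x_1, …, x_n the model described above takes the familiar form

$$y = \beta_0 + \beta_1 x_1 + \beta_2 x_2 + \cdots + \beta_n x_n + \varepsilon,$$

where the coefficients \(\beta_i\) are typically estimated by ordinary least squares, i.e. by minimizing the sum of squared residuals between the predicted and observed values.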

2.2 Support vector regression (SVR)

Support Vector Regression (SVR) is a type of Support Vector Machine (SVM) that is used for regression problems. Unlike linear regression, which aims to minimize the error between predicted and actual values, SVR aims to find a function that deviates from the actual observed values by no more than a specified margin. SVR uses the concepts of a hyperplane and a margin, but their definitions differ from those used in classification. In SVR, the margin is defined as the error tolerance of the model, also called the ε-insensitive tube. This tube allows some deviation of the data points from the hyperplane without it being counted as error. The hyperplane is the best possible fit to the data that fall within the ε-insensitive tube. SVR can be mathematically formulated as a convex optimization problem whose objective is to find a function f(x) that is as flat as possible while having a maximum deviation of ε from the actual targets for all the training data. The flatness of the function implies that it is less sensitive to small changes in the input data, which reduces the risk of overfitting [31].
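In its standard soft-margin form, this convex optimization problem can be written as

$$\min_{w,\,b,\,\xi,\,\xi^*}\;\frac{1}{2}\lVert w\rVert^2 + C\sum_{i=1}^{n}\left(\xi_i + \xi_i^*\right)$$

subject to

$$y_i - \langle w, x_i\rangle - b \le \varepsilon + \xi_i,\qquad \langle w, x_i\rangle + b - y_i \le \varepsilon + \xi_i^*,\qquad \xi_i,\ \xi_i^* \ge 0,$$

where C controls the trade-off between the flatness of f(x) and the tolerance for deviations larger than ε, and the slack variables ξ_i, ξ_i* measure how far a point lies outside the ε-insensitive tube.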

2.3 Decision tree (DT)

A decision tree is a non-parametric supervised learning algorithm which is utilized for both classification and regression tasks. It has a hierarchical tree structure, which consists of a root node, branches, internal nodes, and leaf nodes. Decision tree learning employs a divide-and-conquer strategy by conducting a greedy search to identify the optimal split points within a tree. This process of splitting is then repeated in a top-down, recursive manner until all or the majority of records have been classified under specific class labels.

Pruning techniques in decision trees are essential to enhance the model’s generalization capability and prevent overfitting, which occurs when the tree captures noise in the training data rather than the underlying patterns. Pruning can be categorized into two main types: pre-pruning and post-pruning. Pre-pruning, also known as early stopping, involves halting tree growth at an early stage by setting conditions such as a maximum tree depth, a minimum number of samples required to split a node, or a minimum number of samples required at a leaf node. By imposing these constraints, pre-pruning reduces the complexity of the model and thus mitigates overfitting. Post-pruning, on the other hand, allows the tree to grow to its full depth and then removes nodes that contribute little to the predictive power of the model. This is done by evaluating the impact of removing certain branches on the model’s performance, typically using criteria such as cost-complexity measures. Post-pruning techniques include reduced-error pruning, which removes nodes if their absence does not reduce model accuracy on a validation set, and cost-complexity pruning, which prunes the tree by considering a trade-off between the complexity of the tree and its fit to the data. Both pre-pruning and post-pruning aim to strike a balance between model complexity and predictive accuracy, ensuring the decision tree remains interpretable while effectively generalizing to unseen data [32].
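A brief sketch of both pruning styles with scikit-learn is shown below; the data and parameter values are placeholders used only for illustration, not settings from this study.

```python
# Illustrative sketch: pre-pruning via growth constraints and post-pruning
# via cost-complexity pruning in scikit-learn (toy data, placeholder values).
from sklearn.datasets import make_regression
from sklearn.tree import DecisionTreeRegressor

X_train, y_train = make_regression(n_samples=100, n_features=4, noise=5.0,
                                   random_state=42)

# Pre-pruning: constrain growth before the tree is built.
pre_pruned = DecisionTreeRegressor(max_depth=4, min_samples_split=5,
                                   min_samples_leaf=2,
                                   random_state=42).fit(X_train, y_train)

# Post-pruning: compute the cost-complexity pruning path of a full tree,
# then refit with a non-zero ccp_alpha (ideally chosen via cross-validation).
path = DecisionTreeRegressor(random_state=42).cost_complexity_pruning_path(
    X_train, y_train)
alpha = path.ccp_alphas[len(path.ccp_alphas) // 2]
post_pruned = DecisionTreeRegressor(ccp_alpha=alpha,
                                    random_state=42).fit(X_train, y_train)
```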

2.4 XGBoost

XGBoost (eXtreme Gradient Boosting) is an advanced implementation of the gradient-boosting machine learning algorithm designed for speed and performance. Developed by Tianqi Chen, XGBoost provides an efficient and scalable framework for tree boosting, which is particularly powerful for structured/tabular data [33]. The algorithm uses an ensemble of decision trees to improve predictive accuracy through iterative boosting, where each new tree corrects errors made by the previous ones. Key features of XGBoost include parallelization for faster computation, handling of missing values, and regularization techniques to prevent overfitting. Its ability to manage large datasets and support custom optimization objectives and evaluation criteria makes XGBoost a preferred choice for many data scientists and machine learning practitioners [34].
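The snippet below sketches how such a regressor might be configured; the parameter values are placeholders, not the tuned settings reported later in this work.

```python
# Illustrative sketch: an XGBoost regressor with the speed- and
# regularization-related settings discussed above (placeholder values).
from xgboost import XGBRegressor

xgb_model = XGBRegressor(
    n_estimators=200,      # number of boosted trees
    learning_rate=0.1,     # shrinkage applied to each tree's contribution
    max_depth=3,           # shallow trees to limit overfitting
    reg_alpha=0.1,         # L1 regularization on leaf weights
    reg_lambda=1.0,        # L2 regularization on leaf weights
    n_jobs=-1,             # parallel tree construction
    random_state=42,
)
# xgb_model.fit(X_train, y_train)  # assumes training data are available
```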

2.5 AdaBoost

AdaBoost, short for Adaptive Boosting, is an ensemble learning algorithm that combines multiple weak classifiers to form a strong classifier. Developed by Yoav Freund and Robert Schapire in 1996, AdaBoost works by sequentially training weak learners, typically decision trees with a single split (decision stumps), on the weighted versions of the dataset [35]. After each iteration, the algorithm adjusts the weights of incorrectly classified instances, increasing their importance in the next round. This process helps subsequent classifiers focus on the harder-to-classify instances. AdaBoost’s ability to improve the performance of weak learners while maintaining simplicity and interpretability has made it a widely used technique in machine learning. However, it is sensitive to noisy data and outliers, which can significantly affect its performance.
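A minimal sketch of the regression variant with decision-stump base learners is given below; the values are illustrative only.

```python
# Illustrative sketch: AdaBoost built on shallow decision-tree base learners
# (decision stumps), as described above; requires scikit-learn >= 1.2 for the
# 'estimator' keyword.
from sklearn.ensemble import AdaBoostRegressor
from sklearn.tree import DecisionTreeRegressor

ada_model = AdaBoostRegressor(
    estimator=DecisionTreeRegressor(max_depth=1),  # decision stump
    n_estimators=100,      # number of sequentially trained weak learners
    learning_rate=0.5,     # contribution of each weak learner
    random_state=42,
)
```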

2.6 CatBoost

CatBoost (Categorical Boosting) is a gradient-boosting algorithm specifically designed to handle categorical features effectively. Developed by Yandex, CatBoost aims to provide high performance and ease of use, particularly for datasets with a significant number of categorical variables [36]. Unlike traditional gradient-boosting algorithms, which require extensive preprocessing of categorical data, CatBoost automatically processes categorical features during training, thereby reducing the need for manual feature engineering. This is achieved through techniques like target-based statistics and efficient handling of categorical splits. CatBoost also includes features like ordered boosting, which mitigates overfitting and is known for its robustness and speed [37]. Its ability to handle categorical data without extensive preprocessing makes it particularly valuable for tasks involving tabular data with mixed types of features.
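The sketch below shows how ordered boosting and categorical handling might be requested; the parameter values and the column name 'infill_pattern' are hypothetical.

```python
# Illustrative sketch: a CatBoost regressor with ordered boosting; the
# categorical column name passed to cat_features is hypothetical.
from catboost import CatBoostRegressor

cat_model = CatBoostRegressor(
    iterations=500,            # number of boosted trees
    learning_rate=0.05,        # step size of the boosting process
    depth=6,                   # depth of each tree
    boosting_type="Ordered",   # ordered boosting to mitigate overfitting
    verbose=0,
    random_state=42,
)
# If the infill pattern were kept as a string column, it could be declared as
# categorical so CatBoost encodes it internally:
# cat_model.fit(X_train, y_train, cat_features=["infill_pattern"])
```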

3 Materials and methods

PLA is utilized in a wide range of applications, including 3D printing, medical equipment, food packaging, injection moulding, and general prototyping. Its biodegradability and biocompatibility make it ideal for applications such as implanted devices.

The PLA used here was sourced from WOL 3D. PLA has been used for several small- and large-scale applications on account of its specific strength, and predicting its compressive strength will further broaden its usage in different fields. Some of the applications of PLA in which its compressive behaviour plays a vital role are brackets, load-bearing members, partitions, non-load-bearing walls, and interior wall panels. It is also used in scaffolds that hold medicine in place during healing.

The most popular type of additive manufacturing (AM) is material extrusion, and the most popular method of this type is desktop-scale, thermally driven fused deposition modelling [38]. A variety of process parameters are crucial in determining the quality and attributes of components additively manufactured using fused deposition modelling (FDM).

In this research work, PLA has been chosen as the candidate material. Samples were fabricated per ASTM D695 with dimensions of 15 × 10 × 5 mm [39, 40], using the PRATHAM 3.0 (India), a multi-purpose 3D printer. It comes with a silicone pre-heated build plate, which lowers model warpage during manufacture. The machine can rapidly produce complex geometries and components up to 300 mm × 300 mm × 300 mm in size. The fabrication of samples in the FDM printer is shown in Fig. 2. The UltiMaker Cura 5.8 slicing engine has been used for slicing.

Fig. 2

Fabrication of samples in FDM printer

3.1 Process parameters

These parameters include machine settings as well as printing parameters. Nozzle temperature, bed temperature, and nozzle diameter fall under the machine-settings category, while layer thickness, raster angle, infill percentage, build orientation, and printing speed fall under the printing-parameters category. Material composition, extrusion speed, and temperature are material (filament) features that also have a noteworthy influence on the printing outcomes. Part quality, mechanical characteristics, dimensional accuracy, surface finish, productivity, and energy efficiency are all critically affected by these complex relations. To achieve high quality and efficiency in FDM 3D printing, this complex set of process parameters has to be optimized successfully.

Among the many process parameters, infill density, infill pattern, raster orientation, and layer thickness have an extensive impact on 3D printing results. Figure 3 illustrates the selected process parameters, and Table 1 shows the variations of levels in the process parameters.

Fig. 3

Selected process parameters

Table 1 Process parameters and their variations

Further, these parameters have a noteworthy impact on the final product’s strength, durability, weight, material usage, print time, cost, finish, and printability. Understanding and adjusting these process parameters is therefore essential for manufacturing high-quality, efficient, and cost-effective components. This research work examines the complicated relations between these parameters and their effects on the compressive strength of the printed parts, and underscores the significance of rigorous process-parameter management and optimization.

3.2 Evaluation of compressive strength

The printed samples were subjected to compression tests. The tests were conducted on a Tinius Olsen universal testing machine fitted with a 50 kN load cell and connected to a data acquisition system for real-time data capture and storage. The crosshead speed was maintained at 0.5 mm/min. The testing of additive-manufactured PLA and the overall methodology followed in the research are shown in Figs. 4 and 5, respectively.

Fig. 4

Testing of additive-manufactured PLA

Fig. 5

Methodology of research work

4 Results and discussion

In this research work, PLA compression specimens were printed using the FDM technique. Infill density, infill pattern, layer height, and raster orientation were varied during the printing of the specimens. Linear regression, support vector regression, AdaBoost, XGBoost, CatBoost, and decision tree machine learning models have been employed to understand the relationship between the process parameters and to predict the ultimate compressive stress. Python code has been developed to compute the performance metrics without hyperparameter optimization, and the GridSearchCV and Optuna optimization techniques were then applied individually to each model. The measured compressive strength of the different process parameter combinations is shown in Table 2.

Table 2 Compressive strength of different process parameter combinations
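A minimal sketch of this evaluation workflow is given below; the file name and column names are hypothetical stand-ins for the dataset of Table 2, and the 'models' dictionary is assumed from the sketch in Sect. 2.

```python
# Illustrative sketch of the evaluation loop (not the authors' exact script):
# load the measured data, one-hot-encode the categorical pattern column,
# split, fit each model, and record MSE and R2. File/column names are
# hypothetical.
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_squared_error, r2_score

df = pd.read_csv("compression_data.csv")  # assumed file of measured results
X = pd.get_dummies(
    df[["infill_density", "infill_pattern", "layer_height", "raster_angle"]])
y = df["compressive_strength"]

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=42)

for name, model in models.items():  # 'models' as sketched in Sect. 2
    model.fit(X_train, y_train)
    pred = model.predict(X_test)
    print(name, mean_squared_error(y_test, pred), r2_score(y_test, pred))
```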

4.1 Effect of input parameters on compressive strength

The box plots (Fig. 6) reveal that most of the data points and the median for compressive strength are situated higher at a 0.1 mm layer height compared with the 0.2 mm and 0.3 mm layer heights. This indicates that when the material is printed with a 0.1 mm layer height, it has a higher compressive strength. The likely reason for the higher compressive strength is that stronger bonding is formed between the layers, which fuse well and yield the higher compressive strength; this results in good load-bearing capacity when subjected to compressive loading. When the layer thickness is increased (0.2 and 0.3 mm), the bonding between the layers may not be as good as in parts printed with the lower thickness (0.1 mm), which could be the reason for the decreased compressive strength. Overall, infill density has a positive correlation with compressive strength, and raster orientation does not have a noticeable effect on compressive strength. Among the line, cubic, and TriHexagonal infill patterns, the compressive strength is highest for TriHexagonal, followed by cubic and line.

Fig. 6

Effect of input parameters—Box plots
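A brief sketch of how such box plots might be generated is given below; it assumes the hypothetical dataframe 'df' and column names from the sketch in Sect. 4.

```python
# Illustrative sketch: box plots of compressive strength grouped by each
# process parameter (assumes the hypothetical dataframe 'df' from Sect. 4).
import matplotlib.pyplot as plt
import seaborn as sns

fig, axes = plt.subplots(2, 2, figsize=(10, 8))
params = ["layer_height", "infill_density", "infill_pattern", "raster_angle"]
for ax, col in zip(axes.ravel(), params):
    sns.boxplot(data=df, x=col, y="compressive_strength", ax=ax)
plt.tight_layout()
plt.show()
```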

Figure 7 presents the pair plots of the effect of the input parameters on compressive strength. Infill density, layer height, infill pattern, and raster orientation are all important aspects in determining the ultimate stress or strength of 3D-printed items and play a crucial role in determining the compressive strength of printed components. Infill density has the most apparent effect, with higher infill resulting in greater strength across all other parameter combinations. The influences of layer height, infill pattern, and raster orientation are more complicated and interrelated. Increasing layer height can somewhat reduce strength for some infill patterns, such as the line pattern, most likely due to weaker bonding between thicker layers.

Fig. 7

Effect of input parameters—pair plots

Nevertheless, more sophisticated infill patterns, such as TriHexagonal, show less fluctuation in strength across layer heights. The infill pattern has a significant effect, with TriHexagonal typically surpassing line infill in terms of ultimate stress. Raster orientation also has an effect, with 0 and 90 degree orientations yielding stronger parts than 45 degrees for the line infill; however, the TriHexagonal pattern is less affected by the raster angle. Overall, optimizing the combination of these 3D printing settings is critical for increasing the strength and performance of the finished part.

4.2 Hyperparameter optimization of GridSearchCV and Optuna

Table 3 shows the best hyperparameters found by GridSearchCV and Optuna. The best parameters from both optimizations of the decision tree algorithm indicate that the model performs well when the maximum depth is limited to 4 levels; going deeper leads to overfitting, while a shallower depth leads to underfitting.

Table 3 Best parameters (GridSearchCV and Optuna)

In support vector regression (SVR), C is the parameter that controls the regularization strength. Here, the value of C is 10 for GridSearchCV optimization, which is relatively high, indicating that the model should prioritize fitting the training data closely. Epsilon indicates the width of the epsilon-insensitive tube; its low value indicates a narrow tube, consistent with the behaviour indicated by the C parameter. Optuna discovered somewhat different values for C and epsilon than GridSearchCV did, suggesting that even small changes to these hyperparameters can have an impact on the model's performance.
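A sketch of a GridSearchCV search over these SVR hyperparameters is shown below; the grid values are examples, not the exact grid used in this work.

```python
# Illustrative sketch: exhaustive GridSearchCV over the SVR hyperparameters
# discussed above (example grid, cross-validated on mean squared error).
from sklearn.model_selection import GridSearchCV
from sklearn.svm import SVR

param_grid = {
    "C": [0.1, 1, 10, 100],       # regularization strength
    "epsilon": [0.01, 0.1, 0.5],  # width of the epsilon-insensitive tube
    "kernel": ["rbf", "linear"],
}
search = GridSearchCV(SVR(), param_grid, cv=5,
                      scoring="neg_mean_squared_error", n_jobs=-1)
# search.fit(X_train, y_train)   # assumes training data are available
# print(search.best_params_)
```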

For AdaBoost, the learning rate controls the contribution of each model to the final combination. More aggressive boosting is the outcome of a higher learning rate, whereas more conservative boosting is the result of a lower learning rate. The number of weak learners (base models) to be trained successively is determined by the number of estimators (n_estimators). An excess of estimators may cause overfitting, whilst an insufficient number could cause underfitting. Optuna discovered different values for the learning rate and the number of estimators, indicating that it explored a different hyperparameter space and discovered a different setup that reduced the objective function.
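The sketch below illustrates how such an Optuna study might be set up for AdaBoost's learning rate and number of estimators; the search ranges are examples, and the training data are assumed from the earlier sketch.

```python
# Illustrative sketch: an Optuna study tuning AdaBoost's learning rate and
# number of estimators via cross-validated MSE (example ranges; assumes
# X_train and y_train from the earlier sketch).
import optuna
from sklearn.ensemble import AdaBoostRegressor
from sklearn.model_selection import cross_val_score

def objective(trial):
    model = AdaBoostRegressor(
        n_estimators=trial.suggest_int("n_estimators", 50, 500),
        learning_rate=trial.suggest_float("learning_rate", 0.01, 1.0, log=True),
        random_state=42,
    )
    score = cross_val_score(model, X_train, y_train, cv=5,
                            scoring="neg_mean_squared_error").mean()
    return -score  # Optuna minimizes the returned MSE here

study = optuna.create_study(direction="minimize")
# study.optimize(objective, n_trials=100)
# print(study.best_params)
```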

XGBoost has settings for learning rate and number of estimators, just like AdaBoost. To prevent overfitting, the learning rate is moderated, and the maximum depth of the trees is set to 3, which indicates a relatively shallow tree structure. In contrast to GridSearchCV, Optuna identified a different set of hyperparameters, suggesting that it may have searched a larger area and found a setup that more successfully minimized the objective function.

The parameters of CatBoost are learning rate, depth, and number of iterations. Like max_depth in other models, depth regulates the trees’ depth. During the optimization process, the step size is governed by the learning rate. In comparison to GridSearchCV, Optuna discovered distinct values for depth, l2_leaf_reg, border_count, learning rate, and iterations, indicating a more thorough examination of the hyperparameter space.

Figure 8 shows the MSE and R2 values on the training data for each model with and without optimization. When compared to running the models without optimization, optimization approaches like GridSearchCV and Optuna have demonstrated considerable gains in the performance metrics (MSE and R2 score).

Fig. 8

MSE and R2 values of training for each model with and without optimization

Linear Regression was not subjected to optimization approaches. The performance of linear regression is consistent with both the test and train datasets. It is understood from the Figure that a lower MSE and a marginally higher R2 score suggest that Optuna optimization outperforms GridSearchCV optimization for Decision Tree performance on the test dataset.

With consistently high MSE and low R2 scores across all optimization techniques, SVR performs poorly when compared to other models. Moreover, optimization techniques do not significantly enhance SVR’s performance, suggesting that SVR may not be the best option for this dataset. AdaBoost performs better when using optimization techniques, especially Optuna, where it obtains lower MSE and higher R2 scores than with GridSearchCV. This suggests that AdaBoost’s performance can be improved by tuning its hyperparameters. Among all the models, CatBoost performs the best, attaining the lowest MSE and the highest R2 scores with both the Optuna and GridSearchCV optimization techniques.

This indicates that CatBoost performs excellently on this dataset and that optimization techniques, particularly Optuna, meaningfully improve its performance. The reasons for CatBoost’s higher performance are its resistance to overfitting, its ability to handle missing information, and its built-in regularization strategies.

Furthermore, these model-boosting algorithms iteratively enhance performance by concentrating on hard-to-predict cases. For all optimization situations, CatBoost performs marginally better than XGBoost in terms of MSE and R2 score, making it the optimal model choice for this dataset. Because CatBoost has better predictive performance and model fit than the other options, it would be the recommended model selection for this problem based on the results that have been supplied.

Figure 9 shows the comparison of MSE with respect to the test and train data to understand the performance. It is seen that the MSE on the test data for all models without optimization is higher, which indicates that the models perform worse on unseen data. However, the MSE is consistent across all models on the train data, which indicates that there is no considerable overfitting or underfitting. The MSE of the Optuna-optimized models is lower when compared with their non-optimized counterparts, indicating that the performance has improved.

Fig. 9

MSE comparison for training and unseen data

Since the MSE on the train data is lower when compared with the non-optimized values, it indicates that the train data have been fitted well when optimizing with Optuna. GridSearchCV optimization has, like Optuna, yielded lower MSEs for the test data compared to the non-optimized models. The training-data MSE values are comparable between Optuna and GridSearchCV, showing that both approaches fit the train data to a similar extent.

Figure 10 shows the R2 values of the train and test data with and without optimization. When the R2 values of all the models without optimization are compared, the optimized models show higher values.

Fig. 10

R2 comparison for training and unseen data

This indicates that, without optimization, the models capture only a portion of the variance in the test data. However, the R2 of the train data is consistent with that of the test data, confirming that there is no significant underfitting or overfitting. The R2 values of the Optuna-optimized models are significantly higher than their non-optimized counterparts, indicating that Optuna optimization improved the performance of the models significantly on unseen data. The R2 values for the train data are also slightly greater than those of the non-optimized models, which indicates that Optuna optimization fits the train data better than the non-optimized models. The R2 values of the GridSearchCV-optimized models were likewise significantly higher than their non-optimized counterparts on unseen data. However, the performance of GridSearchCV is slightly worse than that of Optuna optimization.

The residual plot (Fig. 11) shows the distribution of the predicted values for the different models. It is visible from the plot that the values for the linear regression model are scattered around the actual values, with the predicted values only sometimes close to the actual values. This denotes that the model has tried to capture the relation between the process variables but is ineffective. The decision tree model shows a similar trend, but its predicted values are much closer than those of the linear regression model; both models would need to be improved to attain better results. The SVR is a poor-performing model: the plot shows that its values are distributed at both extremes, which indicates that the model has failed to capture the relationship between the process variables. Hence, the SVR model is not suitable for the prescribed task with the given set of values. The ensemble models (AdaBoost, XGBoost, and CatBoost) have performed well with the given set of values, as seen from the plot. The AdaBoost and XGBoost models have predicted values close to the actual values but scattered, so these models need to be improved further to obtain better results. In the case of the CatBoost model, the predicted values are very close to the actual values, which is good; however, in a few cases the predicted and actual values are identical, which suggests overfitting. Hence, optimization techniques are needed to address this.

Fig. 11

Residual plot without optimization
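A sketch of how such predicted-versus-actual plots could be produced for each fitted model is shown below; it assumes the 'models' dictionary and test split from the earlier sketches.

```python
# Illustrative sketch: predicted-vs-actual scatter for each fitted model,
# used to inspect residual behaviour (assumes 'models', X_test, y_test from
# the earlier sketches).
import matplotlib.pyplot as plt

fig, axes = plt.subplots(2, 3, figsize=(12, 7))
for ax, (name, model) in zip(axes.ravel(), models.items()):
    pred = model.predict(X_test)
    ax.scatter(y_test, pred, s=15)
    lo, hi = y_test.min(), y_test.max()
    ax.plot([lo, hi], [lo, hi], "r--")   # perfect-prediction reference line
    ax.set_title(name)
    ax.set_xlabel("Actual (MPa)")
    ax.set_ylabel("Predicted (MPa)")
plt.tight_layout()
plt.show()
```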

Figure 12 shows the improvements made in several models with Optuna optimization. The improvement is visible from the plot, with the predicted values closer to the actual values than in the previous case. Improvement is seen even in the decision tree model, although its predicted values are still not close to the actual values. In the case of the SVR model, Optuna optimization does not affect the results: the predicted values remain scattered in a common pattern ranging from maximum to minimum, which confirms that the SVR model is not suitable for the current task and that the Optuna optimizer makes no difference to its results. Meanwhile, the ensemble models AdaBoost, XGBoost, and CatBoost have significantly improved their results. CatBoost is the best-performing model, as can be seen from the plot; its predicted values are close to the actual values, which indicates that the model has learned the relationship between the process variables and the output. The Optuna optimizer has improved the CatBoost model considerably compared with AdaBoost and XGBoost, and the plot shows that the CatBoost model’s predicted values are much closer to the actual values. Thus, it can be confirmed that CatBoost with the Optuna optimizer is the best-performing model.

Fig. 12

Residual plot with Optuna optimization

The residual plot (Fig. 13) with GridSearchCV shows an improvement in model performance. However, the improvement from GridSearchCV is not up to that of the Optuna optimizer. GridSearchCV has improved the performance of the ensemble models, particularly the CatBoost model; the other ensemble models, AdaBoost and XGBoost, have shown improvements but not to the same extent as CatBoost. From the plot, it is clear that the predicted values of the CatBoost model are close to the actual values, confirming that the model with the GridSearchCV optimizer has learnt the relationship between the process variables without memorizing the training data. Thus, the CatBoost model remains the best-performing model even with GridSearchCV. The SVR model remains the worst-performing model [41, 42], even with GridSearchCV, as it has failed to show improvements.

Fig. 13

Residual plot with GridSearchCV optimization

5 Conclusion

The increasing population and the need for customization and sustainability make additive manufacturing a leading technology in the field of manufacturing. The complex relationship between the process parameters defines the quality of additive-manufactured parts, and relying on experimentation alone to establish this relationship would be expensive. Hence, machine learning models have been used to learn from the physical experiments and to evaluate the metrics for predicting the compressive strength of additive-manufactured PLA material.

  • Infill density, infill pattern, raster orientation, and layer height were varied at three levels, and samples were printed. The highest compressive strength of 71.77 MPa was measured for 0.1 mm layer height, 100% infill density, 90-degree raster angle, and the line infill pattern.

  • Infill density has the most apparent effect, with higher infill resulting in greater strength across all other parameter combinations. The influences of layer height, infill pattern, and raster orientation are more complicated and interrelated.

  • Increasing layer height can somewhat reduce strength for some infill patterns, such as the line pattern, most likely due to weaker bonding between thicker layers.

  • Infill density has a positive correlation with compressive strength, and raster orientation does not have a noticeable effect on compressive strength.

  • Among the line, cubic, and TriHexagonal infill patterns, the compressive strength is highest for TriHexagonal, followed by cubic and line.

  • CatBoost consistently outperformed the other models, such as linear regression, decision tree, SVR, AdaBoost, and XGBoost, achieving the lowest mean squared error and the highest R2 score when both the Optuna and GridSearchCV optimization techniques were used.

  • Overall, a model’s performance can be strongly influenced by the chosen optimization technique, particularly for algorithms such as decision tree and AdaBoost. On the other hand, some models, such as SVR, have not gained much from optimization, while others, like XGBoost, are comparatively less affected by it.

  • To guarantee data quality and consistency, it is first necessary to invest in reliable data collection and preparation methods. This includes creating plans for handling outliers and missing data and investigating data augmentation approaches to broaden the range of data.

  • When developing and selecting ML models, researchers should take into account the intricacy of the issue at hand as well as the available data. They should also prioritise interpretable models whenever feasible and use regularisation approaches to avoid overfitting.