Introduction

Basella alba is an underutilized tropical leafy vegetable commonly found in India and Indonesia. The plant has high nutritional value and medicinal properties [13]. The leaves are high in phytochemicals like polyphenols, flavonoids, and antioxidants [21]. According to Shankul [20], the antioxidant potential of B. alba is greater than that of spinach and moringa leaves. Aside from being consumed as a leafy vegetable, the bioactive potential of B. alba has yet to be adequately investigated in terms of extraction, processing, or utilization in food formulations. Extraction is the primary process in analyzing the bioactive compounds present in leaves, and hence, identifying appropriate extraction methods is critical in increasing the yield of bioactive components. The methods of extraction include conventional methods like soxhlet extraction, maceration, and hydro-distillation and novel techniques like ultrasound-assisted extraction (UAE), microwave-assisted extraction (MAE), enzyme-assisted extraction (EAE), pulsed electric field extraction (PEFE), pressurized liquid extraction (PLE), cold plasma assisted extraction and supercritical fluid extraction (SCFE). The primary drawback associated with conventional methods is their lengthy processing time, requirement for high-purity solvents, low rate of solvent recovery, and high cost of extraction [18, 22].

UAE is a highly efficient and promising green technique that can be automated, thereby streamlining and optimizing processes to a great extent. UAE can help in attaining the objective of sustainable green chemistry and extraction through its environmentally friendly and energy-efficient approaches [3]. UAE has been observed to enhance extraction yields and process kinetics in comparison to conventional extraction methodologies, along with a reduction in extraction time, energy consumption, and environmental impacts. Moreover, the advancement of environmentally friendly technology has been motivated by the increasing public interest in chemical-free substances and the prevention of harmful effluents. Potential challenges associated with implementing and applying these technologies, such as capital investment and operational costs, must be considered [8]. UAE works on the principle of ultrasonication, which applies sound wave at a frequency > 20 kHz, resulting in acoustic cavitation. The cellular matrix of the plant will undergo various changes such as fragmentation, localized erosion, pore creation, shear force, enhanced absorption, and swelling index due to the collapse of cavitation bubbles and the influence of sound waves. This process can lead to the disintegration of the cellular structure into smaller fragments, thereby reducing the particle size and increasing the overall surface area [11]. Consequently, these changes facilitate higher mass transfer rates within the solid matrix boundary layer, enabling efficient solubilization of the bioactive component in the solvent [17]. Ultrasound technique has facilitated the total extraction of phenolic compounds within a short span of time, giving high reproducibility, less solvent consumption, and high purity yield with energy efficiency [3]. In addition to the extraction method used, the bioactive recovery is influenced by the characteristics of the solvent, its concentration during extraction, the feed-to-solvent (F:S) ratio, sonication amplitude, duty cycle, and extraction time. Alves et al. [2] reported that the UAE of bioactive components from Monteverdia aquifolia leaves required less time, temperature, and F:S ratio than the conventional method. Kadam et al. [9] also reported that UAE enhanced the recovery and yield of bioactive compounds from Ascophyllum nodosum (brown seaweed). They obtained extraction yields of 143.12 mgGAE gdb−1, 87.06 mg gdb−1, and 128.54 mg gdb−1 for total phenolics, fucose, and uronic acid, respectively, at optimized conditions of treatment time (25 min), acid concentration (0.03 M HCl) and amplitude (114 μm).

RSM is a widely employed mathematical and statistical approach used in the modeling and optimization of the bioactive compound extraction process. RSM not only characterizes the influence of independent variables but also captures their interaction effects. Numerous studies have employed RSM technique to optimize the extraction of polyphenolic compounds from various leaf sources, such as oregano leaves [18], mandarin leaves [19], and B.alba leaves [21], and have observed its efficacy in process modelling and optimization. AAN is a predictive modeling tool that is becoming increasingly important in recent times. This tool utilizes a multilayer perceptron, consisting of an input and output layer of processing units called nodes interconnected through one or more hidden layers. The model is developed through a training process, where the input parameters of a set of experimental records are presented to the input layer of the network. These values are then multiplied by a weighting factor and transmitted to the nodes of the initial hidden layer. The combined and transformed values from the initial layer are subsequently integrated and modified through a transfer function. The resulting output is weighted and passed on to the subsequent layer. The outcomes generated by the output layer provide predictions for the dependent variables of the model [15]. GA is an optimization technique rooted in the principle of “survival of the fittest” and facilitates the acquisition of optimum solutions using a well-defined fitness function. Hybrid ANN-GA technique has been reported to be effective in optimizing the process parameters of extraction of bioactive components from plant materials [21]

The potential impact of ultrasound-assisted extraction (UAE) conditions on the bioactive components of B. alba leaves has not yet been thoroughly investigated and examined. Consequently, a deliberate effort was undertaken to explore and examine the influence and effect of the UAE conditions on the extraction process and recovery of the bioactive components from the leaves of B. alba. This was followed by the implementation of process optimization techniques, specifically the RSM and ANN-GA methodologies, to enhance and optimize the extraction process of the bioactive components from the B. alba leaves. These findings will help in using sustainable methods for extracting bioactive compounds from B. alba leaves and enhance their applications, especially in the food and therapeutic sectors.

Materials and methods

Raw material

Fresh B. alba leaves procured from the local market in Kharagpur, West Bengal, India, was used for the study. The leaves were cleaned and microwave dried at 700 W to a moisture content of 7–8% (wb) in a Microwave-UV-US combined reactor system (Nu tech Analytical Instruments Pvt Ltd, India [13]. The dried leaves were ground into fine powder and sieved through a BS 36 sieve to obtain a uniform particle size, which was used for further analysis.

Chemicals and reagents

The chemicals and reagents used in the investigation were 2, 2-diphenyl-1-picrylhydrazyl (DPPH), 2.4.6-tris(2-pyridyl)-s-triazine (TPTZ), acetic acid, acetone, aluminum chloride, ethanol, ferric chloride, folin–ciocalteau reagent (FCR), hydrochloric acid, methanol, sodium acetate, sodium carbonate, sodium hydroxide, and sodium nitrite acquired from Himedia Laboratories (Mumbai, India) and Merck specialties private limited (Mumbai, India).

Ultrasound-assisted extraction

The UAE was carried out using a probe sonicator (ATP 500, Athena Technologies Pvt. Ltd., India) with a power output of 500 W and a frequency of 40 kHz. The experiments were carried out at varying solvent concentrations, treatment times, and amplitude percentages. The solvent used was 99% ethanol. These solvents were selected based on their extraction capacity for a wide range of raw materials [5]. In accordance with the regulations of the US Food and Drug Administration, ethanol is classified as a solvent of class 3 and can be used safely within the prescribed limits [18]. Feed to solvent ratio was fixed at 1:15 based on the previous study [21]. After the extraction process, the samples were centrifuged at 6000 rpm for 10 min. This was subsequently followed by the filtration of the samples via Whatman No. 4 filter paper. The resultant extract was collected and stored in a refrigerated environment until further analysis.

Experimental design

A three-level Box—Behnken design with 5 central points was used to optimise the UAE conditions. The independent variables of the study were solvent concentration, extraction duration, and amplitude. The limits (upper and lower) of each independent variable have been identified through preliminary experiments. The independent variables are presented in Table 1, along with their coded representations. The various experimental combinations (17 runs) are listed in Table 2. The impact of the independent factors on the responses, namely TPC, GAEAC, and TFC, was mathematically modelled utilizing the RSM and the ANN-GA methodology.

Table 1 The limits of independent variables used for the study
Table 2 Actual, RSM-predicted, and ANN-predicted values of the response variables

Biochemical analysis of extracts

Total phenolic content (TPC)

TPC determination was carried out according to Singleton et al. [23] with minor modifications. The aforementioned procedure facilitates the conversion of the reaction mixture from a yellow hue to a blue shade through the interaction between phenolic compounds and FCR in an alkaline environment. Redox reagent FCR reacts with polyphenols to form phospho-tungstic-phospho-molybdenum complex. In brief, 0.2 mL of the extract was mixed with 2.8 mL of distilled water, 0.5 mL of FCR (10 times diluted), and 0.75 mL Na2CO3 (20% w/v) and incubated in the dark for 90 min. A UV–Vis spectrophotometer was used to measure absorbance at 760 nm wavelength. Gallic acid was used as the standard, and TPC was calculated as mg of gallic acid equivalents (mgGAEg−1 sample) per dry weight of the sample [18].

Gallic acid equivalent antioxidant capacity (GAEAC)

In a test tube, 0.2 mL of the extract was mixed with 2.85 mL pure methanol and 0.95 mL DPPH solution. Subsequently, the test sample was incubated for 30 min in the absence of light. Employing a UV–Vis spectrophotometer, the absorbance at a specific wavelength of 517 nm was determined. For calibration, pure methanol was used. Gallic acid was used as the reference substance [13].

Total flavonoids content (TFC)

TFC was measured by taking 0.5 mL of diluted extract in a test tube, followed by the addition of 2 mL of distilled water and 0.15 mL of NaNO2. To this mixture, 0.15 mL of a 10% AlCl3 solution was added to it after 4–5 min. Following that, 1 mL of 1 M NaOH was added after waiting 5 min, then 1.2 mL of distilled water. The sample was incubated for 10 min. Using a UV–Vis spectrophotometer, the absorbance was determined at 510 nm wavelength. TFC was calculated as mg quercetin equivalents per g dry weight sample weight (mgQEg−1 sample) using quercetin as the standard [18].

RSM modelling and optimisation

Process optimization was carried out in order to maximize the values of the response parameters. In the study, the empirical regression equation (Eq. 1) was used to examine the dependent parameters and predict the optimal values.

$${y}_{j}= {\beta }_{0}+ \sum_{i=1}^{3}{\beta }_{i}{x}_{i}+ \sum_{i=1}^{3}{\beta }_{ii}{x}_{i}^{2}+ \sum \sum_{i<j=1}^{3}{\beta }_{ij}{x}_{i}{x}_{j}$$
(1)

where \({y}_{j}\) is the response variable, \({\beta }_{0}\), \({\beta }_{i}\), \({\beta }_{ii}\) and \({\beta }_{ij}\) are the coefficients of regression, and \({x}_{i}\) & \({x}_{j}\) are the coded form of independent parameters. Statistical software, Design Expert 11 (Stat-Ease Inc., Minneapolis, USA) was employed for analysis. The validation of the model was ensured by conducting experimental tests on the optimal conditions for UAE in triplicate.

ANN modelling and GA optimization

The process of ANN modeling was implemented by utilizing the neural network fitting tool provided by MATLAB (version R2020a, The Math Works, Inc., USA). The ANN model composed of three layers, namely the input layer, hidden layer, and output layer (Fig. 1). The number of neurons present in the input layer was taken as three (solvent concentration [X1], extraction time [X2], and amplitude percentage [X3]). Hidden layer was having ten neurons and the output layer had a single neuron representing the response variables. Each response, variable was trained individually. The transfer function for the hidden layer was tansig function and for the output layer was purelin function. The Levenberg–Marquardt (LM) back propagation technique, specifically the trainlm algorithm, was employed to train the data. The training dataset consisted of 70% of the total data, while 15% each was allocated for validation and testing purposes. The training continued until the network achieved the optimal R2 and MSE metrics values. Later on, the output values were computed using the weights and bias values obtained from the given equation.

$${y}_{i}=purelin\{{W}_{OH}\times transig\times \left({U}_{IH}\times {x}_{i}+TH\right)+TO\}$$
(2)

where xi is the input parameter; yi is the output parameter; WOH and UIH are the weights between different layers; TO and TH are the bias values of neurons.

Fig. 1
figure 1

Proposed ANN model and its various layers

The process of optimization using the Genetic Algorithm (GA) was conducted by utilizing the GA toolbox of MATLAB (version R2015a, The Math Works, Inc., USA). The GA optimization primarily focused on selecting specific parameters, namely the population type, which was represented as a double vector, the population size set at a value of 200, and the crossover fraction designated as 0.8. Furthermore, several additional factors were taken into consideration during the GA optimization process. These factors included the migration aspect, the creation function, the fitness scaling function, the selection function, the crossover function, and the mutation function. Specifically, forward migration, feasible population, rank, roulette, scattered, and adaptive feasibility were chosen for each respective factor. These selections were determined based on the research conducted by Abdullah et al. [1]. Additionally, a fitness function, denoted as 'f,' was formulated with the intention of maximizing all output responses. The specific representation of this fitness function is provided below.

$$f=-({Y}_{1}+{Y}_{2}+{Y}_{3})$$
(3)

where Y1, Y2, and Y3 indicate the predicted values of responses.

Statistical analysis

The statistical evaluation of the developed models was conducted using various metrics as specified in Eqs. 4, 5, 6, 7, 8, 9, 10 [7, 21]. In order to identify the best model for precisely representing the responses, the model with the lowest values for AAD, MSE, NMSE, RMSE, and NRSME, and the highest value of R2 was chosen.

$${R}^{2}=1-\frac{\sum \left({X}_{p}-{X}_{a}\right)}{\sum {\left({X}_{p}-{X}_{a}\right)}^{2}}$$
(4)
$$AAD=\frac{\sum |{X}_{p}-{X}_{a}|}{n}$$
(5)
$$MSE=\frac{\sum {\left({X}_{p}-{X}_{a}\right)}^{2}}{n}$$
(6)
$$NMSE=\frac{MSE}{{X}_{m}}$$
(7)
$$MPE=\frac{100}{n}\sum |\frac{{X}_{p}-{X}_{a}}{{X}_{p}}|$$
(8)
$$RMSE=\sqrt{\frac{\sum {\left({X}_{p}-{X}_{a}\right)}^{2}}{n}}$$
(9)
$$NMRSE=\frac{RMSE}{{X}_{m}}$$
(10)

where xp represents predicted data, xa represents experimental data, xm represents mean experimental data, and n is the number of experiments.

Results & discussion

Effect of process parameters on TPC

The changes in TPC during UAE are presented in Table 2. TPC values varied from 5.303 to 8.227 mgGAEg−1 in the extracted samples. The effect of independent variables on UAE of TPC from B. alba leaves is shown in RSM plots (Fig. 2). The highest value of TPC (8.227 mgGAEg−1) was obtained at 80% amplitude, 75% concentration, and extraction time of 10 min. The standard curve for the estimation of TPC & TFC was shown in Fig. 4. Regression analysis of the obtained data (Table 3) shows that the linear term of extraction time and amplitude percentage had a significant effect (p < 0.05) on TPC. The multiple linear regression equation (MLR) equation obtained showing the effect of process variables on TPC is presented in Eq. 11. The solvent concentration and extraction time interaction also showed a significant effect on TPC. The increase in TPC with increasing treatment time could be attributed to an increase in microbubble formation, which increased cell damage and resulted in the release of more phenolic compounds [10]. The extraction yield of TPC increased initially with amplitude percentage, after which it reduced. A similar trend has been reported by Alves et al. [2] during the UAE of Monteverdia aquifolia leaves. The change in amplitude influences several factors, including the intensification and/or restriction of sonochemical reactions, polymerization and/or depolymerization, chemical bond rupture, and intensification of undesirable interactions between extracted compounds, favouring the production of oxidizing agents, resulting in the degradation of phenolic compounds and limiting their antioxidant capacity [4]. Solvent concentration had significant negative quadratic effects (p < 0.05) on TPC. The TPC value increased with an increase in solvent concentration initially (up to 70%), after which it declined. Li et al. [14] noticed a similar trend in the UAE for phenolic chemicals extracted from Perilla leaves using ethanol as a solvent. They reported that higher water concentrations in ethanol–water combinations were shown to extract a greater proportion of polar phenolic chemicals. The model exhibited an insignificant lack of fit value [(p > 0.05) with satisfactory prediction (R2 = 0.94, Adj. R2 = 0.88].

Fig. 2
figure 2

Response surface plots showing the influence of process variables on ac TPC, df TFC, and gi AA

Table 3 Regression coefficients and other statistical parameters of the RSM model
$$\text{TPC }\left({mg GAEg}^{-1}\right) =8.04 -0.20A+0.47B+0.29C+0.57AB+0.18AC-0.14BC-{1.36A}^{2}+{0.09B}^{2}-{0.67C}^{2}$$
(11)

Total flavonoids content (TFC)

The values of TFC ranged from 4.139 to 8.124 mgQEg−1 in extracted samples. The changes in TFC with different processing conditions are presented in Table 2. All linear components (such as time, amplitude, and solvent concentration) had a highly significant (p < 0.01) effect on TFC, in accordance with the regression coefficients presented in Table 3. TFC showed a positive correlation with solvent concentration, and the values increased with time at higher concentrations. The solvent concentration and extraction time interaction also showed a significant positive effect (p < 0.05) on TFC. Solvent concentration and amplitude percentage showed significant negative quadratic effects (p < 0.05) on TFC. Extraction time had an insignificant linear effect on the extraction of TFC. The MLR equation obtained showing the effect of process variables on TFC is presented in Eq. 12.

$$TFC=7.71 +1.09A+0.36B+0.23C+0.68AB+0.35AC-0.30BC-{1.87A}^{2}-{0.09B}^{2}-{0.53C}^{2}$$
(12)

TFC increased with solvent concentration and amplitude percentage and then decreased. Increasing the amplitude of ultrasonic waves frequently leads to an augmentation in the sonochemical impacts, resulting in a greater yield in terms of TFC [3]. The decrease in TFC at 80% amplitude could be attributed to the acceleration of intracellular mass transfer and the degrading effect of ultrasound at elevated output power [6].

Gallic acid equivalent antioxidant capacity (GAEAC)

GAEAC values ranged from 0.285 to 0.627 mgGAEg−1 in the samples extracted under different treatment combinations. The highest value of GAEAC (0.627 mgGAEg−1) was obtained at a solvent concentration of 100%, a lower extraction time of 5 min, and an amplitude of 80%. Regression analysis demonstrated that the linear component of the solvent concentration and the extraction process's duration had a notable impact (p < 0.05) on GAEAC. Significantly negative quadratic effects (p < 0.05) were observed between solvent concentration and GAEAC. The model exhibited an insignificant lack of fit value (p > 0.05) with satisfactory prediction (R2 = 0.95, Adj. R2 = 0.88). The multiple linear regression (MLR) equation obtained showing the effect of process variables on GAEAC is presented in Eq. 13. The samples also observed a decrease in GAEAC values with higher treatment time. This is attributed to prolonged extraction time induced by detrimental oxidation reactions [12, 16].

$$GAEAC=0.55+0.10\text{A}-0.04\text{ B}-0.009\text{C}+0.04\text{AB}-0.003\text{AC}+0.0005\text{BC}-{0.06465\text{A}}^{2}+{0.01\text{B}}^{2}+{0.01\text{C}}^{2}$$
(13)

RSM optimisation

RSM was employed to optimize the extraction of bioactive compounds from B. alba leaves. The values of the responses analyzed under different ultrasonic-assisted extraction (UAE) conditions are presented in Table 2. The regression coefficients obtained for each model and other statistical parameters are summarized in Table 3. Additionally, Table 4 provides other relevant statistical parameters. The model’s accuracy was confirmed by its high R2 value. The adjusted R2 values are very similar to the R2 values, affirming a profound correlation between the predicted and actual values. Moreover, the coefficient of variation (CV) remained below 5% in every instance, thereby substantiating the reproducibility and precision of the model.

Table 4 Summary of the developed model’s statistical parameters

ANN modeling

Model fitting

The study used the LM algorithm to model the dependent variables, specifically TPC, TFC, and AA. In order to achieve this, the ANN framework employed 3, 10, and 1 neuron(s) in the input layer, hidden layer, and output layer, respectively. Figure 1 depicts a visual representation of this architecture. The best model was chosen based on the minimum MSE and maximum R2 value criterion. The validation performance for TFC, TPC, and AA, which were the factors under consideration, demonstrated the best results at epochs 4, 5, and 6, respectively. In order to ensure the highest level of rigor in the validation process, the 17 datasets were meticulously divided into three distinct sets during the experiment. This partitioning allowed for an accurate assessment of the models' performance. The training data set for TPC consisted of runs 4, 5, 6, 7, 8, 10, 11, 13, 14, 15, and 17, encompassing a comprehensive range of data points. The R2 values, representing the coefficient of determination, were calculated for the training, validation, and testing stages, resulting in values of 0.973, 0.931, and 0.964, respectively. Furthermore, the MSE values for these runs were determined to be 0.062, 0.056, and 0.030, providing additional insight into the accuracy of the models. Similarly, runs 2, 3, 4, 7, 8, 9, 11, 12, 14, 16, and 17, comprising 11 observations, were used for training in the case of TFC. A total of 3 observations were allocated for validation, while another three observations were designated for testing. The corresponding R2 values obtained for the training, validation, and testing sets were recorded as 0.999, 0.945, and 0.975, respectively. Moreover, the MSE values were calculated to be 0.039, 0.0326, and 0.464, respectively. Employing a similar approach, runs 1, 3, 5, 6, 7, 8, 11, 12, 13, 15, and 17 were employed for training in the modeling of AA. Again, three observations were allocated for validation, while three observations were set aside for testing. The R2 values obtained for the training, validation, and testing of AA were measured to be 0.851, 0.998, and 0.987, respectively. The corresponding MSE values were found to be 0.029, 0.020, and 0.007, respectively. Figure 3 depicts the performance and error histogram of the developed model. The optimization of the ANN model was carried out by comparing the output to the input data. During the optimization procedure, weights were used to minimize the error function. Equations 14, 15, 16, 17, 18, 19, 20, 21 show the values of weights and bias of the ANN model (Fig. 4).

Fig. 3
figure 3

Performance plot after training and error graph of a, d TPC, b, e TFC, and c, f AA of generated ANN model

Fig.4
figure 4

Standard curves for the estimation of a TPC and b TFC

$${U}_{a}=\left|\begin{array}{c}0.9152\\ -0.4139\\ -1.9691\\ 0.1437\\ -0.8037\\ -1.8087\\ -0.3770\\ -0.3313\\ 0.3005\\ 1.4639\end{array}\begin{array}{c}1.8404\\ 2.9872\\ -1.8980\\ -2.9160\\ -0.3515\\ 2.1159\\ 2.7250\\ -0.3246\\ -.09788\\ -2.1134\end{array}\begin{array}{c}2.2074\\ 0.0545\\ -1.2719\\ -0.7574\\ -2.8858\\ 1.1615\\ -1.2369\\ 2.9803\\ -2.8371\\ 1.5773\end{array}\right|$$
(14)
$${U}_{b}=\left|\begin{array}{c}-1.9170\\ -1.6195\\ -0.1049\\ 0.4369\\ 0.8438\\ -1.9486\\ -2.8187\\ -1.8039\\ 1.1874\\ -0.5843\end{array}\begin{array}{c}2.3105\\ 2.2493\\ -0.8751\\ -2.2135\\ 0.4724\\ 1.3766\\ -0.7150\\ -0.0612\\ -1.6173\\ 2.7436\end{array}\begin{array}{c}-0.1996\\ -0.1267\\ -2.7861\\ -2.1410\\ -2.8853\\ 1.8252\\ 1.4813\\ -2.5164\\ 2.0010\\ 1.1119\end{array}\right|$$
(15)
$${U}_{c}=\left|\begin{array}{c}-0.0194\\ 1.7138\\ -1.6759\\ -2.4119\\ -1.2534\\ 0.7769\\ -0.9260\\ 0.7927\\ -2.3907\\ 1.4252\end{array}\begin{array}{c}-2.6831\\ -2.5459\\ -0.6537\\ 1.3666\\ 2.0736\\ 2.5889\\ 2.0808\\ 1.9759\\ 1.7696\\ 2.0209\end{array}\begin{array}{c}1.3014\\ -0.4548\\ -2.6249\\ 0.8751\\ 1.7947\\ -1.3248\\ 1.7977\\ -2.1328\\ -0.5702\\ -1.3769\end{array}\right|$$
(16)
$${W}_{a}=\left|0.2532 0.0775 0.3010 0.4533 -0.8110 0.7551 -0.9713 -0.4114 -0.6402 0.8526\right|$$
(17)
$${W}_{b}=\left| 0.1679 0.3876 -0.3662 -0.3264 -0.4711 -0.4878 -0.8169 0.1664 0.0332 -0.1497\right|$$
(18)
$${W}_{c}=\left|-0.4168 0.5613 -0.5151 0.3661 -0.1071 0.3107 -0.6472 -0.2142 0.1231 0.2701\right|$$
(19)
$${TH}_{a}=\left|\begin{array}{c}-3.0162\\ 2.3459\\ 1.6757\\ -1.0054\\ 0.3351\\ -0.3351\\ -1.0054\\ -1.6757\\ 2.3459\\ 3.0162\end{array}\right| \quad {TH}_{b}=\left|\begin{array}{c}3.0593\\ 2.6877\\ 1.7362\\ -0.9382\\ -0.2928\\ -0.3825\\ -0.5757\\ -1.5204\\ 2.4399\\ -3.0067\end{array}\right| \quad {TH}_{b}=\left|\begin{array}{c}-3.0242\\ -2.2567\\ 1.4163\\ 1.1592\\ 0.3401\\ 0.0391\\ -0.9678\\ 1.6666\\ -2.3317\\ 3.1870\end{array}\right|$$
(20)
$${TO}_{a}=\left|-0.8636\right| \quad {TO}_{b}=\left|-0.4281\right| \quad {TO}_{c}=\left|0.4131\right|$$
(21)

GA optimization

With the aim of increasing the extraction efficacy of TFC, TPC, and AA, the GA optimization process was conducted. The process of optimization was repeated until the MSE and RSME values between the average and individual fitness values were substantially lowered. After the mutation, the optimization cycle continued, and if the intended solution was not achieved, the entire population that was obtained was once again used for reproduction, crossover, and mutation in subsequent cycles. After 96 iterations, the optimal parameters for the UAE were identified to be: solvent concentration of 63.01%, treatment period of 8.92 min, and amplitude percentage of 90. The outcomes predicted by the developed ANN model for the response variables can be observed in Table 5. Subsequently, the experimentally tested optimal process variables were determined. Upon measurement, the obtained responses were determined to be 7.601 mgGAEg−1 of TPC, 8.67 mgQEg−1 of TFC, and 0.671 mgGAEg−1 of AA. It is noteworthy that these experimental findings align well with the expected outcomes, thereby showcasing the precise prediction and optimization capabilities of the ANN-GA model.

Table 5 Optimal UAE conditions for maximum yield of bioactives from B. alba leaves

Conclusion

UAE is gaining popularity due to its efficacy in phytochemical recovery in less time and low-cost approaches. The technique works through the formation and collapse of cavitation bubbles, resulting in the breakdown and easy release of the cellular matrix to solvent. The influence of UAE conditions on the recovery and quantification of bioactive components from B. alba leaves was examined in this work. RSM and the ANN-GA approach were used to investigate, predict, and optimize the influence of UAE conditions on bioactive components. TFC, TPC, and AA, which were extracted using RSM, exhibited predicted values of 8.31 mgQEg−1, 8.23 mgGAEg−1, and 0.59 mg GAEACg−1, respectively. On the other hand, the predicted values for TFC, TPC, and AA obtained through ANN-GA were 8.52 mgQEg−1, 7.64 mgGAEg−1, and 0.67 mgGAEACg−1, respectively. Both models demonstrated a high level of predictability. However, the ANN model outpaced the RSM model due to its lower values of AAD, NSME, MSE, RSME, NRSME, and MPE, as well as its higher R2 score. Accordingly, it can be concluded that the ANN-GA is a viable quantitative tool for optimizing the process variables for the UAE of B. alba. With continued technological developments in the UAE that involve the development of sophisticated machinery, integration with process modeling, and scaling up capabilities, its commercial applications are expected to expand. This will enable the development of more efficient and environmentally friendly alternatives to traditional extraction methods. Furthermore, the findings will help in widening the use of B. alba in various food formulations and pharmaceutical applications.