Introduction

Environmental degradation is one of the main challenges faced by policymakers and decision-makers. The increasing rate of CO2 emission into the atmosphere is an important cause of environmental degradation, and increasing industrialization and urbanization are further causes of CO2 emissions in developing countries (Zhao et al. 2018). Rising CO2 levels drive global warming and climate change, and ocean acidification and desertification are additional consequences of CO2 emission into the atmosphere. CO2 emissions also increase pollution levels and related diseases and thus affect human health. Predicting CO2 emissions is therefore one of the most important issues for researchers. The agricultural sector is one of the most important sources of CO2 emissions: in 2018, 9.9% of greenhouse gas emissions were related to the agricultural sector. Cattle, agricultural soils, and rice production can increase CO2 emissions, and the agricultural sector causes 10–14% of global anthropogenic greenhouse gas emissions (Shabani et al. 2021). If decision-makers want to manage CO2 emissions, it is necessary to accurately estimate the CO2 emissions caused by different sectors, such as the agricultural sector. Predictive models use parameters that are relevant to CO2 emissions to estimate the amounts of CO2 emitted in different years. Shi et al. (2019a) stated that CO2 emission depends on different parameters, such as industrial activity and energy intensity. Shi et al. (2019b) stated that the household sector of China is responsible for 12.6% of CO2 emissions.

In recent years, machine learning algorithms (MLAs) have been widely used for predicting different variables, such as climate variables, pollutants, gas emissions, and hydrological variables (Banadkooki et al. 2020a). The advantages of MLAs include their ability to handle large amounts of input data, their fast processing speed, and their accurate predictions. In the context of CO2 emissions, researchers have used different MLAs to predict CO2 emissions and greenhouse gases (GGs). Table 1 reviews CO2 emission forecasting approaches. As observed in Table 1, MLAs have a high ability to predict CO2 emissions. The SVM is a member of the MLA family that is widely used for predicting target variables. SVM models perform well in high-dimensional spaces, and modelers can define different kernel functions depending on their requirements. SVM models provide good generalization capability and can reduce computational complexity. Saleh et al. (2016) investigated the ability of the SVM model to predict CO2 emissions. Electrical energy and coal-burning data were used as the inputs to the models, and trial and error was used to adjust the parameters of the SVM model. It was concluded that the SVM model was effective for predicting CO2 emissions. Sun and Liu (2016) exploited the least squares SVM (LSSVM) model to predict different kinds of CO2 emissions and concluded that classifying CO2 emissions enhanced forecast accuracy. Ahmadi et al. (2019a) explored the use of the LSSVM for predicting CO2 emissions. They coupled the genetic algorithm (GA) with particle swarm optimization (PSO) to create a new hybrid algorithm for training the SVM model and concluded that the LSSVM-PSO-GA had better efficiency in predicting CO2 emissions than the LSSVM-GA and LSSVM-PSO models. Wu and Meng (2020) coupled the LSSVM with the bat algorithm (BA) for predicting CO2 emissions and observed that the LSSVM-BA outperformed the extreme learning machine and the backpropagation neural network.

Table 1 A detailed review of the papers considered

Generally, MLAs have strong prediction abilities with respect to CO2 emissions, but there are challenges:

  1. The model parameters of MLAs need to be tuned based on powerful training algorithms, such as advanced optimization algorithms (Ehteram et al. 2020; Banadkooki et al. 2021).

  2. Some MLAs, such as the ANN, SVM, and ANFIS models, have different kinds of kernel functions and activation functions. Thus, the best function should be selected to predict the target variable based on the received input data (Darabi et al. 2021).

  3. The selection of the best input scenario for each MLA requires preprocessing methods.

  4. Previous studies compared various models and determined a superior model for predicting CO2 emissions. In fact, competing models are rejected or accepted based on their accuracy, but the main question is how the synergy among multiple models can be used.

To address the abovementioned challenges, the current study uses a new hybrid framework for predicting CO2 emissions. One of the most powerful models for predicting target variables such as CO2 emissions is the SVM. The kernel functions of the SVM have parameters that must be tuned, and the values of these parameters can be obtained based on robust optimization algorithms. In this study, four multiobjective algorithms (MOAs), namely, the MO seagull optimization algorithm (MOSOA), MO salp swarm algorithm (MOSSA), MO bat algorithm (MOBA), and MO particle swarm optimization (MOPSO), are used to improve the performance of the SVM model and address the challenges mentioned above:

  1. To obtain the values of the SVM parameters, an objective function, the root mean square error (RMSE), is used as the first objective function.

  2. To choose the best kernel function, the names of the kernel functions are inserted into the optimization algorithms as decision variables. A second objective function, the mean absolute error (MAE), is used to choose the best kernel function.

  3. The names of the input variables are inserted into the algorithms as decision variables. The Nash-Sutcliffe efficiency (NSE) is used as the third objective function for choosing the input variables.

  4. An inclusive multiple model is used to predict CO2 emissions based on the synergy among the SVM-MOSOA, SVM-MOSSA, SVM-MOBA, and SVM-MOPSO models.

Regarding the abovementioned points, the main novelties of the current paper are as follows:

  1. The establishment of new hybrid SVM models that were not used in previous articles for predicting CO2 emissions.

  2. The creation of an inclusive multiple model that uses the contributions of different SVM models for predicting CO2 emissions.

  3. The application of the current study is not limited to CO2 emissions; modelers can use the proposed models for predicting other variables, such as hydrological variables.

  4. The presentation of an effective approach for finding the best values of the random parameters of the compared multiobjective algorithms.

To the best of the authors’ knowledge, no previous article has investigated the new hybrid SVM models proposed in the current study for predicting CO2 emissions. In this study, new hybrid SVM models are used to predict CO2 emissions in the agricultural sector of Iran based on data from 20 provinces (Fig. 1). Section 2 of the current study explains the structures of the compared models and the methods utilized. The case study is explained in Section 3. Section 4 presents a discussion and the experimental results. Section 5 explains the conclusions of the paper.

Fig. 1 Methodology flowchart

Materials and methods

Structure of the support vector machine

The first version of the SVM was introduced by Sain and Vapnik (1996). The SVM model uses a kernel function to find the relationships between the model inputs and outputs. The linear form of the SVM is as follows:

$$ f(x)={\eta}^{Tr}.x+\beta $$
(1)

where x is the input, Tr denotes the transpose operation, η denotes the weighting coefficients of the input variables, β is the bias, and f(x) is the variable predicted by the SVM.

The aim of the SVM is to minimize the difference between the predicted and observed values. Thus, an optimization problem is defined to minimize the error function, which is named the ε-insensitive loss function. The SVM acts based on the following equations:

$$ \mathit{\operatorname{Minimize}}\frac{1}{2}{\left\Vert \eta \right\Vert}^2+ PE\sum \limits_{i=1}^m\left({\psi}_i^{-}+{\psi}_i^{+}\right) $$
(2)
$$ {\displaystyle \begin{array}{c}\mathbf{subject}\ \mathbf{to}:\left({\eta}_i.{x}_i+\beta \right)-{z}_i\le \varepsilon +{\psi}_i^{+},\kern0.5em i=1,2,\dots, m\\ {}{z}_i-\left({\eta}_i.{x}_i+\beta \right)\le \varepsilon +{\psi}_i^{-}\end{array}} $$
(3)

where PE is the penalty coefficient, m is the number of observed data points, \( {\psi}_i^{-} \) and \( {\psi}_i^{+} \) represent the violations (slack variables) for the ith training data point, ε is the permitted error threshold, xi is the input variable, zi is the target variable, and ηi is the ith weight variable. The weight variable and bias are obtained based on Eqs. 2 and 3. Then, they are inserted into Eq. 1 to obtain f(x). The SVM uses several kernel functions to map the dataset to a linearly separable space.

$$ f(x)={\eta}^{Tr}.K\left(x,{x}_i\right)+\beta $$
(4)

Sigmoid function:

$$ K\left(x,{x}_i\right)=\mathit{\tanh}\left(\gamma \left(x.{x}_i\right)+r\right) $$
(5)

Radial basis function (RBF):

$$ K\left(x,{x}_i\right)=\mathit{\exp}\left(-\gamma {\left\Vert x-{x}_i\right\Vert}^2\right) $$
(6)

Polynomial function:

$$ K\left(x,{x}_i\right)={\left(\gamma \left(x,{x}_i\right)+r\right)}^d $$
(7)

where K(x, xi) is the kernel function; γ, r, and d are kernel parameters; and PE and ε are SVM parameters (the values of all of these parameters are obtained based on the corresponding MOAs).
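As a concrete illustration, the three kernel functions in Eqs. 5-7 can be written in a few lines of NumPy; the values of γ, r, and d below are arbitrary choices for demonstration, not the tuned values obtained by the MOAs:

```python
import numpy as np

# Minimal sketches of the kernel functions in Eqs. 5-7 (illustrative
# parameter values, not the MOA-tuned ones from the study).
def sigmoid_kernel(x, xi, gamma=0.5, r=1.0):
    # Eq. 5: tanh(gamma * (x . xi) + r)
    return np.tanh(gamma * np.dot(x, xi) + r)

def rbf_kernel(x, xi, gamma=0.5):
    # Eq. 6: exp(-gamma * ||x - xi||^2)
    return np.exp(-gamma * np.linalg.norm(np.asarray(x) - np.asarray(xi)) ** 2)

def polynomial_kernel(x, xi, gamma=0.5, r=1.0, d=2):
    # Eq. 7: (gamma * (x . xi) + r)^d
    return (gamma * np.dot(x, xi) + r) ** d

x = np.array([1.0, 2.0])
xi = np.array([1.5, 1.0])
print(rbf_kernel(x, xi))  # exp(-0.5 * ||x - xi||^2)
```

In the hybrid framework, the MOAs would supply both the kernel choice and the kernel parameter values to such functions.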

Seagull optimization algorithm

The SOA has been widely applied in different fields, with applications such as multiobjective optimization (Dhiman et al. 2020; Dhiman and Kumar 2019), feature selection (Jia et al. 2019), experimental fuel cell modeling (Cao et al. 2019), and classification (Jiang et al. 2020). Its few parameters, easy implementation, fast convergence, and use of swarm experience are the advantages of the SOA. Migrating birds are the prey of seagulls, which use a natural spiral-shaped movement to attack them. Seagulls live in groups and travel in the direction of the seagull that is fittest for survival. The mathematical model of the SOA is based on a migration phase and an attack phase:

(1) Migration phase

When the seagulls move in the search space, collisions between neighbors should be avoided. Thus, the new locations of the search agents (seagulls) are computed as follows:

$$ {\overrightarrow{P}}_s=\alpha \times {\overrightarrow{L}}_s(x) $$
(8)

where \( {\overrightarrow{P}}_s \) is the location of the search agent that prevents collisions with the other seagulls, \( {\overrightarrow{L}}_s(x) \) is the location of the seagull at iteration i, and α represents the motion behavior of the seagull. α is computed as follows:

$$ \alpha ={f}_c-\left(i\times \left(\frac{f_c}{\mathit{\operatorname{Max}}(i)}\right)\right) $$
(9)

where fc is the frequency control of α and i is the number of iterations. In the next phase, the seagulls move towards the direction of the best neighbor:

$$ {\overrightarrow{N}}_e=\zeta \times \left({\overrightarrow{P}}_b-{\overrightarrow{L}}_s\right) $$
(10)

where \( {\overrightarrow{N}}_e \) is the movement of seagull \( {\overrightarrow{L}}_s \) towards the best seagull \( {\overrightarrow{P}}_b \), and ζ is a random value that is obtained based on the following equation:

$$ \zeta =2\times {\alpha}^2\times RA $$
(11)

where RA is a random number. Finally, the seagulls update their locations as follows:

$$ {\overrightarrow{F}}_s=\left|{\overrightarrow{P}}_s+{\overrightarrow{N}}_e\right| $$
(12)

where \( {\overrightarrow{F}}_s \) is the distance between the search agent and best seagull.

(2) Attack phase

During the attack, seagulls maintain their altitude using their wings and weight. The speed and angle of the seagull attack are changed continuously. They use spiral movements to attack their prey, as follows:

$$ x{x}^{\prime }=r\times \mathit{\cos}\left(\tau \right) $$
(13)
$$ y{y}^{\prime }=r\times \mathit{\sin}\left(\tau \right) $$
(14)
$$ z{z}^{\prime }=r\times \tau $$
(15)
$$ r=u\times {e}^{\tau v} $$
(16)

where r is the radius of each spiral turn; xx′, yy′, and zz′ are the positions of the seagull in the x, y, and z planes, respectively; u and v are constants; and τ is a random number. Finally, the seagulls change their locations as follows:

$$ {\overrightarrow{L}}_s=\left({\overrightarrow{F}}_s\times x{x}^{\prime}\times y{y}^{\prime}\times z{z}^{\prime}\right)+{\overrightarrow{P}}_b $$
(17)

where \( {\overrightarrow{L}}_s \) stores the best solution. Figure 2 shows the flowchart of the SOA.

Fig. 2 The flowchart of the different optimization levels of the SOA
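The migration and attack equations (Eqs. 8-17) can be combined into a single position-update sketch; the constants fc, u, and v and the random draws below are illustrative choices, not values from the study:

```python
import numpy as np

# A one-iteration sketch of the SOA position update (Eqs. 8-17).
# fc, u, v, and the random draws are illustrative, not from the paper.
rng = np.random.default_rng(1)

def soa_update(L_s, P_b, it, max_it, fc=2.0, u=1.0, v=1.0):
    alpha = fc - it * (fc / max_it)                      # Eq. 9
    P_s = alpha * L_s                                    # Eq. 8: collision avoidance
    zeta = 2.0 * alpha ** 2 * rng.random()               # Eq. 11
    N_e = zeta * (P_b - L_s)                             # Eq. 10: move towards the best seagull
    F_s = np.abs(P_s + N_e)                              # Eq. 12
    tau = rng.uniform(0, 2 * np.pi)                      # random spiral angle
    r = u * np.exp(tau * v)                              # Eq. 16
    xx = r * np.cos(tau)                                 # Eq. 13
    yy = r * np.sin(tau)                                 # Eq. 14
    zz = r * tau                                         # Eq. 15
    return F_s * xx * yy * zz + P_b                      # Eq. 17

pos = soa_update(np.array([0.3, 0.7]), np.array([0.0, 1.0]), it=5, max_it=100)
print(pos.shape)
```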

Salp swarm algorithm

The SSA is widely applied for optimization problems in different fields, such as feature selection (Tubishat et al. 2021), engineering optimization (Salgotra et al. 2021), optimal power flow calculation (Abd el-sattar et al. 2021), and ANN training (Kandiri et al. 2020). The SSA has a good balance between exploration and exploitation, as well as fast convergence. The positions of salps in an optimization problem signify a candidate solution. The first salp at the front of the salp chain is called the leader, and the other salps are called followers. Figure 3 shows a salp chain. The position of the leader is changed as follows:

$$ {\boldsymbol{Salp}}_{1,j}=\left[\begin{array}{c}{\boldsymbol{Food}}_j+{\rho}_1\times \left(\left({up}_j-{low}_j\right)\times {\rho}_2+{low}_j\right)\leftarrow {\rho}_3\ge 0.50\\ {}{\boldsymbol{Food}}_j-{\rho}_1\times \left(\left({up}_j-{low}_j\right)\times {\rho}_2+{low}_j\right)\leftarrow {\rho}_3<0.50\end{array}\right] $$
(18)

where Salp1, j is the location of the leader in the jth dimension, Foodj is the food source in the jth dimension, upj and lowj are the upper and lower bounds of the jth dimension, ρ1 is a control parameter, and ρ2 and ρ3 are random parameters. ρ1 is updated as follows:

$$ {\rho}_1=2\times {e}^{-{\left(\frac{4\times l}{L}\right)}^2} $$
(19)
Fig. 3 The schematic structure of the salp chain (Zhang et al. 2021b)

where L is the maximum number of iterations and l is the current number of iterations. The locations of the followers are updated as follows:

$$ {fol}_{i,j}=\frac{1}{2}\left({fol}_{i,j}+{fol}_{i-1,j}\right) $$
(21)

where foli, j is the position of the ith follower in the jth dimension. Figure 4 shows the SSA flowchart.

Fig. 4 The flowchart of the SSA
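The leader and follower updates (Eqs. 18, 19, and 21) can be sketched as follows; the bounds and food-source position below are illustrative:

```python
import numpy as np

# Sketch of one salp-chain update (Eqs. 18, 19, 21); bounds and food
# position are illustrative values.
rng = np.random.default_rng(2)

def update_chain(salps, food, low, up, l, L):
    rho1 = 2.0 * np.exp(-(4.0 * l / L) ** 2)             # Eq. 19
    new = salps.copy()
    for j in range(salps.shape[1]):                      # leader update, Eq. 18
        step = rho1 * ((up[j] - low[j]) * rng.random() + low[j])
        new[0, j] = food[j] + step if rng.random() >= 0.5 else food[j] - step
    for i in range(1, salps.shape[0]):                   # follower update, Eq. 21
        new[i] = 0.5 * (salps[i] + salps[i - 1])
    return new

salps = rng.uniform(0, 1, size=(5, 3))                   # 5 salps, 3 dimensions
out = update_chain(salps, food=np.zeros(3), low=np.zeros(3), up=np.ones(3), l=10, L=100)
print(out.shape)
```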

Bat algorithm

The BA was inspired by the behavior of bats when finding food. The BA has been applied in different fields, such as image segmentation (Yue and Zhang 2020), parameter extraction for photovoltaic models (Deotti et al. 2020), MLA training (Dong et al. 2020), optimal reactive power dispatching (Mugemanyi et al. 2020), continuous optimization (Chakri et al. 2017), and numerical optimization (Wang et al. 2019). The bats use echolocation behavior to distinguish food from obstacles. A bat adjusts its distance from food based on its wavelength, frequency, and pulsation rate. The bats update their frequencies, velocities, and locations as follows:

$$ {fre}_i={fre}_{\mathrm{min}}+{r}_1\left({fre}_{\mathrm{max}}-{fre}_{\mathrm{min}}\right) $$
(22)
$$ {ve}_{t+1}^i={ve}_t^i+{fre}_i\left({X}_t^i-{X}_t^{\boldsymbol{best}}\right) $$
(23)
$$ {X}_{t+1}^i={X}_t^i+{ve}_{t+1}^i $$
(24)

where frei is the frequency of the ith bat; fremin and fremax are the minimum and maximum frequencies, respectively; r1 is a random value; \( {ve}_{t+1}^i \) is the velocity of the ith bat at iteration t+1; \( {X}_t^i \) is the location of the ith bat at iteration t; and \( {X}_t^{\boldsymbol{best}} \) is the location of the best bat (the best solution). The bats use a random walk to perform the local search operation:

$$ {X}_i^{t+1}={X}_i^t+\varphi {A}_i^{t+1} $$
(25)

where φ is a random number and \( {A}_i^{t+1} \) is a loudness parameter. The pulsation rate and loudness are adjusted based on the following equations:

$$ {A}_i^{t+1}=\mu {A}_i^t $$
(26)
$$ {r}_i^{t+1}={r}_i(0)\left[1-\mathit{\exp}\left(-\gamma t\right)\right] $$
(27)

where μ and γ are constant values, ri(0) is the initial value of the pulsation rate, \( {A}_i^{t+1} \) is the value of loudness at iteration t+1, and \( {r}_i^{t+1} \) is the pulsation rate at iteration t+1. Figure 5 shows the BA flowchart.

Fig. 5 The flowchart of the BA
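The bat update rules (Eqs. 22-24, 26, and 27) can be sketched as follows; the frequency bounds and the constants μ and γ are illustrative values, not those of the study:

```python
import numpy as np

# Sketch of the bat update rules (Eqs. 22-24, 26, 27); the frequency
# bounds and the constants mu and gamma are illustrative.
rng = np.random.default_rng(3)

def bat_step(X, V, X_best, fre_min=0.0, fre_max=2.0):
    r1 = rng.random(X.shape[0])[:, None]
    fre = fre_min + r1 * (fre_max - fre_min)             # Eq. 22
    V_new = V + fre * (X - X_best)                       # Eq. 23
    return X + V_new, V_new                              # Eq. 24

def update_loudness_rate(A, r0, t, mu=0.9, gamma=0.9):
    # Eq. 26 (loudness decay) and Eq. 27 (pulsation-rate growth)
    return mu * A, r0 * (1.0 - np.exp(-gamma * t))

X = rng.uniform(-1, 1, size=(4, 2))                      # 4 bats, 2 dimensions
V = np.zeros((4, 2))
X_new, V_new = bat_step(X, V, X_best=X[0])
A_new, r_new = update_loudness_rate(1.0, 0.5, t=3)
print(X_new.shape, round(A_new, 2))
```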

Particle swarm optimization

PSO is a powerful optimization algorithm that is widely used in different fields, such as for multiobjective optimization problems (Zhang et al. 2020), feature selection problems (El-Kenawy and Eid 2020), environmental economic dispatch problems (Xin-gang et al. 2020), green coal production problems (Cui et al. 2020), and the training of ANN models (Darwish et al. 2020). The collaborative behavior of the swarm in the PSO algorithm is one of the advantages of PSO with regard to finding optimal solutions. In PSO, first, the random positions and velocities of the particles are initialized. Then, an objective function is computed for each particle so that the best particle position achieved by the population so far is identified. The velocities and positions of the particles are updated based on the following equations. The process continues until the stopping criterion is satisfied.

$$ {v}_{i,d}\left(t+1\right)=\kappa {v}_{i,d}(t)+{c}_1{r}_1\left({p}_{\boldsymbol{best},i,d}-{x}_{i,d}(t)\right)+{c}_2{r}_2\left({g}_{\boldsymbol{best},d}-{x}_{i,d}(t)\right) $$
(28)
$$ {x}_{i,d}\left(t+1\right)={x}_{i,d}(t)+{v}_{i,d}\left(t+1\right) $$
(29)

where vi, d(t + 1) is the velocity of the ith particle in the dth dimension during iteration t+1, κ is the inertia weight, c1 and c2 are acceleration coefficients, pbest, i, d is the personal best position, gbest, d is the global best position, xi, d(t + 1) is the position of the ith particle during iteration t+1, and r1 and r2 are random numbers.
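The velocity and position updates (Eqs. 28-29) can be sketched as follows; the inertia weight κ and the accelerations c1 and c2 are typical illustrative values:

```python
import numpy as np

# Sketch of the PSO velocity/position update (Eqs. 28-29); kappa, c1,
# and c2 are typical illustrative values, not tuned ones.
rng = np.random.default_rng(4)

def pso_step(x, v, p_best, g_best, kappa=0.7, c1=1.5, c2=1.5):
    r1 = rng.random(x.shape)
    r2 = rng.random(x.shape)
    v_new = kappa * v + c1 * r1 * (p_best - x) + c2 * r2 * (g_best - x)  # Eq. 28
    return x + v_new, v_new                                              # Eq. 29

x = rng.uniform(-1, 1, size=(6, 2))                      # 6 particles, 2 dimensions
v = np.zeros((6, 2))
p_best = x.copy()
g_best = x[0]
x_new, v_new = pso_step(x, v, p_best, g_best)
print(x_new.shape)
```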

Multiobjective optimization problems

In a multiobjective optimization problem (MOOP), some objective functions can conflict with each other, so the solutions of an MOOP cannot be compared using simple relational operators. One of the most important concepts in MOOPs is Pareto dominance: solution A dominates solution B if A is at least as good as B on all objectives and strictly better on at least one objective. The circles in Fig. 6 are better than the squares because, for a minimization problem in which both objectives are minimized, the circles achieve lower objective function values. While the circles dominate the squares, they do not dominate each other. Each MOOP has a set of best nondominated solutions, namely, the Pareto optimal set. The projection of the Pareto optimal solutions in the objective space is known as the Pareto optimal front. In an MOOP, there is an external archive in which the nondominated solutions of the MOAs are stored. The next challenge is the selection of a target for each iteration of the algorithm. In fact, the targets are the best positions of the leader, bat, particle, and seagull for the MOSSA, MOBA, MOPSO algorithm, and MOSOA, respectively. To find the target for updating the positions of the other agents, the number of neighboring solutions (NSs) in the neighborhood of each solution is counted. During this phase, the target should be chosen from the set of nondominated solutions with the least crowded neighborhoods. To determine how populated a neighborhood is, the NSs within a certain maximum distance are counted.

$$ \overrightarrow{d}=\frac{\overrightarrow{\max }-\overrightarrow{\min }}{\boldsymbol{Archive}\ \boldsymbol{size}} $$
(30)
Fig. 6 Dominance conception

\( \overrightarrow{d} \) is the crowding distance, \( \overrightarrow{\max } \) denotes the maximum value of every objective, and \( \overrightarrow{\min } \) denotes the minimum value of every objective. Based on the computed distances, a rank is assigned to each solution, and a roulette wheel is used to choose the target. The solutions with more NSs have higher ranks; thus, the target is chosen among the solutions with the lowest ranks. Setting the size of the archive is another challenge. The archive can store only a limited number of nondominated solutions; thus, the solutions with the most crowded neighborhoods are chosen for removal from the archive. Following a similar process, the crowding distances are computed, a rank is assigned to each solution, and a roulette wheel is used to select the solutions with the highest ranks (the most crowded neighborhoods). To update the archive of nondominated solutions, the following rules should be considered:

  1. If one of the archive solutions dominates an external solution, the external solution should be discarded.

  2. If an external solution is not dominated by any solution in the archive, it should be added to the archive (and any archive solutions that it dominates should be removed).
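The dominance test and the two archive-update rules can be sketched as follows (for minimization of all objectives); this is a minimal illustration, not the authors' implementation:

```python
# Minimal sketch of Pareto dominance and the archive-update rules
# (minimization of all objectives); purely illustrative.
def dominates(a, b):
    """a dominates b: no worse on all objectives, strictly better on at least one."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))

def update_archive(archive, candidate):
    if any(dominates(s, candidate) for s in archive):
        return archive                       # rule 1: the candidate is discarded
    archive = [s for s in archive if not dominates(candidate, s)]
    archive.append(candidate)                # rule 2: non-dominated, so it is added
    return archive

arch = [(1.0, 4.0), (2.0, 2.0), (4.0, 1.0)]
arch = update_archive(arch, (1.5, 1.5))      # dominates (2.0, 2.0), which is removed
print(sorted(arch))
```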

The MOSOA, MOBA, MOPSO algorithm, and MOSSA are performed based on the following procedure:

  1. The random parameters of the MOAs are defined for each algorithm.

  2. The random positions (and velocities) of the agents (particles, bats, seagulls, and salps) are defined.

  3. The objective function (OBF) values are calculated for each agent.

  4. The nondominated solutions are determined based on the values of the OBFs.

  5. The size of the archive is checked; if it is full, the solutions with the most NSs are removed from the archive.

  6. The archive is updated based on the rules mentioned above.

  7. The target is selected from the solutions with the least crowded neighborhoods.

  8. The process is repeated until an end condition is satisfied.
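The steps above can be combined into a simplified skeleton; the objective functions, agent-update rule, archive truncation, and target selection below are placeholders for the scheme described above (crowding-based pruning and roulette-wheel selection in the actual algorithms):

```python
import random

# Simplified skeleton of the generic MOA loop (steps 1-8 above);
# not the authors' implementation.
def dominates(a, b):
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))

def run_moa(objectives, n_agents=10, dim=2, iters=20, cap=5, seed=0):
    rnd = random.Random(seed)                                           # step 1
    agents = [[rnd.random() for _ in range(dim)] for _ in range(n_agents)]  # step 2
    archive = []
    for _ in range(iters):
        for a in agents:
            f = tuple(obj(a) for obj in objectives)                     # step 3
            if not any(dominates(s[0], f) for s in archive):            # steps 4 and 6
                archive = [s for s in archive if not dominates(f, s[0])]
                archive.append((f, list(a)))
        archive = archive[:cap]             # step 5 (crowding-based in practice)
        target = rnd.choice(archive)[1]     # step 7 (roulette wheel in practice)
        agents = [[x + 0.1 * (t - x) for x, t in zip(a, target)]
                  for a in agents]          # placeholder agent update
    return archive                          # step 8: stop after a fixed iteration budget

front = run_moa([lambda a: a[0], lambda a: 1.0 - a[0] + a[1]])
print(len(front))
```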

Case study

According to the Paris Agreement, Iran has agreed to decrease its CO2 emissions by 4% by 2030. Iran has stated that with international support and without any risk of sanctions, a 12% reduction in CO2 emissions is possible. However, there are challenges with respect to decreasing CO2 emissions. During the 2000s, environmental issues had lower importance than economic and social issues. The development of industrial units around Tehran (the capital of Iran) is one of the main causes of increased CO2 emissions and air pollutants. Iran is known as the 7th largest CO2 emitter in the world. The agricultural sector of Iran is one of the important causes of increased CO2 emissions, and different parameters in the agricultural sector affect CO2 emissions. One of the most important parameters is the gross domestic product (GDP). Different studies have investigated the effect of the GDP on CO2 emissions. Cowan et al. (2014) stated that there was a Granger causality between the GDP and CO2 emissions in Brazil during the period from 1990 to 2000. Zubair et al. (2020) investigated the relationship between CO2 emissions and the GDP in Nigeria and stated that the CO2 emissions over the long term (1980–2018) in Nigeria decreased with increasing GDP. Additionally, the literature has used the environmental Kuznets curve (Kuznets 1955) to find the relationship between economic growth and environmental quality. Grossman and Krueger (1991) used the Kuznets curve to explain the relationship between environmental degradation and economic growth. Based on the Kuznets curve, economic growth first degrades the environment; after economic growth passes a turning point, further growth improves the environment. Additionally, the square of the per capita GDP is one of the most effective parameters for predicting CO2 emissions (Shabani et al. 2021; Hosseini et al. 2019). Another effective parameter for predicting CO2 emissions is the Gini coefficient (Cheng et al. 2021; Shabani et al. 2021). The Gini index estimates income inequality in the agricultural sector of Iran. Energy consumption is yet another effective parameter for estimating CO2 emissions; the amount of fossil fuel utilized in this sector is represented by the energy consumption variable. Ali et al. (2021) stated that increasing fossil fuel usage increases CO2 emissions. Koengkan et al. (2019) stated that renewable energies should be used as alternatives to fossil fuels because fossil fuels increase CO2 emissions.

Generally, based on the abovementioned discussions, the following inputs are used in the current study for predicting CO2 emissions:

  1. GDP

  2. Square of the GDP

  3. Energy use

  4. Gini index

To obtain the real GDP, the nominal values are divided by the producer price index, and the totals are divided by the population to obtain the per capita GDP. Different input combinations can be provided based on the inputs above; to find the best input combination, the SVM is coupled with the MOAs. Table 2 shows the statistical characteristics of the data. Annual data for 25 provinces of Iran from 1990 to 2018 are used to predict the CO2 emissions caused by the agricultural sector of Iran. The dataset was collected from the website of the Statistics Center of Iran.
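The input preparation described above can be sketched as follows; the numbers are purely illustrative, not actual Iranian statistics:

```python
# Sketch of the input preparation described above; all numbers are
# illustrative, not actual Iranian statistics.
def prepare_inputs(nominal_gdp, ppi, population, energy_use, gini):
    real_gdp = nominal_gdp / ppi            # deflate by the producer price index
    gdp_pc = real_gdp / population          # per capita real GDP
    return {"GDP": gdp_pc,
            "GDP2": gdp_pc ** 2,            # square of the per capita GDP
            "Energy": energy_use,
            "Gini": gini}

row = prepare_inputs(nominal_gdp=500.0, ppi=1.25, population=2.0e6,
                     energy_use=340.0, gini=0.38)
print(row["GDP"], row["GDP2"])
```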

Table 2 The statistical characteristics of the input and output data

Hybrid SVM and MOAs

In this section, the MOAs are used to improve the accuracy of the SVM based on the following procedure:

  1. The input data are prepared for the SVM models. Seventy percent of the data are used for training, and 30% are used for testing. These percentages are chosen because they provide the lowest error for the SVM model.

  2. The SVM model uses the training data to predict CO2 emissions. If the stopping criterion is satisfied, the model proceeds to the testing phase; otherwise, the MOAs are linked to the SVM model.

  3. The MOAs are defined based on the rules in Section 2.5. The names of the inputs, the names of the kernel functions, and the initial guesses of the kernel parameters are inserted into the MOAs as decision variables. Three objective functions are used to find the parameter values, the best kernel function, and the best input combination: the RMSE is used as the first objective function for finding the kernel parameters, the MAE is used to find the best kernel function, and the NSE is used to find the best input combination. In fact, the positions of the agents determine the values of the decision variables.

  4. The Pareto front is created, and the nondominated solutions are placed on the Pareto front. Each solution includes three kinds of information: the best input combination, the values of the parameters, and the best kernel function.

  5. A multicriteria decision process is used to choose the best solution from the Pareto front as the final solution.

  6. The SVM model runs again based on the obtained configuration, and the process is repeated.

Inclusive multiple model

Previous studies used competitive models for predicting different target variables, but there are several concerns:

  1. The final outputs of the previous studies were the selection of the best and worst models. The best model was suggested for subsequent studies, and the worst models were discarded.

  2. The intercomparisons between the predictive models were never exhaustive.

  3. There was no effort to provide more accurate results based on the synergy among all of the competing models used.

In this study, first, SVM-MOSOA, SVM-MOPSO, SVM-MOSSA, and SVM-MOBA are used to predict CO2 emissions. In the next stage, to increase the accuracy of the outputs, an inclusive multiple model is used to improve the accuracy of the final outputs based on the synergy among the SVM-MOSOA, SVM-MOPSO, SVM-MOSSA, and SVM-MOBA models as follows:

  1. First, the SVM-MOSOA, SVM-MOPSO, SVM-MOSSA, and SVM-MOBA models are used to predict CO2 emissions.

  2. In the next stage, the outputs of the SVM-MOSOA, SVM-MOPSO, SVM-MOSSA, and SVM-MOBA models are used as the inputs of an ANN model to predict CO2 emissions. In fact, the outputs of the first stage are considered lower-order modeling results. Utilizing the inclusive multiple model increases the accuracy of the results by exploiting the advantages of all of the competing models, and the modeler ensures that the capacities of all models are used to extract the most accurate results possible.

The ANN model used in the current study includes one input layer, one hidden layer, and one output layer, as observed in Fig. 7. The input layer receives the outputs of the competing models as its inputs. The hidden layer processes the received inputs based on the chosen activation function. In this study, the sigmoid function is used, as it was one of the most successful activation functions in previous studies (Banadkooki et al. 2020b; Ehteram et al. 2021). The ANN processes data based on the following equation:

$$ Y={b}_0+\sum \limits_{j=1}^{Nh}{\omega}_jf\left({b}_{oj}+\sum \limits_{i=1}^{Nin}{\omega}_{ij}{\boldsymbol{IN}}_{ij}\right) $$
(31)

where b0 and boj are the biases of the output layer and the jth hidden neuron, respectively; ωij is the weight of the ith input in the jth hidden neuron; INij is the network input; f is the activation function; ωj is the weight of the output from hidden neuron j; Nh is the number of hidden neurons; Y is the output; and Nin is the number of inputs.

$$ f(y)=\frac{1}{1+{e}^{-y}} $$
(32)
Fig. 7 (a) The measured CO2 for 700 data points and (b) the structure of the IMM model

where f(y) is the output of the activation function for the input y \( \left(y={b}_{oj}+\sum \limits_{i=1}^{Nin}{\omega}_{ij}{\boldsymbol{IN}}_{ij}\right) \). In this study, the backpropagation algorithm is used to train the ANN model.
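The forward pass of the inclusive multiple model (Eqs. 31-32) can be sketched as follows; the weights below are random for illustration, whereas in the study they are obtained by backpropagation:

```python
import numpy as np

# Sketch of the IMM forward pass: a one-hidden-layer network (Eqs. 31-32)
# combining the outputs of the four lower-order SVM models. The weights
# here are random placeholders, not trained values.
def sigmoid(y):
    return 1.0 / (1.0 + np.exp(-y))                      # Eq. 32

def imm_forward(inputs, W_in, b_hidden, w_out, b_out):
    hidden = sigmoid(inputs @ W_in + b_hidden)           # hidden-layer activations
    return b_out + hidden @ w_out                        # Eq. 31

rng = np.random.default_rng(5)
# Hypothetical predictions from SVM-MOSOA, SVM-MOSSA, SVM-MOBA, SVM-MOPSO:
preds_from_4_models = rng.uniform(0, 1, size=(10, 4))
Nh = 3                                                   # hidden neurons
y_hat = imm_forward(preds_from_4_models,
                    rng.normal(size=(4, Nh)), rng.normal(size=Nh),
                    rng.normal(size=Nh), 0.1)
print(y_hat.shape)
```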

Multicriteria decision model

A Pareto front includes multiple solutions. To choose the best solution, a multicriteria decision model is used. One of the powerful multicriteria decision models is the weighted aggregate sum product assessment (WASPAS) technique, which is widely used in different fields, such as solving solar wind power problems (Nie et al. 2017), fuel technology selection (Rani and Mishra 2020), and the development of smart cities (Khan et al. 2020). To use the WASPAS technique to choose the best solution from the Pareto front, the following steps are considered:

  1. A decision matrix is constructed so that the value of each criterion (NSE, RMSE, and MAE) is inserted into the matrix for each solution.

  2. The decision matrix is normalized based on the following equations:

For the NSE criterion:

$$ {z}_{ij}^{\ast }=\frac{z_{ij}}{{\mathit{\max}}_i{z}_{ij}} $$
(33)

For the RMSE and MAE criteria:

$$ {z}_{ij}^{\ast }=\frac{{\mathit{\min}}_i\ {z}_{ij}}{z_{ij}} $$
(34)

\( {z}_{ij}^{\ast } \) is a normalized value, and zij is the performance of the ith alternative with respect to the jth criterion.

  3. In the next stage, the weighted sum model and the weighted product model are computed as follows:

$$ {K}_i=\sum \limits_{j=1}^n{w}_j{z}_{ij}^{\ast } $$
(35)
$$ {L}_i=\prod \limits_{j=1}^n{\left({z}_{ij}^{\ast}\right)}^{w_j} $$
(36)

where Ki is the weighted sum model score, Li is the weighted product model score, wj denotes the weight of the jth criterion, and n is the number of criteria.

  4. The aggregated measure is computed as follows:

$$ {S}_i=\sigma {K}_i+\left(1-\sigma \right){L}_i $$
(37)

where σ is a constant parameter (σ = 0.5). The solution with the highest value of Si is chosen as the best solution.
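The WASPAS steps (Eqs. 33-37) can be sketched as follows for three hypothetical Pareto solutions evaluated on the NSE (a benefit criterion) and the RMSE and MAE (cost criteria); the values and equal weights are illustrative:

```python
import numpy as np

# Sketch of the WASPAS ranking (Eqs. 33-37) for three hypothetical Pareto
# solutions; the criterion values and equal weights are illustrative.
def waspas(matrix, benefit, weights, sigma=0.5):
    M = np.asarray(matrix, dtype=float)
    # Eq. 33 for benefit criteria, Eq. 34 for cost criteria:
    Z = np.where(benefit, M / M.max(axis=0), M.min(axis=0) / M)
    K = (weights * Z).sum(axis=1)            # Eq. 35: weighted sum model
    L = (Z ** weights).prod(axis=1)          # Eq. 36: weighted product model
    return sigma * K + (1 - sigma) * L       # Eq. 37: aggregated measure

# Rows: candidate solutions; columns: NSE (benefit), RMSE, MAE (cost).
scores = waspas([[0.90, 1.2, 0.9],
                 [0.85, 1.0, 0.8],
                 [0.80, 1.5, 1.1]],
                benefit=np.array([True, False, False]),
                weights=np.array([1 / 3, 1 / 3, 1 / 3]))
print(int(scores.argmax()))
```

The solution with the highest aggregated score is the one selected from the Pareto front.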

In this study, to evaluate the performances of the tested models, the following indexes are used:

  • Root mean square error (ideal values are close to zero)

$$ \boldsymbol{RMSE}=\sqrt{\frac{1}{m}\sum \limits_{i=1}^m{\left({CO}_{2 es}-{CO}_{2 ob}\right)}^2} $$
(38)

• Scatter index (SI < 0.10: excellent performance; 0.10 < SI < 0.20: good performance; 0.20 < SI < 0.30: fair performance; SI > 0.30: poor performance) (Li et al. 2013):

$$ SI=\frac{\boldsymbol{RMSE}}{C{\overline{O}}_{2 ob}} $$
(39)

• Mean absolute error (MAE):

$$ MAE=\frac{1}{m}\sum \limits_{i=1}^m\left|{CO}_{2 es}-{CO}_{2 ob}\right| $$
(40)

• Nash Sutcliffe efficiency (NSE) (values close to 1 are ideal)

$$ NSE=1-\frac{\sum \limits_{i=1}^m{\left({CO}_{2 es}-{CO}_{2 ob}\right)}^2}{\sum \limits_{i=1}^m{\left({CO}_{2 ob}-C{\overline{O}}_{2 ob}\right)}^2} $$
(41)

• Uncertainty with a 95% confidence level (lowest values are ideal)

$$ {U}_{95}=1.96\sqrt{\left({SD}^2+{\boldsymbol{RMSE}}^2\right)} $$
(42)

• Percentage bias

$$ \boldsymbol{PBIAS}=100\ast \frac{\sum \limits_{i=1}^m\left({CO}_{2 es}-{CO}_{2 ob}\right)}{\sum \limits_{i=1}^m{CO}_{2 ob}} $$
(43)

where SD is the standard deviation of the residuals, m is the number of data points, CO2ob denotes the observed CO2 values, CO2es denotes the estimated CO2 values, and \( C{\overline{O}}_{2 ob} \) is the average of the observed CO2 values.
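The indices of Eqs. (38)-(43) are straightforward to implement. The sketch below uses illustrative toy data rather than the study's CO2 series:

```python
import math

def evaluate(co2_es, co2_ob):
    """Compute the error indices of Eqs. (38)-(43) for one model."""
    m = len(co2_ob)
    res = [e - o for e, o in zip(co2_es, co2_ob)]          # residuals
    rmse = math.sqrt(sum(r ** 2 for r in res) / m)         # Eq. (38)
    ob_mean = sum(co2_ob) / m
    si = rmse / ob_mean                                    # Eq. (39)
    mae = sum(abs(r) for r in res) / m                     # Eq. (40)
    nse = 1 - (sum(r ** 2 for r in res) /
               sum((o - ob_mean) ** 2 for o in co2_ob))    # Eq. (41)
    res_mean = sum(res) / m
    sd = math.sqrt(sum((r - res_mean) ** 2 for r in res) / m)  # SD of residuals
    u95 = 1.96 * math.sqrt(sd ** 2 + rmse ** 2)            # Eq. (42)
    pbias = 100 * sum(res) / sum(co2_ob)                   # Eq. (43)
    return {"RMSE": rmse, "SI": si, "MAE": mae,
            "NSE": nse, "U95": u95, "PBIAS": pbias}

# Toy data: a near-perfect, unbiased prediction
idx = evaluate([1.1, 1.9, 3.2, 3.8], [1.0, 2.0, 3.0, 4.0])
```

For these toy data, the SI falls below 0.10, which the classification above labels excellent, and the symmetric residuals make the PBIAS zero.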

Additionally, to evaluate the quality of the Pareto fronts obtained by the different MOAs, the following indices are used:

  1. Spacing index (SP): this shows the spread of the computed Pareto front.

$$ SP=\sqrt{\frac{1}{np-1}\sum \limits_{i=1}^{np}{\left(\overline{d}-{d}_i\right)}^2} $$
(44)

where np is the number of Pareto solutions, di is the Euclidean distance between two consecutive solutions in the Pareto front, and \( \overline{d} \) is the average of these distances. The lowest SP value corresponds to the best algorithm.

  2. Maximum spread (MS): the MS shows the distance between the boundary solutions of the front. The highest MS value corresponds to the best algorithm:

$$ MS=\sqrt{\sum \limits_{i=1}^{nf}\mathit{\max}\left(d\left({r}_i,{t}_i\right)\right)} $$
(45)

where d is a function used to compute the Euclidean distance, ri is the maximum value in the i-th objective function, ti is the minimum value in the ith objective function, and nf is the number of objective functions.
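The two front-quality indices can be sketched as follows. Interpreting d(r_i, t_i) as the distance between the extreme values of the i-th objective is an assumption based on the definitions above; the example front is hypothetical.

```python
import math

def spacing(front):
    """SP (Eq. 44): spread of the Euclidean distances between
    consecutive solutions on the front; lower is better."""
    d = [math.dist(a, b) for a, b in zip(front, front[1:])]
    d_bar = sum(d) / len(d)
    return math.sqrt(sum((d_bar - di) ** 2 for di in d) / (len(front) - 1))

def max_spread(front):
    """MS (Eq. 45): extent of the front, aggregating the distance
    between the extreme values of each objective; higher is better."""
    nf = len(front[0])
    return math.sqrt(sum((max(p[i] for p in front) -
                          min(p[i] for p in front)) ** 2 for i in range(nf)))

# A hypothetical, perfectly even two-objective front
front = [(0.0, 1.0), (0.5, 0.5), (1.0, 0.0)]
```

For this evenly spaced front, the consecutive distances are identical, so SP is zero, while MS equals the diagonal extent of the front.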

Discussion and results

Determination of random algorithmic parameters

The MOAs used have random (user-defined) parameters, and their values should be determined accurately because the accuracy of an MOA depends on the chosen parameter values. One of the most robust methods for designing parameters and experiments is the Taguchi model. The Taguchi model is widely used in different fields, such as tuning model parameters (Dutta and Kumar Reddy Narala 2021), optimizing thermal conductivity (Aswad et al. 2020), optimizing air distributor channels (Feng et al. 2020), analyzing soil erosion (Zhang et al. 2021a), and optimizing runoff water quality (Liu et al. 2021). To determine the values of the random parameters, the following steps are considered:

  1. The number of parameters and their levels should be determined. For example, the population size and the maximum number of iterations should be determined for the MOSOA; thus, there are two parameters. As observed in Table 3a, four values are assigned to the population size and to the maximum number of iterations to find the best values of these random parameters.

  2. Regarding the number of parameters and their levels, an orthogonal array is chosen from the Taguchi table. The Taguchi model uses the orthogonal array to decrease the required number of experiments. For the two parameters with four levels each in the MOSOA, the L16 array covers the required experiments. As observed in Table 3b, there are 16 experiments for the two parameters with four levels.

  3. In the next stage, the signal-to-noise (S/N) ratio is computed for each experiment:

Table 3 (a) The level and factors of MOSOA, (b) the S/N value of 16 experiments, (c) the mean S/N value for different parameters, and (d) the optimal value of other parameters of other MOAs
Table 4 The obtained information by the best solution of Pareto fronts for (a) the selection of kernel function, (b) the values of SVM parameters, and (c) the best input combination
$$ \frac{S}{N}\ \boldsymbol{ratio}=-10{\mathit{\log}}_{10}{\left(\boldsymbol{objective}\ \boldsymbol{function}\right)}^2 $$
(46)

High values of the S/N are ideal.

  4. The mean of the S/N values is computed for each factor at each level as follows:

$$ {\left(\boldsymbol{Mean}\right)}_{\boldsymbol{factor}=l}^{\boldsymbol{level}=i}=\frac{1}{n_{il}}\sum \limits_{j=1}^{n_{il}}{\left[{\left(\frac{S}{N}\right)}_j\right]}_{\boldsymbol{factor}=l}^{\boldsymbol{level}=i} $$
(47)

where \( {\left(\boldsymbol{Mean}\right)}_{\boldsymbol{factor}=l}^{\boldsymbol{level}=i} \) denotes the mean S/N ratio of factor l at level i, and nil is the number of experiments in which factor l is set to level i.

Table 3b shows the S/N values for the 16 experiments. The average S/N for each level of each parameter is computed in Table 3c. The highest mean S/N values correspond to the best parameter values. As observed in Table 3c, the population size and the maximum number of iterations at level 1 have the highest mean S/N values (14.26 and 15.33, respectively); thus, the level-1 values of these two parameters are chosen. Obtained via a similar process, the optimal values of the parameters of the other compared MOAs are reported in Table 3d.
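The Taguchi selection described above reduces to computing an S/N ratio per experiment and then averaging it per factor level. The sketch below uses a hypothetical single factor with four levels and made-up objective values; the real design and values are those of the L16 array in Table 3.

```python
import math

def sn_ratio(obj):
    """Smaller-is-better S/N ratio for a minimized objective value."""
    return -10 * math.log10(obj ** 2)

# Hypothetical design: one factor at four levels, two runs per level;
# obj[k] is the objective value (e.g., validation error) of run k.
levels = [0, 0, 1, 1, 2, 2, 3, 3]
obj = [0.30, 0.28, 0.35, 0.33, 0.40, 0.42, 0.50, 0.47]

sn = [sn_ratio(v) for v in obj]
# Mean S/N per level: average the S/N of the runs at that level
mean_sn = [sum(s for s, l in zip(sn, levels) if l == i) / levels.count(i)
           for i in range(4)]
best_level = max(range(4), key=mean_sn.__getitem__)  # highest mean S/N wins
```

Because the smallest objective values occur at level 0, that level attains the highest mean S/N and is selected, mirroring the level-1 choice reported in Table 3c.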

The Pareto fronts of different models

Figure 8 shows the Pareto fronts obtained for the different models. As observed in Fig. 8, the RMSE, MAE, and NSE of SVM-MOSOA vary from 0.25 to 0.75 (kg), from 0.12 to 0.62 (kg), and from 0.92 to 0.98, respectively. The RMSE, MAE, and NSE of SVM-MOSSA vary from 0.35 to 10.05 (kg), from 0.24 to 0.72 (kg), and from 0.91 to 0.97, respectively. The RMSE and MAE of SVM-MOSOA for the non-dominated solutions are lower than those of the other models. The red circle shows the best solution obtained by the multicriteria decision process. As observed in Table 4, the best solution determines the best input scenario, the values of the SVM parameters, and the best kernel function. Table 4a shows the best kernel functions for the different SVM models: the best kernel function for SVM-MOSOA is the RBF, and it is the sigmoid function for SVM-MOSSA, SVM-MOBA, and SVM-MOPSO. The values of the SVM parameters are shown in Table 4b, and the best input combinations are shown in Table 4c. Figure 9 compares the quality of the Pareto fronts obtained by the different models. As observed in Fig. 9, MOSOA provides the lowest SP and the highest MS among the models, whereas MOPSO provides the lowest MS and the highest SP. Thus, MOSOA and MOPSO provide the best and worst Pareto fronts among the tested models, respectively.

Fig. 8

Obtained Pareto fronts for different models

Fig. 9

The SP and MS values for comparing different models

Comparison of model accuracies with respect to predicting CO2 emissions

Table 5 compares the performances of the different models. The RMSE of SVM-MOSOA during the training stage is 16, 55, 57, and 60% lower than those of the SVM-MOSSA, SVM-MOBA, and SVM-MOPSO models. The MAE obtained by SVM-MOSOA is 0.29, while it is 0.32, 0.69, and 0.91 for the SVM-MOSSA, SVM-MOBA, and SVM-MOPSO models, respectively. SVM-MOSOA obtains the highest NSE and the lowest PBIAS during the training stage among the SVM-MOA models. Comparing the SVM-MOA models with and without the IMM model shows that the IMM model increases the accuracy of each SVM-MOA model during the training stage. The IMM model decreases the MAEs of the SVM-MOSOA, SVM-MOSSA, SVM-MOBA, and SVM-MOPSO models by 24, 31, 68, and 76%, respectively. The model performance comparison indicates that the IMM and SVM-MOBA have the highest and lowest NSEs, respectively, among the tested models during the training phase. The model performances indicate that SVM-MOSOA has the best accuracy among the SVM-MOA models; its NSE is 0.93, while it is 0.90, 0.89, and 0.84 for the SVM-MOSSA, SVM-MOBA, and SVM-MOPSO models, respectively. The IMM model outperforms all SVM-MOA models in the testing phase. The model accuracy comparison indicates that the IMM and SVM-MOPSO have the lowest and highest PBIAS values, respectively, among the models. As observed in Table 5, the IMM model decreases the RMSEs of the SVM-MOSOA, SVM-MOSSA, SVM-MOBA, and SVM-MOPSO models by 20, 36, 47, and 50%, respectively. Figure 10a compares the performances of the models based on the SI index. As observed in this figure, the SI values of the IMM model are 0.04 and 0.06 during the training and testing stages, respectively; thus, the performance of the IMM model is excellent. The performances of the SVM-MOSOA, SVM-MOSSA, SVM-MOBA, and SVM-MOPSO models are good during the training and testing phases.
Figure 10b compares the performances of the models based on the U95 measure. As observed in Fig. 10b, the IMM and SVM-MOSOA provide the lowest uncertainty levels among the models. Figure 11 shows the scatterplots obtained using the testing data; additionally, the R2 values obtained on the training data are given for each model. The R2 of the IMM is 0.995, while it is 0.9899, 0.9830, 0.9798, and 0.9752 for the SVM-MOSOA, SVM-MOSSA, SVM-MOBA, and SVM-MOPSO models, respectively. Thus, the IMM has the highest R2 values. Taylor diagrams are effective tools for comparing models. A Taylor diagram uses three evaluation criteria, namely, the standard deviation, the root mean square error, and the correlation coefficient, to evaluate the accuracies of the compared models. A model performs better than the others if its point lies closest to the reference point (the observed data). As observed in Fig. 12, the IMM and SVM-MOSOA perform better than the other models. Figure 13 shows the boxplots of the models. As observed in this figure, the IMM and SVM-MOSOA are in good agreement with the measured data. The relative error of each data point for the different models is shown in the heat map in Fig. 14. As observed in Fig. 14, the relative error of the IMM model varies from 10 to 20%, the lowest among the tested models. The relative error of SVM-MOBA varies from 40 to 60%, and that of SVM-MOPSO varies from 50 to 60%.

Table 5 The comparison of the accuracy of different models
Fig. 10

Comparison of the accuracy of different models based on a: SI index and b: U95%

Fig. 11

The scatterplots for the (a) IMM (R2 training: 0.997), (b) SVM-MOSOA (R2 training: 0.9912), (c) SVM-MOSSA (R2 training: 0.9876), (d) SVM-MOBA (R2 training: 0.9812), and (e) SVM-MOPSO (R2 training: 0.9798)

Fig. 12

Taylor diagram for comparing different models

Fig. 13

The box plot of models for predicting CO2 emission

Fig. 14

The heat map for comparing models based on relative error

Concluding discussion

Regarding the obtained results, the following points should be considered:

  1. One of the main advantages of the current study is that it finds the best input combination without preprocessing the data, such as through the use of the gamma test or principal component analysis. Each multiobjective algorithm can automatically find the best input combination with respect to a defined objective function.

  2. The findings of the current research confirm the results of a previous article by Ahmadi et al. (2019b), who showed that optimization algorithms could improve the accuracy of SVM and LSSVM models for predicting CO2 emissions. They stated that optimization algorithms such as PSO and GA have a high ability to improve the accuracy of soft computing models such as the SVM and LSSVM models. Sun and Liu (2016) used the LSSVM, a member of the SVM family, to predict CO2 emissions. They reported that the LSSVM model outperformed the backpropagation neural network and grey models. Shabani et al. (2021) confirmed that GDP, the square of the GDP, energy use, and the GINI index were effective inputs for predicting CO2 emissions. A comparison of the results obtained by the current study and those of Shabani et al. (2021) revealed that the IMM model produced accurate predictions with the smallest errors.

  3. The findings of the current study confirm the results of previous studies by Ehteram et al. (2021) and Seifi et al. (2020), who showed the high potential of multiobjective algorithms for improving the performances of MLAs.

  4. The results of the current study are similar to those of Shabani et al. (2021) and Khatibi et al. (2017), who showed that IMM-based models provide high accuracy for the prediction of target variables.

  5. Future studies can investigate the effects of uncertain inputs and model parameters on the accuracies of the resulting models.

  6. All of the evaluation criteria show that the performances of the IMM and SVM-MOSOA are the best, but future studies can use a multicriteria model to assign a rank to each model and then select the one with the best performance.

  7. The applications of the models used in the current study are not limited to theoretical aspects; policy-makers and decision-makers can utilize these tools to identify the effective parameters of CO2 emissions and then find real relationships between the inputs and outputs. Thus, the models are suitable for environmental management and provide good insights for predicting CO2 emissions.

  8. However, it should be noted that there are challenges in improving MLA models with MOAs, such as converting a single-objective optimization algorithm into a multiobjective one, defining the various objective functions, and selecting the best solution on the resulting Pareto front.

  9. Depending on the kind of MLA used in a given study, different objective functions can be defined. For example, the number of hidden layers, the ANN parameters (weights and biases), the selection of the activation function, and the selection of the best input combination can be defined as the decision variables for an ANN model. Thus, four objective functions can be defined for the model under study.

  10. Increasing the number of inputs and model parameters may increase the computational complexity and run time of the models.

  11. Another ability of the introduced models is that they can provide spatial maps of CO2 emissions so that regions with high and low CO2 emissions are classified accurately.

  12. An ensemble of different kinds of kernel functions can also be a strategy for increasing the accuracy of the SVM model because the ensemble can exploit the advantages of the different kernel functions.

Conclusion

Predicting CO2 emissions is a real challenge for policy-makers and decision-makers with respect to managing the environment. CO2 emissions depend on different parameters, and thus, finding accurate relationships between model inputs and outputs is an important issue. The agricultural sector is one of the important sources of CO2 production. Thus, the current study used SVM models for predicting CO2 emissions based on data from the agricultural sector. In this study, several MOAs were used to improve the accuracies of the SVM models by finding the best input combinations, model parameters, and kernel functions. Then, an IMM model used the outputs of these models as its inputs to further increase their accuracy. The best input combination for all models was the use of the Gini index, GDP, GDP2, and EU. The SVM-MOA models selected different kernel functions for mapping the complex relationships between the inputs and outputs. The results indicated that the MOSOA provided the best Pareto front among the tested MOAs. It was observed that the IMM model decreased the MAEs of the SVM-MOSOA, SVM-MOSSA, SVM-MOBA, and SVM-MOPSO models by 24, 31, 75, and 76%, respectively. Additionally, SVM-MOSOA achieved the highest accuracy among the SVM-MOA models. The MAE obtained by SVM-MOSOA was 0.29, while it was 0.32, 0.69, and 0.91 for the SVM-MOSSA, SVM-MOBA, and SVM-MOPSO models, respectively. The general results indicated that the IMM model could significantly increase the accuracies of the SVM models. The results of the current study could be useful for decreasing pollutants in the environments of different countries. Future studies can address the limitations of the current work by testing the models for predicting CO2 emissions in different regions of the world. They can also add other socioeconomic variables to the inputs to examine their effects on the accuracy of the models.