1 Introduction

Meta-heuristic algorithms have recently become very popular owing to their proven effectiveness in solving many optimization problems. Some of the most popular meta-heuristic algorithms are Particle Swarm Optimization (PSO) [1], Artificial Bee Colony (ABC) [2] and Ant Colony Optimization [3]. Regardless of their different structures, meta-heuristic algorithms operate in two main phases: exploration and exploitation. In the exploration phase, an optimizer discovers the search space efficiently, mostly through randomization; during this phase, the population may face abrupt changes. In the exploitation phase, the optimizer converges toward the most promising region. In most cases, however, there is no clear boundary between these two phases, which can trap meta-heuristic algorithms in local optima owing to their stochastic nature and improper balancing between exploration and exploitation. Thus, many studies have been presented to improve the performance of meta-heuristic algorithms and overcome this problem, most of them employing chaos theory [4,5,6,7].

Chaos is one of the most common mathematical approaches recently employed to boost the performance of meta-heuristic algorithms. It describes the dynamic behavior of nonlinear systems [8]. Over the years, chaos has attracted great attention and has been applied in different fields such as chaos control [9], synchronization [10] and optimization research [11, 12]. Each hyperchaotic system has its own dynamic behavior, which suits it to certain applications [13]. The authors in [14] proposed a new chaotic complex model, a complex L\(\ddot {u}\) system in which a complex nonlinear expression is added to the third equation of the L\(\ddot {u}\) system, together with a complex antilag synchronization (CALS) scheme. They applied CALS to two identical chaotic complex systems with different initial values, and the results show the effectiveness of the proposed complex system. The authors in [15] proposed a fractional-order hyperchaotic system; the experimental results show its effectiveness in terms of equilibrium points, Lyapunov spectrum and attractor forms, and that it exhibits larger Lyapunov exponents than other hyperchaotic systems. The authors in [16] applied a ten-term chaotic system without equilibrium to multimedia security applications, including image encryption and sound steganography; their system is able to encrypt 128 kbit of data and hide it in sound files. The properties of chaos have also been embedded into the search mechanisms of meta-heuristic algorithms in different applications. The authors in [17] employed the logistic chaotic map with a Genetic Algorithm (GA) for image encryption: the logistic chaotic map encodes the initial image, after which the GA improves the encryption results.
The simulation results reveal the capability of their approach in resisting common attacks. The authors in [5] hybridized the chaos optimization algorithm (COA) and PSO; the experimental results show that embedding the logistic chaotic map can significantly improve the performance of PSO in terms of convergence rate and time oscillation. Another hybridization with chaotic maps is proposed in [6], where ten chaotic maps (Gauss/mouse, Chebyshev, logistic, iterative, piecewise, sine, singer, circle, sinusoidal and tent) are employed with Biogeography-Based Optimization (BBO). The simulation results show that the resulting chaotic biogeography-based optimization improves both the exploration and the exploitation of the original BBO. The authors in [18] employed the piecewise chaotic map within the search iteration of Harmony Search (HS) to minimize the makespan of the permutation flow shop scheduling problem with limited buffers; their chaotic harmony search outperforms the original HS and other meta-heuristic algorithms in terms of solution quality and robustness. A modified version of the Antlion Optimization Algorithm (ALO) based on chaotic maps, namely the Chaotic Antlion Optimization Algorithm (CALO), is proposed in [19] and applied to the feature selection problem using five chaotic maps: singer, piecewise, logistic, tent and sinusoidal. The experimental results reveal that CALO finds better feature subsets than the original ALO and other meta-heuristic algorithms. The authors in [7] proposed the Chaotic Crow Search Algorithm (CCSA) for the feature selection problem, employing ten chaotic maps.
The experimental results show the efficiency of CCSA in finding an optimal feature subset that maximizes classification performance while minimizing the number of selected features.

Feature selection is an essential task in machine learning. Data sets are growing in both the number of features and the number of instances, so removing irrelevant and noisy features is a challenging task, especially for large datasets. Feature selection algorithms have proved their efficiency in removing redundant features, reducing computational cost, improving classifier performance and reducing the required storage [20]. They have been successfully applied in many applications, including image retrieval [21], text categorization [22], customer relationship management [23] and diagnostic medical support systems [24, 25]. Feature selection can also influence clustering performance [20] and helps produce a better data interpretation and understanding than using all features. Several studies in the literature apply feature selection, such as [20, 26], and it has been shown that a cluster built from relevant features is more interpretable and practical than one built from all features, which can include noise and irrelevant features. Feature selection algorithms are divided into two families: wrapper-based and filter-based. Wrapper-based algorithms use machine learning algorithms for evaluation, while filter-based algorithms use statistical methods [27]. Wrapper-based algorithms have been shown to provide better results than filter-based algorithms, but they are computationally expensive. In addition, greedy strategies such as sequential forward selection (SFS) and sequential backward selection (SBS) suffer from entrapment in local optima [27]. Meta-heuristic algorithms, on the other hand, can search for the optimal feature subset through their agents.
Recently, meta-heuristic algorithms such as Moth-Flame Optimization (MFO) [28], the Gray Wolf Optimizer (GWO) [29], Particle Swarm Optimization (PSO) [30] and Artificial Bee Colony (ABC) [31] have been applied to the feature selection problem.

In this work, a novel hybridization of the Salp Swarm Algorithm (SSA) and chaos theory is proposed. SSA is one of the most recently proposed algorithms, introduced in 2017 by Mirjalili [32]. Its main inspiration is the swarming behavior of salps, members of the family Salpidae with transparent barrel-shaped bodies. The inventor of the original SSA showed its efficiency compared with other well-known meta-heuristic algorithms. However, slow convergence and entrapment in local optima remain possible problems, as with other meta-heuristic algorithms. As previously mentioned, chaos theory is one of the most common methods used to improve the performance of evolutionary algorithms, yet to the best of our knowledge no work has been proposed in the literature to improve the exploitation and exploration of SSA. The main contributions of this paper are summarized as follows:

  • A novel hybridization approach based on SSA and chaos theory is proposed.

  • The proposed approach is applied to 14 global optimization problems.

  • A binary version of CSSA is proposed and applied for feature selection problem, where 20 benchmark datasets are employed.

  • Ten most popular chaotic maps proposed in literature are employed and compared.

  • Several evaluation criteria are used: mean, standard deviation, p-values of the Wilcoxon rank sum test, trajectory, search history, average fitness of the whole population, convergence curves and average length of the selected feature subset.

  • The performance of the proposed approach is compared with the original SSA and five other well-known meta-heuristic algorithms: artificial bee colony (ABC), moth flame optimization (MFO), particle swarm optimization (PSO), chicken swarm optimization (CSO) and the gray wolf optimizer (GWO).

The rest of this paper is organized as follows. Section 2 provides a full description of the original SSA algorithm, covering its natural inspiration and mathematical model, and also introduces the basic chaotic concepts. The novel Chaotic Salp Swarm Algorithm (CSSA) is described in Section 3. Section 4 provides simulation results and discussion along with the benchmark datasets used. Finally, conclusions are presented in Section 5.

2 Basics and background

In this section, the basic knowledge about salp behavior and the optimization algorithm based on it is presented, followed by the basics of chaotic maps.

2.1 Salp swarm algorithm (SSA)

Inspiration analysis

The Salp Swarm Algorithm (SSA) is a meta-heuristic algorithm recently proposed by Mirjalili [32] in 2017. Its main inspiration is the swarming behavior of salps. Salps belong to the family Salpidae; they have transparent barrel-shaped bodies and tissues similar to those of jellyfish. They also move like jellyfish, pumping water through the body as propulsion to move forward. Salps sometimes form a swarm called a salp chain. The reason behind this behavior is not yet clear, though some studies suggest that salps form chains to achieve better locomotion through rapid foraging and coordinated changes.

Mathematical model of SSA

The population of SSA is divided into two groups: the leader and the followers. The leader is the salp at the front of the chain, while the remaining salps are the followers. Let dim be the number of variables (dimensions) of a given problem, y the position of a salp, and F the food source, which is the target of the swarm in the search space. The leader updates its position according to the following equation:

$$ {y}_{i}^{1} = \left\{ \begin{array}{ll} F_{i} + r_{1}((ub_{i}-lb_{i})r_{2}+lb_{i}), &\quad r_{3}\geq 0\\ F_{i}-r_{1}((ub_{i}-lb_{i})r_{2}+lb_{i}), &\quad r_{3}< 0 \end{array}\right. $$
(1)

where \({y}_{i}^{1}\) denotes the position of the first salp (the leader) in the ith dimension, lbi and ubi are the lower and upper boundaries of the ith dimension, respectively, Fi is the food position in the ith dimension, and r1, r2 and r3 are random numbers. r1 is responsible for balancing exploitation and exploration and is defined as follows:

$$ r_{1} = 2e^{-\left( \frac{4t}{T}\right)^{2}} $$
(2)

where t is the current iteration number and T is the maximum number of iterations. r2 and r3 are two random numbers uniformly generated in the range [0,1]. r3 indicates whether the next position should move in the negative or the positive direction. The followers update their positions according to Newton's law of motion using the following equation:

$$ {y}_{i}^{j} = \frac{1}{2} \alpha l^{2}+\beta_{0}l $$
(3)

where \({y}_{i}^{j}\) is the position of the jth follower in the ith dimension (j ≥ 2), β0 is the initial speed, \(\alpha =\frac {\beta _{final}}{\beta _{0}}\), \(\beta =\frac {y-y_{0}}{t}\) and l is time. Since time in an optimization process is measured in iterations, the discrepancy between iterations equals one. Setting β0 = 0, the updating of follower positions in the ith dimension can be represented as follows:

$$ {y}_{i}^{j} = \frac{1}{2}\left( {y}_{i}^{j}+{y}_{i}^{j-1}\right) $$
(4)
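To make the update rules above concrete, the following sketch implements one SSA iteration under (1), (2) and (4). It is an illustrative implementation, not the authors' code; in particular, since r3 is drawn uniformly from [0,1] while (1) compares it with 0, the sketch follows the common convention of thresholding r3 at 0.5.

```python
import numpy as np

def ssa_step(positions, food, lb, ub, t, T, rng):
    """One SSA iteration: leader updated by Eq. (1)-(2), followers by Eq. (4).
    `positions` has shape (n_salps, dim); row 0 is the leader."""
    n, dim = positions.shape
    r1 = 2.0 * np.exp(-(4.0 * t / T) ** 2)          # Eq. (2): decays over iterations
    new = positions.copy()
    for i in range(dim):                             # leader update, Eq. (1)
        r2, r3 = rng.random(), rng.random()
        step = r1 * ((ub[i] - lb[i]) * r2 + lb[i])
        new[0, i] = food[i] + step if r3 >= 0.5 else food[i] - step
    for j in range(1, n):                            # follower update, Eq. (4)
        new[j] = 0.5 * (positions[j] + positions[j - 1])
    return np.clip(new, lb, ub)                      # keep salps inside the bounds
```

Clipping to [lb, ub] after the update is a standard safeguard rather than part of the equations themselves.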

2.2 Chaotic maps

Most meta-heuristic algorithms have random parameters drawn from probability distributions, usually Gaussian or uniform. Recently, chaos theory has been used to enhance these parameters [33], since it shares the characteristics of randomness while offering better dynamical and statistical properties [34]. Chaos is a phenomenon in which any change in the initial condition may lead to a nonlinear change in future behavior. Three main properties describe chaos: (1) quasi-stochasticity, (2) ergodicity and (3) sensitivity to initial conditions. Together, these characteristics can guarantee the diversity of the generated solutions, and this diversity can be enough to reach every mode of a multi-modal problem [35]. Quasi-stochasticity is the ability to replace random variables with values of chaotic maps. Ergodicity refers to the ability of chaotic variables to visit all states within a certain range without repetition. Sensitivity to initial conditions means that any small change in the initial starting point may lead to different behavior. Combining these properties can significantly boost the performance of meta-heuristic optimization algorithms [36].

In this study, ten distinguished non-invertible maps with different characteristics are used. Table 1 shows these maps, where os denotes the sth number in the chaotic sequence and s is the index of the chaotic sequence o. The other parameters, including c, d and μ, are control parameters that determine the chaotic behavior of the dynamic system. As the table shows, chaotic maps have a deterministic form without random factors. In this work, the initial point o0 of all adopted chaotic maps is set to 0.7, the same value used for the same ten chaotic maps in [37].

Table 1 Definition of the ten adopted chaotic maps
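As an example of how such a map is iterated, the logistic map from Table 1 can be generated as follows. The initial point o0 = 0.7 matches the initialization used in this work; μ = 4 is the standard fully chaotic setting and is an assumption here, since Table 1's exact parameterization is not reproduced in the text.

```python
def logistic_map(o0=0.7, n=10, mu=4.0):
    """Generate n values of the logistic chaotic map o_{s+1} = mu * o_s * (1 - o_s)."""
    seq, o = [], o0
    for _ in range(n):
        o = mu * o * (1.0 - o)   # deterministic recurrence, no random factor
        seq.append(o)
    return seq
```

Note that the sequence is fully deterministic: rerunning with the same o0 reproduces it exactly, which is the "deterministic form without random factors" mentioned above.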

3 The novel chaotic salp swarm algorithm

In this section, the novel chaotic salp swarm algorithm (CSSA) is proposed, in which chaotic maps are employed to replace random variables with chaotic variables. The original SSA has three main parameters that affect its performance: r1, r2 and r3. As shown in (2), r1 decreases over the iterations, while r3 determines whether the next position should move in the negative or the positive direction. As can be seen from (1), r1 and r2 are the two main parameters influencing the updating of a salp's position; thus, they significantly impact the balance between exploration and exploitation. Exploration concentrates on finding new, better solutions by investigating the search space on a large scale, while exploitation concentrates on exploiting the information in the local region [7]. An algorithm should properly balance these two components to approximate the global optimum. In this study, chaotic maps are employed to adjust the r2 parameter of SSA. Equation (5) shows the updating of r2 according to the chaotic map, and (6) shows the resulting updated position of a salp, where ot is the value of the chaotic map at the tth iteration.

$$ r_{2}=o^{t} $$
(5)
$$ {y}_{i}^{t} = \left\{\begin{array}{ll} F_{i}+r_{1}((ub_{i}-lb_{i})o^{t}+lb_{i}), &\quad r_{3}\geq 0\\ F_{i}-r_{1}((ub_{i}-lb_{i})o^{t}+lb_{i}), &\quad r_{3}< 0 \end{array}\right. $$
(6)
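A minimal sketch of the chaotic leader update of (6), with the chaotic value ot standing in for the random r2 of (1). This is illustrative only; the sign choice follows the usual 0.5 threshold on r3, as in common SSA implementations.

```python
import numpy as np

def chaotic_leader_update(food, lb, ub, o_t, r1, rng):
    """CSSA leader update, Eq. (6): the random r2 of Eq. (1) is replaced
    by the chaotic value o_t drawn from the chosen map."""
    dim = len(food)
    leader = np.empty(dim)
    for i in range(dim):
        step = r1 * ((ub[i] - lb[i]) * o_t + lb[i])
        # r3 still decides the direction of the move (0.5 threshold)
        leader[i] = food[i] + step if rng.random() >= 0.5 else food[i] - step
    return leader
```

The only change relative to the original SSA leader update is that o_t is deterministic given the map and its initial point, while r3 remains random.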

Embedding chaotic maps into the salp position update can improve the performance and the convergence rate of SSA, as demonstrated in the following section. The mathematical formula of each chaotic map used is defined in Table 1. In this study, the proposed CSSA is applied not only to 14 unimodal and multimodal benchmark optimization problems but also to the feature selection problem. In the CSSA feature selection algorithm, the solution pool is restricted to discrete binary form, where salp positions are restricted to {0,1} [38]. Let a salp position (a solution in the search space) be expressed by a dim-dimensional variable y = [y1,y2,…,ydim], where dim is the number of dimensions. A variable value of 1 means that the corresponding feature is selected, while a value of 0 means it is not. For example, the feature representation for data with 5 features could be [1, 0, 0, 1, 1] or [0, 0, 1, 1, 1], and so on; each solution selects a different subset of features and hence a different number of them. Equation (7) shows how each agent is transferred from the continuous to the discrete binary space, where B is a random number in the range [0,1].

$$ {y}_{i}^{t} = \left\{\begin{array}{ll} 1 &\quad \text{if } s({y}_{i}^{t}) \geq B\\ 0 &\quad \text{otherwise} \end{array}\right. $$
(7)

where

$$ s({y}_{i}^{t}) = \frac{1}{1+e^{10\left( {y}_{i}^{t}-0.5\right)}} $$
(8)
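Equations (7) and (8) can be sketched as follows. Note that the sigmoid of (8) is decreasing, so strongly negative positions map to 1 and strongly positive ones to 0; the vectorized helper below is a hypothetical illustration, not the authors' code.

```python
import numpy as np

def binarize(y, rng):
    """Map a continuous salp position to {0,1} via Eq. (7)-(8)."""
    s = 1.0 / (1.0 + np.exp(10.0 * (y - 0.5)))   # Eq. (8): decreasing sigmoid, s(0.5) = 0.5
    B = rng.random(y.shape)                        # random threshold in [0, 1)
    return (s >= B).astype(int)                    # Eq. (7)
```

For example, a position component far below 0.5 yields s close to 1 and is almost always selected, while one far above 0.5 yields s close to 0 and is almost never selected.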

Next, the detailed description of the proposed chaotic version of the SSA algorithm is presented as follows:

3.1 Parameters initialization

In the beginning, CSSA starts with randomly initialized salp positions and sets the initial parameters. For the benchmark functions, the lower and upper boundaries are set according to the function used; for the benchmark datasets, the lower boundary is set to 0 and the upper boundary to 1. The maximum number of iterations is set to 500 for the global optimization problem and 30 for the feature selection problem. Finally, the population size is set to 50 for the global optimization problem and 20 for the feature selection problem. The population size and number of iterations are set low for the feature selection problem due to the complexity of its search space.

3.2 Fitness function

The fitness function is used to evaluate each solution (salp position). All the global benchmark problems used are minimization problems; thus, the solution with the minimum fitness value min(f(X)) is selected as the best solution obtained so far. For the feature selection problem, the optimal solution is the one that maximizes the classification accuracy while minimizing the number of selected features. Equation (9) shows the fitness function used for feature selection, where these two objectives are combined into one by a weight factor. In this equation, Acc is the classification accuracy obtained from a K-nearest neighbor (KNN) classifier with k = 3 and the mean absolute distance. KNN is a supervised machine learning algorithm that classifies a new instance based on its distance to the training instances [38]; here it indicates the goodness of the selected feature subset. The values of k and the distance method were chosen by trial and error. wf is the weight factor that controls the relative importance of the number of selected features and the classification accuracy. In this study, our goal is to maximize the classification accuracy first and then to minimize the number of selected features; thus, wf is set to 0.8. Lf denotes the length of the selected feature subset, while Lt denotes the total number of features of a given dataset.

$$ Fn_{t} = maximize \left( w_{f} \times Acc + (1-w_{f}) \times \left( 1-\frac{L_{f}}{L_{t}}\right)\right) $$
(9)
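The fitness of (9) reduces to a one-line combination of the two objectives. In the sketch below, the accuracy value would come from the 3-NN classifier described above; the function itself only illustrates the formula.

```python
def fitness(acc, n_selected, n_total, wf=0.8):
    """Fitness of Eq. (9): weighted sum of classification accuracy and the
    fraction of discarded features; higher is better. wf = 0.8 favors
    accuracy over subset size, as in this study."""
    return wf * acc + (1.0 - wf) * (1.0 - n_selected / n_total)
```

With equal accuracy, a solution that selects fewer features scores higher, which is exactly the tie-breaking behavior the weight factor is meant to provide.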

3.3 Positions updating

After the fitness of each salp is evaluated and the best salp position is selected, the salp positions are updated according to (6), (7), (3) and (4).

3.4 Termination criteria

The process of evaluating each salp and updating the positions is repeated until the maximum number of iterations is reached or the optimal solution is found early. In our case, the optimization process terminates when the maximum number of iterations is met: 500 for the global optimization problem and 30 for the feature selection problem.

The pseudo code of the proposed CSSA is defined at Algorithm 1.


4 Simulation results and discussions

In this section, the proposed CSSA algorithm is applied to two problems: global optimization and feature selection. Section 4.1 discusses the performance of the proposed CSSA in solving the global optimization problem and compares CSSA variants with different chaotic maps. Section 4.2 compares the performance of the proposed CSSA as a feature selection algorithm with other meta-heuristic feature selection algorithms. All experiments were performed on the same PC (Core i3 CPU, 2 GB RAM, Windows 7), and all results are reported as averages over 30 independent runs.

4.1 CSSA for global optimization problem

This subsection aims to evaluate the performance of the proposed CSSA, to compare CSSA variants with different chaotic maps, and to select the optimal chaotic map. Several measurements are used: mean, standard deviation (SD) and p-values of the Wilcoxon rank sum test, as well as the search history, the trajectory, the average fitness of the whole population and the convergence curve.

Due to the stochastic behavior of meta-heuristic optimization algorithms, a set of test functions is needed to prove the superiority and efficiency of an algorithm [39]. In this work, several experiments are conducted to evaluate the performance of the proposed CSSA using two benchmark categories. The mathematical definitions and characteristics of the benchmark functions are presented in Tables 12, 13, 14 and 15 in the Appendix. In these tables, lb denotes the lower boundary of the search space, ub the upper boundary, opt the optimal value of a function and dim the number of dimensions. For more information on these functions, the reader is referred to [40,41,42]. The first category, unimodal benchmark functions, has no local optima and a single global optimum; it is used to evaluate the exploitation of the algorithm. The second category, multimodal benchmark functions, has multiple local optima and, unlike the unimodal functions, multiple global optima; it is used to evaluate the exploration of the algorithm and its capability to avoid local optima.

Table 2 compares the performance of the original SSA with the different versions of CSSA on the unimodal benchmark functions in terms of mean and standard deviation. The notations CSSA1 to CSSA10 are defined in Table 1, and the best results are underlined. CSSA with the logistic chaotic map obtained the best mean values for three of the six benchmark functions: F1, F2 and F4. CSSA with the Gauss, piecewise and Chebyshev maps come in second place. Additionally, the proposed CSSA algorithms show higher stability than the original SSA, which is explained by the stochastic behavior of the SSA meta-heuristic, where random variables are included. Moreover, all versions of CSSA obtained better results than the original SSA. Table 3 shows the p-values obtained from Wilcoxon's rank sum test for the pair-wise comparison of the best values obtained over all iterations at a 5% significance level. This test determines whether the proposed CSSA algorithms provide a significant improvement over the original SSA. As can be observed, all CSSA versions are statistically significant. From these two tables, it can be concluded that embedding chaotic variables in the optimization process of SSA significantly boosts its exploitation and stability.

Table 2 Comparison of statistical results obtained using SSA and different versions of CSSA for unimodal benchmark functions
Table 3 Comparison of p-values of the Wilcoxon rank sum test obtained using SSA and different versions of CSSA over unimodal benchmark functions
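For reference, the rank sum test used throughout these tables can be sketched with a normal approximation. This is a simplified version (ties are not corrected for), shown with synthetic data rather than the paper's results.

```python
import math

def ranksum_p(x, y):
    """Two-sided p-value of the Wilcoxon rank-sum test via the normal
    approximation (adequate for ~30 runs per algorithm; ties ignored)."""
    n1, n2 = len(x), len(y)
    labeled = [(v, 0) for v in x] + [(v, 1) for v in y]
    labeled.sort()
    # rank sum of the first sample; ranks start at 1
    R1 = sum(r for r, (_, src) in enumerate(labeled, start=1) if src == 0)
    mu = n1 * (n1 + n2 + 1) / 2.0
    sigma = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12.0)
    z = (R1 - mu) / sigma
    return math.erfc(abs(z) / math.sqrt(2.0))   # 2 * (1 - Phi(|z|))
```

A p-value below 0.05 rejects the null hypothesis that the two algorithms' results come from the same distribution, which is the criterion applied in Tables 3, 5 and 11. In practice one would use a library routine such as `scipy.stats.ranksums`, which also handles ties.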

Table 4 compares the mean and standard deviation values of SSA with those of the proposed CSSA algorithms on the multimodal benchmark functions. Again, CSSA with the logistic chaotic map obtained the best results for most of the multimodal functions, achieving the best mean values for four of the seven functions (F9, F10, F12 and F13), while CSSA with the Gauss chaotic map obtained the optimal results for F13 and F14. Embedding the different chaotic maps in the optimization process of SSA again produced better results than SSA, and the proposed CSSA algorithms show higher stability. Table 5 shows the p-values of Wilcoxon's rank sum test; most of the obtained values are less than 0.05, meaning that the proposed CSSA algorithms are statistically significant. From these two tables, it can be concluded that embedding the characteristics of chaotic maps helps avoid local optima and reach the global optimum within a small number of iterations, thanks to a proper balance between exploitation and exploration. Based on all the produced results, we select the logistic chaotic map as the most appropriate map and evaluate it in more detail in what follows.

Table 4 Comparison of statistical results obtained using SSA and different versions of CSSA for multimodal benchmark functions
Table 5 Comparison of p-values of the Wilcoxon rank sum test obtained using SSA and different versions of CSSA over multimodal benchmark functions

Figure 1a shows the mathematical representation of three unimodal benchmark functions (F1, F3 and F7) and three multimodal benchmark functions (F9, F13 and F14). Figure 1 also shows the search history, the trajectory of the first salp in the first dimension and the average fitness of the whole population for CSSA with the logistic chaotic map. As can be observed from Fig. 1b, the salp positions are scattered around the optimal solution in the search space over the course of iterations. From Fig. 1c, it can be seen that salp individuals face abrupt changes in their trajectories in the initial iterations, and that these changes gradually decrease during the optimization process. Based on this behavior, a population-based algorithm can be guaranteed to converge to some point and search locally in the search space [43]. Figure 1d gives further evidence of the significant improvement and fast convergence of the salp individuals: the average fitness values gradually decrease, showing that the fitness of all individuals improves through the iterations thanks to a proper balance between the exploration and exploitation phases. Figure 1e draws the convergence curves of the original SSA and the proposed CSSA. CSSA converges faster than the original SSA and obtains better results, which is evidence of the superiority of the proposed CSSA in terms of stability and performance.

Fig. 1
figure 1

a Graphical representations of benchmark mathematical functions, b Search history of CSSA, c trajectory of CSSA in first dimension, d the average fitness of all group of CSSA, and e the convergence curve of CSSA and SSA

4.2 CSSA for feature selection

In this subsection, the performance of CSSA as a feature selection algorithm is evaluated and compared with other meta-heuristic feature selection algorithms. All the algorithms are applied to twenty benchmark datasets of different types, collected from the UCI machine learning repository [44]. Table 6 gives a brief description of each dataset. As the table shows, some of the adopted datasets contain missing values in their records. In this work, the median method is used to handle missing numeric values, replacing each missing value with the median of the known values of that feature within the same class. Equation (10) shows the mathematical representation of the median method, where Qi,j is the missing value of the jth feature in the ith record for class Cr. For missing categorical values, the most frequent value replaces the missing value.

$$ \overline{Q}_{i,j} = \underset{i:\,Q_{i,j}\in C_{r}}{\operatorname{median}}\; Q_{i,j} $$
(10)
Table 6 Datasets description
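The class-conditional median replacement of (10) can be sketched as follows. This is a hypothetical helper, with `None` marking a missing numeric entry of a single feature column.

```python
def impute_median(values, labels, target_class):
    """Replace missing entries (None) of one feature with the median of the
    known values of that feature within the same class, per Eq. (10)."""
    known = sorted(v for v, c in zip(values, labels)
                   if v is not None and c == target_class)
    mid = len(known) // 2
    median = known[mid] if len(known) % 2 else (known[mid - 1] + known[mid]) / 2.0
    return [median if (v is None and c == target_class) else v
            for v, c in zip(values, labels)]
```

Running this once per class and per feature fills all numeric gaps; categorical gaps would instead take the most frequent value, as stated above.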

Five evaluation criteria are used in this work: mean, standard deviation (SD), average feature subset length, p-values of the Wilcoxon rank sum test and the convergence curve.

Table 8 shows the statistical results of the proposed CSSA with the logistic chaotic map and other meta-heuristic algorithms previously proposed in the literature as feature selection algorithms; the best results are underlined. These algorithms are artificial bee colony (ABC) [45], moth flame optimization (MFO) [28], particle swarm optimization (PSO) [46], chicken swarm optimization (CSO) [47], the gray wolf optimizer (GWO) [48], the bird swarm algorithm [49], the crow search algorithm [7], the whale optimization algorithm [50] and the sine cosine algorithm [51]. In all experiments, we used the same fitness function with the same population size, number of dimensions, search boundary and maximum number of iterations, and all algorithms start from the same initial population, to ensure a fair comparison. Table 7 shows the remaining parameter settings of the competitor algorithms. As can be observed from Table 8, the proposed CSSA feature selection algorithm is very competitive: it obtains the highest mean fitness value for 12 of the 20 benchmark datasets, with WOA in second place with the top results for 6 of the 20. Embedding chaos theory in the search mechanism of SSA achieves higher mean fitness values with higher stability compared with the original SSA and the other meta-heuristic algorithms. In all cases, CSSA finds a better feature subset, maximizing the classification accuracy while minimizing the length of the selected feature subset.

Table 7 Meta-heuristic algorithms parameters settings
Table 8 Comparison of statistical results obtained using CSSA and most recent and popular meta-heuristics feature selection algorithms for 20 Datasets

Moreover, ABC and PSO obtain the worst results in most cases. This can be attributed to (1) improper balancing between exploitation and exploration and (2) improper parameter settings. As mentioned before, meta-heuristic algorithms are stochastic, so they have random parameters that greatly influence their performance; these random parameters can cause agents such as bees and moths to become stuck in local optima with high probability. Therefore, algorithms such as PSO and ABC need to be tuned first to obtain better results, and several studies have been proposed in the literature to improve their performance, such as [52,53,54]. The authors in [55] proposed a new initialization method and a new updating mechanism for PSO and applied the modified PSO to the feature selection problem. The results showed that their algorithm overcomes the drawbacks of the original PSO and obtains better feature subsets, maximizing the classification accuracy while minimizing the number of selected features. The authors in [56] proposed a modified version of PSO in which an adaptive factor, fa, is adopted to avoid local optima and prevent premature convergence. They applied their modified particle swarm optimization (MPSO) to a fractional-order proportional integral derivative (FOPID) controller. Table 9 shows the best controller parameters tuned by MPSO for two control design case studies, described in [56]. Compared with the original PSO, the results prove the stability and robustness of the MPSO-based FOPID controller regarding tracking errors for both smooth and non-smooth cart trajectories.

Table 9 The best controller parameters for two case studies

The authors in [57] applied their modified version of PSO to the multilevel thresholding problem. They used the two-dimensional (2D) Kullback–Leibler (K–L) divergence as the fitness function to find the optimal thresholds. Figure 2 summarizes the mean fitness values obtained for 3 and 5 segmentation levels in terms of Boundary Displacement Error (BDE), Probability Rand Index (PRI), Global Consistency Error (GCE) and Variation of Information (VOI). The authors adopted the Berkeley Segmentation Dataset and Benchmark (BSDS300) in their experiments. The simulation results showed that their hybrid of 2D K–L divergence with MPSO (2DKLMPSO) can efficiently overcome the premature convergence of PSO and reduce the time complexity of multilevel thresholding segmentation. In this work, we used the default parameter settings of all competitor algorithms, as our focus is on improving the performance of SSA itself.

Fig. 2
figure 2

Average performance indices of 2DKLMPSO over BSDS300 with 3-level and 5-level segmentation
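For intuition about the fitness measure used in [57], the following is a simplified one-dimensional sketch of K–L divergence between two normalized gray-level histograms; the paper itself uses a 2D formulation, and this sketch only illustrates the underlying measure, with a small epsilon assumed here to guard against empty bins.

```python
import math

def kl_divergence(p, q, eps=1e-12):
    """D(p || q) = sum_i p_i * log(p_i / q_i) for normalized histograms.

    p, q : sequences of bin probabilities of equal length, each summing to 1.
    eps  : small constant to avoid log(0) on empty bins (an assumption
           of this sketch, not part of [57]).
    """
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))
```

The divergence is zero when the two histograms coincide and grows as they diverge, which is what makes it usable as a thresholding fitness: candidate thresholds that separate the histogram into well-distinguished regions score higher.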

Table 10 compares the length of the selected feature subset, averaged over 30 independent runs. Equation (11) gives the mathematical formulation of the average feature subset length (ASS), where \(y_{best}^{i}\) is the best position obtained so far at the ith iteration and L is the number of features in the original dataset.

$$ ASS = \frac{1}{tMax} \sum\limits_{i = 1}^{tMax} \frac{length({y}_{best}^{i})}{L} $$
(11)
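Equation (11) can be sketched in a few lines; the names below (best_positions, n_features) are illustrative, not from the paper.

```python
def average_subset_length(best_positions, n_features):
    """ASS of Eq. (11): mean fraction of features selected per iteration.

    best_positions : list of length tMax, where entry i is the list of
                     feature indices in the best position at iteration i
    n_features     : L, the number of features in the original dataset
    """
    t_max = len(best_positions)
    return sum(len(y) for y in best_positions) / (t_max * n_features)
```

For example, best subsets of sizes 2 and 3 over two iterations on a 10-feature dataset give ASS = (2/10 + 3/10) / 2 = 0.25.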

As can be observed from Table 10, CSSA yields the minimum feature subset in four cases and ranks second or third in the remaining cases. From the results in Table 10, it can be observed that the proposed CSSA is very competitive compared with other popular and recent feature selection algorithms. Table 11 shows the p-values of the Wilcoxon rank-sum test comparing CSSA with the other meta-heuristic feature selection algorithms over the twenty benchmark datasets. As can be observed, the improvement of the proposed CSSA is statistically significant compared with the other algorithms, since most of the obtained p-values are less than 0.05. This can be considered sufficient evidence against the null hypothesis.

Table 10 CSSA vs other meta-heuristics feature selection algorithms for 20 Datasets in terms of average feature vector length
Table 11 CSSA vs other meta-heuristics feature selection algorithms for 20 Datasets in terms of p-values of the Wilcoxon ranksum test
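The significance test behind Table 11 can be sketched as follows: a two-sided Wilcoxon rank-sum test on per-run results, implemented here with the standard normal approximation (ties are not handled in this minimal sketch, and the sample data are synthetic, not the paper's results).

```python
import math

def rank_sum_p_value(a, b):
    """Two-sided Wilcoxon rank-sum p-value via the normal approximation.

    a, b : two independent samples (e.g. fitness values over repeated runs).
    """
    n1, n2 = len(a), len(b)
    pooled = sorted(a + b)
    # Rank sum of sample 'a' within the pooled, sorted data (1-based ranks).
    w = sum(pooled.index(v) + 1 for v in a)
    mu = n1 * (n1 + n2 + 1) / 2            # mean of W under the null
    sigma = math.sqrt(n1 * n2 * (n1 + n2 + 1) / 12)  # std dev under the null
    z = (w - mu) / sigma
    # Two-sided p-value from the standard normal CDF.
    return 2 * (1 - 0.5 * (1 + math.erf(abs(z) / math.sqrt(2))))
```

A p-value below 0.05, as in most entries of Table 11, rejects the null hypothesis that the two algorithms produce samples from the same distribution.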

For further evaluation of the proposed CSSA with the logistic chaotic map, the convergence curve for a single run is drawn for the twenty benchmark datasets, as shown in Fig. 3. In this figure, the proposed CSSA is marked with black circles on the yellow line. As can be observed, the proposed CSSA is superior, obtaining the highest fitness value in most cases; in the remaining cases, WOA performs best. In addition, the proposed CSSA converges faster than the other algorithms toward the optimal results. Moreover, the proposed CSSA always obtains better results than the original SSA. This improvement comes from embedding a chaotic sequence in the optimization process of SSA, which strongly affects its performance.
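The logistic chaotic map underlying this improvement can be sketched as below; the initial value and the way the sequence replaces a uniform random draw inside the salp update are illustrative assumptions, not the paper's exact CSSA formulation.

```python
def logistic_map(x0=0.7, mu=4.0, n=100):
    """Generate n values of the logistic map x_{k+1} = mu * x_k * (1 - x_k).

    With mu = 4 and x0 in (0, 1), the sequence is chaotic and stays in
    [0, 1], so its values can stand in for uniform random coefficients
    in a meta-heuristic's position-update equations.
    """
    seq, x = [], x0
    for _ in range(n):
        x = mu * x * (1.0 - x)
        seq.append(x)
    return seq
```

Because consecutive values are deterministic yet non-repeating and sensitive to the initial condition, substituting them for pseudo-random coefficients tends to spread the salps more evenly over the search space, which is the mechanism credited here for the faster convergence of CSSA.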

Fig. 3
figure 3

Performance comparison on D1 to D20

5 Conclusion

In this paper, a novel hybridization approach based on SSA and chaos theory is proposed. The effectiveness of ten chaotic maps in enhancing the performance of the original SSA is analyzed and compared. Fourteen benchmark global optimization functions and twenty benchmark datasets were employed. The experimental results show that the proposed chaotic version of SSA significantly boosts the performance of SSA in terms of both exploitation and exploration. Moreover, the results suggest that the logistic chaotic map is the best-performing map. Another finding of this paper is that embedding chaotic maps in the optimization process of SSA can find an optimal feature subset, which maximizes the classification accuracy while minimizing the number of selected features. Also, the performance of CSSA as a feature selection algorithm is compared with ten other meta-heuristic feature selection algorithms: SSA, ABC, PSO, GWO, MFO, CSO, BSA, CSA, WOA and SCA. The results show the superiority of the proposed CSSA algorithm.

In the future, more complex scientific and real-world problems will be considered for evaluating CSSA. Additionally, further chaotic maps are worth exploring within CSSA.