1 Introduction

Metaheuristic optimization algorithms have been proposed over the past decades and applied extensively to complicated problems [1, 2]. The essential target of optimization is to find values of the problem variables that minimize or maximize the objective function through global and local search [3, 4]. To reach state-of-the-art results on a given problem, most of such algorithms were designed as approximate techniques for attaining a near-optimal solution [5, 6]. A number of well-known nature-inspired algorithms include the Invasive Weed Optimization (IWO) [7], the Butterfly Optimization Algorithm (BOA) [8], the Artificial Bee Colony (ABC) [9], the Fruit Fly Optimization Algorithm (FOA) [10], the Firefly Algorithm (FA) [11], the Krill Herd (KH) algorithm [12], the Differential Evolution (DE) algorithm [13], the Flower Pollination Algorithm (FPA) [14], etc. The differences in their underlying natural inspiration are an essential reason why these algorithms deliver results of different quality [15, 16]. This factor may also explain why some algorithms can produce a good solution for specific problems while others cannot. Thus, because of this limitation, no single algorithm is good enough to solve every kind of problem.

During the past decades, a mathematical framework and scientific branch, namely chaos, has been developed and connected deeply with different scientific fields. Chaos involves three major dynamic properties: the quasi-stochastic property, sensitivity to initial conditions, and ergodicity. The application of chaos theory to optimization has attracted a lot of attention in recent years. The Chaotic Optimization Algorithm (COA) [17] is among the applications of chaos, and it exploits the nature of chaotic sequences. It has been shown that if random variables are replaced with chaotic variables, the performance of COA can be enhanced. Therefore, the literature contains a number of studies on hybridizing chaos with other algorithms in order to improve their performance. Some instances include the chaotic ACO [18], chaotic DE algorithm [19, 20], chaotic KH algorithm [21, 22], chaotic FPA [23], chaotic genetic algorithm [24, 25], chaotic PSO [26,27,28], chaotic gravitational search [29,30,31], chaotic bat algorithm [32], etc.

FS is the procedure of selecting a subset of features from an original feature set; it may be considered the most important pre-processing instrument for solving classification problems [33]. Finding a superior subset of features is a quite complicated challenge and is decisive for the final classification error rates. The finalized feature subset should retain a high classification accuracy. The purpose is to choose an applicable subset of d features from a set of D features (d < D) in a given dataset [34]. D comprises all features that are present in the dataset; it can contain redundant, noisy, and misleading features. An exhaustive search over the whole solution space usually takes a lot of time and often cannot be carried out in practice. To remedy this, FS strategies aim to maintain the best subset of d relevant features. Inappropriate features are not only useless, but can also worsen the classification performance. If irrelevant features are deleted, the computational efficiency can be improved and the classification accuracy increased.

According to the search techniques for feature subsets, current FS strategies can be classified into two classes: the filter-based approach and the wrapper-based approach. The filter method depends fundamentally on general characteristics of the dataset to assess and choose feature subsets, without involving a specific learning approach. Thus, the effectiveness of this methodology depends predominantly on the dataset itself instead of on the classifier [35, 36]. The wrapper method utilizes a classification algorithm to assess feature subsets and employs a search strategy to look for ideal subsets. It often leads to better results since the wrapper approach involves a classifier in the evaluation or search process [37].

Each meta-heuristic algorithm has a unique search strategy. Meta-heuristic algorithms can find optimal solutions based on their own strategies, such as balancing exploration and exploitation. Furthermore, the VSA has advantages such as a small number of parameters and easy implementation. In this work, the VSA is embedded with chaotic maps to obtain a better compromise between exploitation and exploration. This paper uses hybrid methods based on CMs with the VSA for FS. The major contribution of the current paper is a CM-based model of the VSA proposed to enhance the performance of the VSA. In the proposed methods, a chaotic search is followed to choose the ideal feature subset that maximizes the classification accuracy and minimizes the feature subset length. Ten one-dimensional CMs are adopted and exchanged with the random movement parameters of the VSA. The performance of the proposed methods is tested on 24 benchmark datasets. Similarly, the performance of the VSA is compared with that of seven other metaheuristic algorithms. Based on the mean criterion, the proposed method obtains better solutions using the Tent map in comparison with the other metaheuristic algorithms.

The main contributions of this paper are as follows:

  • VSA and Chaotic Maps are applied to FS.

  • The proposed method converges faster than the other algorithms and achieves better convergence results on different datasets.

  • The proposed method has been evaluated with 24 UCI standard datasets.

  • The best VSA variant is State2 with the VSAC101 mode, obtained by using the Tent map.

  • The proposed method has been tested on author identification datasets.

  • The obtained results confirmed the validity and superiority of the proposed method in comparison to other algorithms.

The organization of this paper is as follows: Sect. 2 gives related works about chaotic and FS. Section 3 provides an introduction to VSA. The detailed description of the proposed method has been provided in Sect. 4, while the experimental results and discussion of the proposed VSA have been provided in Sect. 5. In Sect. 6, the proposed method has been applied on a real application (i.e., author identification). Finally, the conclusion and future work have been discussed in Sect. 7.

2 Related works

The Moth Swarm Algorithm (MSA) is among the most recently developed nature-inspired heuristics for solving optimization problems. However, its shortcoming is a slow convergence rate, and chaos theory has been incorporated into it to eliminate this drawback. In [38], ten CMs were embedded within the MSA to find the ideal number of prospectors and thereby enhance the exploitation of the most promising solutions. The proposed method was applied to seven famous benchmark test functions. The simulation results showed that CMs can enhance the performance of the original MSA with regard to convergence speed. In addition, the sinusoidal map was found to be the best map for enhancing the performance of MSA.

The Cuckoo Search Algorithm (CSA) is a nature-inspired metaheuristic that imitates the obligate brood parasitic behavior of cuckoo species. The method has been proven to have promising performance in solving optimization problems. Chaotic mechanisms were incorporated into CSA to exploit the dynamic features of chaos theory and further improve its search performance. However, in the chaotic CSA (CCSA) [39], only one CM was applied in a single search per iteration, which limited the exploitation capability of the search. The researchers considered utilizing multiple CMs at the same time to perform the local search within the neighborhood of the global best solution found by CSA. To attain this goal, three kinds of multiple chaotic CSAs (MCCSA) were proposed by incorporating several CMs into the chaotic local search (CLS) in parallel, in a random or selective manner. The overall performance of MCCSA was validated using 48 widely used benchmark optimization functions. The experimental results indicated that MCCSAs are generally better than CCSAs, and that MCCSA-P, which uses the CMs in parallel, has the best quality among all sixteen variants of the CSAs.

In [40], a chaos-based Crow Search Algorithm (CCSA) was proposed to solve fractional optimization problems (FOPs). The proposed CCSA integrated chaos theory (CT) into the CSA to refine the global convergence speed and enhance the exploration/exploitation tendencies. CT was utilized to tune the standard CSA parameters, which yielded four versions, and the best chaotic variant was investigated. The incorporation of CT improved the overall performance of the proposed CCSA and allowed the search process to run at higher speed. The performance of the CCSA method was demonstrated on twenty fractional benchmark problems. Furthermore, it was tested on a fractional economic-environmental power dispatch problem by attempting to minimize the ratio of the total emissions to the total fuel cost. Ultimately, the proposed CCSA was compared with the PSO, standard CSA, FA, Dragonfly Algorithm (DA), and GWO. In addition, the efficiency of the proposed CCSA was justified by the non-parametric Wilcoxon signed-rank test. The experimental results showed that the proposed CCSA performs better than similar algorithms with regard to efficiency and reliability.

In [41], a new hybrid algorithm for solving optimization problems based on chaotic ABC and chaotic simulated annealing was proposed. The chaotic ABC explores new locations chaotically, and chaos may additionally improve the exploration of the search space. In effect, the proposed hybrid method combines the local search accuracy of simulated annealing with the global search capabilities of ABC. Moreover, the authors used a distinct method for producing the initial population. The initial population is of great significance for population-based techniques, because it directly influences the rate of convergence and the quality of the results. The method was evaluated on 12 benchmark functions, and the results were compared with those of the artificial bee colony algorithm, the hybrid algorithm of ABC and simulated annealing, and PSO. Simulation results demonstrate the performance of the proposed method.

In [42], an adaptive chaotic Bacterial Foraging Optimization (BFO) is presented. The improved BFO consists of two new features: an adaptive chemotaxis step setting and a chaotic perturbation operation in all chemotactic events. The former yields a fast convergence rate and acceptable convergence accuracy, while the latter allows the search to avoid local optima and attain better convergence accuracy. First, an adaptive exponentially decreasing chemotaxis step is presented, in which the natural exponential function variable is a function of the iteration count and of the nutritive ratio between the current bacterium position and the best bacterium position in each iteration. Second, when each bacterium reaches a new position through swim behavior, a chaotic perturbation is applied to avoid entrapment in local optima. On five benchmark functions, chaotic BFO is shown to perform better than the original BFO and BFO with a linearly decreasing chemotaxis step (BFO-LDC).

Jia et al. [43] proposed an effective memetic DE algorithm (DECLS), which makes use of a CLS with a 'shrinking' strategy; the shrinking strategy for the CLS search space was introduced in that paper. In addition, the local search length was determined dynamically according to the fitness feedback of the objective functions in order to save function evaluations. Furthermore, the parameter settings of the DECLS were adapted during evolution so as to further enhance the optimization efficiency. The combination of DE with a CLS and a parameter adaptation mechanism is very reasonable: the CLS helps improve the optimization performance of the canonical DE by exploring a large search space in the early phases, so as to avoid premature convergence, and by exploiting a small region in later phases to refine the final solutions, whereas the parameter adaptation improves the global optimization quality. To assess the efficiency and effectiveness of the proposed DECLS algorithm, it was compared with four state-of-the-art DE variants and the IPOP-CMA-ES algorithm on a set of 20 selected benchmark functions. The findings showed that the DECLS is significantly superior, or at least comparable, to the other optimizers with regard to convergence performance and solution accuracy. Furthermore, the DECLS was shown to have certain advantages in solving high-dimensional problems.

In [44], a modified DE algorithm based on Opposition-based Learning (OBL) and a chaotic sequence, named the OBL Chaotic DE (OBL-CDE), was proposed. The proposed OBL-CDE algorithm differs from the basic DE in two ways: the first is the generation of the initial population following the OBL rules, while the second is the dynamic adaptation of the scaling factor F through the chaotic sequence. The numerical results obtained by the OBL-CDE on 18 benchmark functions, compared to the results of the DE and opposition-based DE algorithms, indicated that the OBL-CDE is capable of finding superior solutions while maintaining reasonable convergence rates.

The standard Glowworm Swarm Optimization (GSO) shows poor global search ability and easily gets trapped in local optima. A Quantum GSO algorithm based on CMs was proposed [45] in order to solve such problems. First of all, a chaotic sequence was generated to initialize the population. This results in a higher probability of covering more local optimal areas and provides the ground for further optimization and tuning. Next, quantum behavior was applied to the elite population, which made it possible for individuals to locate any position of the solution space randomly with a certain probability; this greatly enhanced the capability of the algorithm in global search and in avoiding local optima. Finally, it adopted single-dimension loop swimming instead of the original fixed-step movement mode. This not only improved the solution precision and convergence speed, but also removed the GSO's excessive sensitivity to the step size, indirectly enhancing the robustness of the algorithm. The simulation results indicated that the proposed method was feasible and effective.

The Fruit Fly Optimization Algorithm (FOA) has recently been proposed as a metaheuristic technique inspired by the behavior of fruit flies. Mitic et al. [46] improved the standard FOA by introducing a novel parameter in combination with chaos. The performance of this chaotic FOA (CFOA) was studied on ten famous benchmark problems using 10 different CMs. In addition, comparison studies with the basic FOA, FOA with Levy flight distribution, and other recently published chaotic algorithms were made. Statistical findings on each optimization task showed that the CFOA achieves a very high convergence rate. In addition, CFOA was compared with recently developed chaos-enhanced algorithms such as the chaotic bat algorithm, chaotic accelerated PSO, chaotic FA, chaotic ABC, and chaotic CSA. The findings generally indicate that FOA with the Chebyshev map shows superiority over similar methods in terms of the reliability of global optimality and the algorithm success rate.

In addition, Gandomi et al. [47] proposed a chaos-enhanced version of the accelerated PSO. Some other instances of chaos-enhanced metaheuristic algorithms include the chaotic Genetic Algorithm [48], Chaotic PSO [49, 50], Chaotic Salp Swarm Algorithm [51], Chaotic Elephant Herding Optimization (EHO) algorithm [52], Chaotic Bat Algorithm [53], Chaotic FOA [46], Chaotic GSO Algorithm [45, 54], Chaotic Black Hole algorithm [55], Chaotic Simulated Annealing PSO Algorithm (CSAPSO) [56], Chaotic Social Spider Optimization Algorithm [57], Chaotic Bean Optimization Algorithm [58], Chaotic Quantum CSA [59], Chaotic Antlion Algorithm [60], Chaotic Hybrid Cognitive Optimization Algorithm [61], Chaotic Simulated Annealing [62], Chaotic-Based Quantum Genetic Algorithm [63], Chaotic Teaching Learning Algorithm [64], Chaotic DE algorithm [65], Chaotic Grey Wolf Optimization Algorithm [66], Chaotic Fractal Search [67], Chaotic Brain Storm Optimization Algorithm [68], Multi-Objective CCSA [69], Chaotic Grasshopper Optimization Algorithm [70], Chaotic Krill Herd [21, 71, 72], Chaotic DE [73], Chaotic Firefly Algorithm [74, 75], Chaotic Starling PSO Algorithm [76], Chaotic CCSA [77], Chaotic Grey Wolf Optimization Algorithm [78], etc. Table 1 shows a comparison of different models of meta-heuristic algorithms based on chaotic maps.

Table 1 A Comparison of Different Models of Meta-heuristic Algorithms based on Chaotic Maps

3 Vortex search algorithm

The VSA is a recent metaheuristic optimization algorithm inspired by the vortex pattern created by the vertical flow of stirred fluids. Its process consists of simple generation phases, similar to other single-solution algorithms. At each iteration, the VSA generates a new population of candidate solutions entirely from the current single solution. Balancing exploration and exploitation of the search space during every update is an essential part of a single-solution algorithm. In the VSA, this balance is achieved by using a vortex-like search pattern, which is simulated through a number of nested circles. The details of the VSA can be briefly described in four steps as follows [79].

3.1 Generating the initial solution

The preliminary procedure initializes the 'center' μ0 and the 'radius' r0. In this phase, the initial center (μ0) can be calculated using Eq. (1).

$$ \mu_{0} = \frac{upperlimit + lowerlimit}{2} $$
(1)

where \(upperlimit\) and \(lowerlimit\) are the bound constraints of the problem, defined as vectors in the d × 1 dimensional space. In addition, the initial radius r0, denoted σ0, is generated with Eq. (2).

$$ \sigma_{0} = \frac{{\max \left( {upperlimit} \right) - {\text{min}}\left( {lowerlimit} \right)}}{2} $$
(2)

3.2 Generating the candidate solutions

The procedure of producing candidate solutions renders the generation of populations \(C_{t} \left( s \right)\) at each iteration, where t is the iteration index. The candidate solutions are randomly generated around the initial center μ0 by using a Gaussian distribution, where \(C_{0} \left( s \right) = \left\{ {s_{1} ,s_{2} , \ldots ,s_{m} } \right\},\; m = 1,2,3, \ldots ,n\) represents the solutions and n is the total number of candidate solutions. The multivariate Gaussian distribution is given in Eq. (3).

$$ p\left( {x|\mu ,\Sigma } \right) = \frac{1}{{\sqrt {\left( {2\pi } \right)^{d} \left| \Sigma \right|} }}\exp \left\{ { - \frac{1}{2}\left( {x - \mu } \right)^{T} \Sigma^{ - 1} \left( {x - \mu } \right)} \right\} $$
(3)

In Eq. (3) d indicates the dimension, while x is the d × 1 vector of a random variable, μ indicates the d × 1 vector of the sample mean (i.e., center), and Σ indicates the covariance matrix. Equation (4) indicates that when the diagonal elements (i.e., variances) of the Σ values are equal and the off-diagonal elements (i.e., covariance) equal zero (uncorrelated), the resulting shape of the distribution will be spherical. Thus, the value of Σ is computed through utilizing equal variances with zero covariance.

$$ \Sigma = \sigma^{2} \cdot \left[ I \right]_{d \times d} $$
(4)

where, in the representation of Eq. (4), σ2 is the variance of the distribution, I represents the \(d \times d\) identity matrix, and σ0 is the initial radius (r0), as can be seen in Eq. (2).
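As an illustration of Eqs. (1)–(4), the following sketch draws an initial candidate set around the initial center. The function and variable names, as well as the two-dimensional example bounds, are ours and not part of the original description.

```python
import numpy as np

def initial_center_and_radius(lower, upper):
    mu0 = (upper + lower) / 2.0                      # Eq. (1)
    sigma0 = (np.max(upper) - np.min(lower)) / 2.0   # Eq. (2)
    return mu0, sigma0

def generate_candidates(mu, sigma, n_candidates):
    d = mu.shape[0]
    cov = (sigma ** 2) * np.eye(d)                   # Eq. (4): spherical covariance
    return np.random.multivariate_normal(mu, cov, size=n_candidates)  # Eq. (3)

lower = np.array([-5.0, -5.0])
upper = np.array([5.0, 5.0])
mu0, sigma0 = initial_center_and_radius(lower, upper)
C0 = generate_candidates(mu0, sigma0, n_candidates=50)   # C_0(s)
```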

3.3 Replacement of the current solution

The replacement of the current solution is conducted during the selection process. The best solution \(s^{\prime} \in C_{0} \left( s \right)\) is selected and memorized from \(C_{0} \left( s \right)\) in order to replace the current circle center (μ0). Before the selection process, it must be ensured that the candidate solutions are inside the search space (Eq. (5)).

$$ s_{k}^{i} = \left\{ {\begin{array}{*{20}l} {rand \cdot \left( {upperlimit^{i} - lowerlimit^{i} } \right) + lowerlimit^{i} ,} & {s_{k}^{i} < lowerlimit^{i} } \\ {s_{k}^{i} ,} & {lowerlimit^{i} \le s_{k}^{i} \le upperlimit^{i} } \\ {rand \cdot \left( {upperlimit^{i} - lowerlimit^{i} } \right) + lowerlimit^{i} ,} & {s_{k}^{i} > upperlimit^{i} } \\ \end{array} } \right. $$
(5)

where \(k = 1, 2, \ldots , n\), \(i = 1, 2, \ldots , d\), and rand is a uniformly distributed random number. The VSA uses \(s^{\prime}\) as the new center, reduces the vortex size, and generates the next solutions around it using Eq. (3). Thus, the new set of solutions \( C_{1} \left( s \right)\) can be generated. If the chosen solution is better than the best solution found so far, it is taken as the new best solution and memorized.
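A minimal sketch of this replacement step, assuming a minimization problem: components that violate Eq. (5) are re-sampled uniformly inside the bounds, and the best candidate becomes the new center \(s^{\prime}\). The names are illustrative.

```python
import numpy as np

def shift_into_bounds(candidates, lower, upper):
    """Eq. (5): re-sample out-of-range components uniformly inside the bounds."""
    out = (candidates < lower) | (candidates > upper)
    resampled = np.random.rand(*candidates.shape) * (upper - lower) + lower
    return np.where(out, resampled, candidates)

def select_new_center(candidates, objective):
    """Pick the best candidate (assuming minimization) as the new center s'."""
    fitness = np.apply_along_axis(objective, 1, candidates)
    return candidates[np.argmin(fitness)]
```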

3.4 The radius decrement process

In the VSA, the inverse incomplete gamma function is applied for the purpose of decreasing the radius value during each iteration pass. The incomplete gamma function provided in Eq. (6) often arises in probability theory, especially in applications that involve the chi-square distribution.

$$ \gamma \left( {x,a} \right) = \int_{0}^{x} {e^{ - t} t^{a - 1} dt} ,\quad a > 0 $$
(6)

where a > 0 is the shape parameter while x ≥ 0 is a random variable. Similar to the incomplete gamma function, its complementary \(\Gamma \left( {x,a} \right)\) is usually also introduced (Eq. (7)); the two functions satisfy \(\gamma \left( {x,a} \right) + \Gamma \left( {x,a} \right) = \Gamma \left( a \right)\), where \(\Gamma \left( a \right)\) is the complete gamma function.

$$ \Gamma \left( {x,a} \right) = \int_{x}^{\infty } {e^{ - t} t^{a - 1} dt} ,\quad a > 0 $$
(7)
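The following is a hedged sketch of the radius-decrement step. The schedule (resolution x = 0.1 and a shape parameter that decreases linearly over the iterations) follows our reading of the original VSA [79] and should be treated as an assumption; scipy's gammaincinv inverts the regularized lower incomplete gamma function.

```python
from scipy.special import gammaincinv

def radius_at_iteration(sigma0, t, max_iter, x=0.1):
    # shape parameter decreases from ~1 towards 0 as iterations proceed;
    # keep it strictly positive to stay in the valid domain
    a_t = max(1.0 - t / max_iter, 1e-3)
    # gammaincinv(a, y) returns v such that P(a, v) = y, i.e. it inverts
    # the regularised lower incomplete gamma function
    return sigma0 * (1.0 / x) * gammaincinv(a_t, x)

# the radius shrinks from roughly sigma0 at t = 0 towards 0 at the last iteration
```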

Table 2 describes the pseudocode of the VSA.

Table 2 A description of the VSA algorithm

4 Proposed methods

In this section, the hybrid form of the VSA and CMs will be explained. The basic form of the VSA rests on two key quantities: the center and the radius. First, the center is the current position from which the VSA is evaluated within the problem search space as iterations pass; the best solution found so far is used as the 'center' that replaces the current position of the population. Secondly, the 'radius' is the mechanism that simplifies the problem, turning a large-radius problem into a small-radius one as the search proceeds. In addition, a Gaussian distribution is used in the VSA to balance exploration and exploitation at every iteration. However, the VSA uses only a single center, i.e., a single strategy to generate candidate solutions around the current best solution. As a consequence, the VSA can miss the global optimum when it faces problems that have several local minima. At the same time, the radius used to update the best solution decreases with the iterations through the Gaussian distribution, which makes it easier for the VSA to become trapped in local optima. These are the main drawbacks of the VSA. The present study focuses on hybridizing the VSA with CMs. This hybridization is referred to as the chaotic VSA, in which 10 CMs are used. These 10 maps are used in three different locations of the VSA [74]. Figure 1 shows the flowchart of the proposed method. In the first step, the parameters are initialized. In the second step, Eqs. (9), (10), and (11) of the VSA are optimized based on the chaotic maps for FS. In the third step, the samples are classified and, at the end, the accuracy percentage is displayed.

Fig. 1 Flowchart of the proposed method

In the proposed model, we combine the CM formulas of Table 3 with Eqs. (3), (5), and (6). The goal is to find the best CMs to optimize the VSA. These places can be expressed as follows:

Table 3 CMs and proposed methods
State 1:

The production of candidate solutions inside the search circle [Eq. (9)].

State 2:

If a solution is out of range, the chaotic maps are used to move it back into the desired range [Eq. (10)].

State 3:

The search radius is reduced using the inverse incomplete gamma function and CMs [Eq. (11)].

In Table 3, the CM formulas and the proposed methods are shown. The optimization of the VSA is carried out using three methods (State1, State2, and State3). In each method, 10 CMs are used. Thus, in each run there are 30 different modes for a given dataset.

Chaos is described as a phenomenon in which any change in the initial conditions may cause a non-linear change in future behavior. Chaos optimization is one of the optimization models for search algorithms. The primary idea behind it is to map parameters/variables from the chaotic space to the solution space. Its search for the global optimum relies on properties of chaotic motion such as ergodicity, regularity, and quasi-stochasticity. The major advantages of chaos are fast convergence and the ability to avoid local minima. CMs are deterministic; no random factors are applied. In this paper, 10 well-known non-invertible one-dimensional maps were adopted to obtain chaotic sets. The adopted CMs are defined in Table 3, where q denotes the index of the chaotic sequence p, and \(p_{q}\) is the \(q^{th}\) number in the chaotic sequence. The remaining parameters, including d, c, and μ, are the control parameters determining the chaotic behavior of the dynamic system. The initial point p0 was set to 0.7 for all CMs, as the initial values of CMs may greatly influence their fluctuation patterns. In this paper, ten different CMs were applied for the optimization process. These maps are Chebyshev, circle, gauss/mouse, iterative, logistic, piecewise, sine, singer, sinusoidal, and tent [74].
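For illustration, the sketch below iterates two of the maps from Table 3 (logistic and tent) from the paper's initial point p0 = 0.7; the exact map constants used here are common textbook values and may differ slightly from Table 3.

```python
def logistic_map(p):
    return 4.0 * p * (1.0 - p)

def tent_map(p):
    # a tent-map form often used in chaotic metaheuristics; note that a start
    # of exactly 0.7 collapses this particular form, so perturb p0 slightly for it
    return p / 0.7 if p < 0.7 else (10.0 / 3.0) * (1.0 - p)

def chaotic_sequence(map_fn, length, p0=0.7):
    seq, p = [], p0
    for _ in range(length):
        p = map_fn(p)
        seq.append(p)
    return seq

cm = chaotic_sequence(logistic_map, 100)   # chaotic values in (0, 1)
```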

Descriptions of State 1, State 2, and State 3 are as follows:

State 1:

The VSA generates candidate solutions using just a single 'center' (μ). The center is then transformed into a new center as iterations pass, within the upper and lower bounds of the problem. This mechanism has some drawbacks; one of them is that the VSA tends to become trapped in local minima when facing problems with many local minima. To overcome this, the chaotic generation of candidate solutions for the VSA was proposed.

In this method, chaotic maps are used to generate the candidate solutions. Several neighbor solutions \( C_{t} \left( s \right)\) (t indicates the iteration index; t = 0 at the initial stage) are generated randomly around the initial center µ0 in the d-dimensional space by using a Gaussian distribution and CMs. Here, \(C_{0} \left( s \right) = \left\{ {s_{1} ,s_{2} , \ldots ,s_{m} } \right\},\; m = 1,2,3, \ldots ,n\) represents the solutions, and n represents the total number of candidate solutions. Equation (9) gives the formula of the proposed method.

$$ p\left( {x|\mu } \right) = \frac{1}{{\sqrt {\left( {2\pi } \right)^{d} } }}\exp \left\{ { - \frac{1}{2}\left( {cm - \mu } \right)^{T} \Sigma^{ - 1} \left( {cm - \mu } \right)} \right\} $$
(9)

where d represents the dimension, cm is the \(d \times 1\) vector of chaotic map values, µ is the d × 1 vector of the sample mean (center), and Σ is the covariance matrix.
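One plausible reading of Eq. (9) is sketched below: the chaotic sequence scales the Gaussian offsets when candidates are drawn around the current center. The exact coupling between the map value cm and the Gaussian term is our assumption, not a statement of the original implementation.

```python
import numpy as np

def generate_candidates_state1(mu, sigma, n_candidates, cm_values):
    d = mu.shape[0]
    offsets = np.random.multivariate_normal(np.zeros(d),
                                            (sigma ** 2) * np.eye(d),
                                            size=n_candidates)
    # each Gaussian offset is scaled by a chaotic value instead of a uniform one
    cm = np.asarray(cm_values[:n_candidates]).reshape(-1, 1)
    return mu + cm * offsets
```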

State 2:

If a solution is out of range, the chaotic maps are used to move it back into the desired range. During the selection phase, the best solution \(s^{\prime} \in C_{0} \left( s \right)\) is selected and memorized from C0(s) in order to replace the current circle center µ0. Before the selection phase, it must be ensured that the candidate solutions are inside the search boundaries. To attain this goal, the solutions that exceed the boundaries are shifted back into the boundaries, as in Eq. (10), which describes the VSA combined with chaotic sequences. In Eq. (10), \(Cm_{i}\) is the value of the chaotic map at the corresponding iteration.

$$ s_{k}^{i} = \left\{ {\begin{array}{*{20}l} {Cm_{i} \cdot \left( {{\text{upperlimit}}^{i} - {\text{lowerlimit}}^{i} } \right) + {\text{lowerlimit}}^{i} ,} & {s_{k}^{i} < {\text{lowerlimit}}^{i} } \\ {s_{k}^{i} ,} & {{\text{lowerlimit}}^{i} \le s_{k}^{i} \le {\text{upperlimit}}^{i} } \\ {Cm_{i} \cdot \left( {{\text{upperlimit}}^{i} - {\text{lowerlimit}}^{i} } \right) + {\text{lowerlimit}}^{i} ,} & {s_{k}^{i} > {\text{upperlimit}}^{i} } \\ \end{array} } \right. $$
(10)
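A short sketch of Eq. (10): out-of-range components are shifted back using the chaotic value \(Cm_{i}\) in place of the uniform random number of Eq. (5). The names are illustrative.

```python
import numpy as np

def shift_into_bounds_chaotic(candidates, lower, upper, cm_value):
    out = (candidates < lower) | (candidates > upper)
    shifted = cm_value * (upper - lower) + lower   # Eq. (10)
    return np.where(out, shifted, candidates)
```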
State 3:

The search radius is reduced using the inverse incomplete gamma function and CMs. In the VSA, the inverse incomplete gamma function is used to decrease the value of the radius at each iteration. The chaotic version of the incomplete gamma function is given in Eq. (11).

$$ \gamma \left( {x,a} \right) = \int_{0}^{cm} {e^{ - t} t^{a - 1} dt} ,\quad a > 0 $$
(11)

where a > 0 is known as the shape parameter and cm ≥ 0 is a CMs variable.
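A sketch of Eq. (11), under the same assumptions as the radius sketch in Sect. 3.4: the chaotic value cm replaces the fixed resolution parameter when the radius is shrunk.

```python
from scipy.special import gammaincinv

def radius_state3(sigma0, t, max_iter, cm_value):
    a_t = max(1.0 - t / max_iter, 1e-3)
    cm = min(max(cm_value, 1e-3), 1.0 - 1e-3)   # keep the chaotic value inside (0, 1)
    return sigma0 * (1.0 / cm) * gammaincinv(a_t, cm)
```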

In the current study, the chaotic VSA has been implemented as an FS algorithm based on the wrapper method. In VSA, a chaotic sequence is embedded in the search iterations, and the optimal feature subset that describes the dataset is selected using VSA. The FS strategy is aimed at improving the classification efficiency, reducing the length of feature subset, and reducing the computational costs.

4.1 Fitness function

At each iteration, every position is evaluated using a special fitness function Fit. The data are randomly divided into separate parts, namely training and testing datasets, using the m-fold technique. Two criteria are used for the assessment: the classification accuracy and the number of selected features. The adopted fitness function combines the two criteria into one by means of a weight factor, as in Eq. (12). a is the classification accuracy, calculated by dividing the number of correctly classified instances by the total number of instances. K-nearest neighbor (KNN) [80] is the classifier used, with k equal to three and the absolute (Manhattan) distance. KNN is a supervised learning algorithm that classifies a new instance based on its distances to the training samples. The KNN classifier predicts the class of the testing sample by calculating and sorting the distances between the testing sample and each of the training samples. This process is repeated until each datum in the dataset has been selected once as the testing sample. The classification accuracy of a feature subset is the ratio of the number of samples that have been predicted correctly to the number of all samples. In this paper, KNN has been used for determining the fitness of the selected features. The values of K and the distance measure were decided by trial and error. Ls is the length of the selected feature subset, Ln is the total number of features, and β is the weight factor, which takes a value in [0, 1]. β controls the relative importance of the classification accuracy and the number of selected features. Since improving accuracy is the primary goal for any classifier, the weight factor is usually set to values near 1 [81]. In this paper, β was set to 0.8. The best solution maximizes the classification accuracy and minimizes the number of selected features [81].

$$ Fit = maximize\left( {a + \beta \times \left( {1 - \frac{{L_{s} }}{{L_{n} }}} \right)} \right). $$
(12)
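A hedged sketch of this wrapper fitness: the accuracy of a 3-NN classifier is combined with a penalty on the subset length (β = 0.8 as in the paper). The cross-validation details are illustrative; the paper's m-fold split may differ.

```python
import numpy as np
from sklearn.model_selection import cross_val_score
from sklearn.neighbors import KNeighborsClassifier

def fitness(feature_mask, X, y, beta=0.8, folds=5):
    selected = np.flatnonzero(feature_mask)
    if selected.size == 0:
        return 0.0                               # reject empty subsets
    knn = KNeighborsClassifier(n_neighbors=3)
    acc = cross_val_score(knn, X[:, selected], y, cv=folds).mean()   # term a
    ls, ln = selected.size, X.shape[1]           # L_s and L_n
    return acc + beta * (1.0 - ls / ln)          # Eq. (12), larger is better
```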

5 Result and discussion

In this section, first a summary of the main characteristics of the implemented datasets will be discussed. Second, the proposed methods (State1, State2 and State3) using different CMs will be investigated. Third, comparisons will be made between VSA and the proposed method based on FS. Finally, to emphasize the advantages of the proposed method compared to other algorithms, different experiments will be described and the obtained results will thoroughly be discussed.

5.1 Datasets description

Twenty-four benchmark datasets of different types, including medical/biology and business, were used in the experiments. Four datasets (21, 22, 23, and 24) are related to the identification and classification of text authors. The datasets were collected from the UCI machine learning repository [82]. A short description of each of the adopted datasets is presented in Table 4. As can be observed, the used datasets contain missing values in some records. In the current study, every missing numerical value was replaced by the median of all known values of that feature within the same class. The mathematical definition of the median method is given in Eq. (13), where \(s_{i,j}\) is the missing value of the \(j^{th}\) feature of the \(i^{th}\) sample of a given class W. For missing categorical values, the most frequent value of the feature within the given class replaces the missing value.

$$ \overline{s}_{i,j} = \mathop {median}\limits_{{i:\,s_{i,j} \in W}} \left( {s_{i,j} } \right) $$
(13)
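A small sketch of this class-wise median imputation: a missing value of a feature is replaced by the median of the known values of that feature within the same class. The pandas usage and names are illustrative.

```python
import pandas as pd

def impute_by_class_median(df, class_col):
    feature_cols = [c for c in df.columns if c != class_col]
    df[feature_cols] = (df.groupby(class_col)[feature_cols]
                          .transform(lambda s: s.fillna(s.median())))
    return df
```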
Table 4 Dataset description

Four statistical measurements—the worst, the best, and the mean fitness value, together with the standard deviation (SD)—were adopted. In the current study, these measures were used to evaluate the performance of each CM and to determine the best one. The worst, the best, the mean fitness value, and the SD are mathematically defined as follows:

$$ Best = \mathop {\max }\limits_{i = 1}^{tMax} BS_{i} $$
(14)
$$ Worst = \mathop {\min }\limits_{i = 1}^{tMax} BS_{i} $$
(15)
$$ Mean = \frac{1}{tMax}\mathop \sum \limits_{i = 1}^{tMax} BS_{i} $$
(16)
$$ SD = \sqrt {\frac{{\mathop \sum \nolimits_{i = 1}^{tMax} \left( {BS_{i} - Mean} \right)^{2} }}{tMax}} $$
(17)

where \(BS_{i}\) is the best score obtained so far at iteration i and tMax is the maximum number of iterations.
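The four statistics of Eqs. (14)–(17) reduce to simple aggregates over the collected best scores; a short numpy version is sketched below with illustrative values.

```python
import numpy as np

bs = np.array([0.91, 0.93, 0.90, 0.94])        # illustrative best scores BS_i
best, worst = bs.max(), bs.min()               # Eqs. (14)-(15)
mean, sd = bs.mean(), bs.std()                 # Eqs. (16)-(17), population SD
```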

5.2 Analysis and discussion

For the evaluation of the methods on the different datasets, four criteria (worst, best, mean, and SD) have been used. In Table 5, the 30 modes and the VSA are investigated with respect to these criteria. The Proposed Method (State1) corresponds to modes VSAC11 to VSAC101, the Proposed Method (State2) to modes VSAC12 to VSAC102, and the Proposed Method (State3) to modes VSAC13 to VSAC103. With regard to the results, it can be stated that the Proposed Method (State2) gives better results. The Proposed Method (State2) with the VSAC101 mode, which uses the Tent map, offers better results than the other modes. The main target of this test is to evaluate the efficiency of the VSA with different chaotic maps and to identify the optimal chaotic map (Tables 6, 7, 8).

Table 5 Comparison of results of methods with VSA
Table 6 Comparison of results of methods with VSA (continuance)
Table 7 Comparison of results of methods with VSA (continuance)
Table 8 Comparison of Results of methods with VSA (continuance)

5.3 Comparisons between VSA and proposed method based on FS

In Table 9 and Fig. 2, the results of the VSA and the Proposed Method based on FS are shown. We chose the Proposed Method for FS because it had a high percentage of accuracy. Based on the results, it can be said that the Proposed Method is better than the VSA on 19 datasets.

Table 9 Results of the VSA and the proposed method based on the FS
Fig. 2 a Feature count (total). b Accuracy (%). c Time (s)

5.4 Comparison and evaluation

A comparison of the Proposed Method with the GA, PSO, ABC, BOA, IWO, FPA, and FA algorithms has been performed to evaluate its efficiency. In Table 10, the control parameters of the algorithms are given.

Table 10 Control parameters of the algorithms

The comparison of the Proposed Method with the PSO, ABC, BOA, IWO, GA, FA, FPA, and VSA was performed according to the worst criterion. According to Table 11 and Fig. 3, it is clear that the results of the other algorithms are worse than those of the Proposed Method.

Table 11 Comparison of the proposed method with other algorithms based on the worst criterion
Fig. 3 Result of comparison: a the worst criterion, b the best criterion, c the mean criterion

In Table 12, the comparison of the Proposed Method with PSO, ABC, BOA, IWO, GA, FA, FPA, and VSA was performed based on the best criterion. According to Table 12 and Fig. 3, it is clear that the results of the Proposed Method are better than those of the other algorithms.

Table 12 Comparison of the proposed method with other algorithms based on the best criterion

In Table 13, the comparison of the Proposed Method with PSO, ABC, BOA, IWO, GA, FA, FPA, and VSA was performed based on the mean criterion. According to Table 13 and Fig. 3, it is clear that the results of the Proposed Method are better than those of the other algorithms.

Table 13 Comparison of the Proposed Method with other algorithms based on the mean criterion

To sum up, the results and discussion of this paper demonstrate that integrating CMs into the VSA is definitely beneficial. The reason why the Proposed Method outperforms all the other algorithms is that the Tent chaotic map helps the algorithm to strongly emphasize exploration in the initial steps of the optimization and to reduce the search radius afterwards.

6 Real application: author identification

Author identification is a \(stylometric\) problem that tries to attribute a text to its original author [85, 86]. With the ever-increasing volume of documents uploaded to the internet, new methods for analyzing and extracting data and knowledge are needed. In order to prevent plagiarism and the copying of copyrighted materials, the best solution is authorship identification. Every writer has his/her own writing style, and this style can be identified in other papers [87]. Authorship identification is one of the up-to-date problems in the field of natural language processing. Author identification is an effort to reveal the writer's personal characteristics based on a piece of linguistic information [88], such that manuscripts written by different authors can be distinguished. Humans possess certain writing patterns when using a language, which act like fingerprints of the writer (writer print); these patterns are specific to the writers [89].

Authors in [90] have proposed an approach known as the \(stylometric\) approach to deal with the problem of Author Identification. There are four different steps in this approach:

  • Calculation of word frequencies to find the most frequent words in the entire corpus.

  • Calculation of normalized frequencies. This is done by dividing the frequency of the most frequent word in a document by the total number of words in the entire corpus.

  • Using Z-score method.

  • Calculation of distance table by finding distance between two matrices.

Since the text is converted into a numeric representation (feature extraction), machine learning classification and clustering techniques can be applied to it. The Reuter_50_50 dataset is used for the experiments. There are 50 authors and 50 documents per author in this dataset; thus, the training corpus and the test corpus each contain 2500 texts, and the two corpora do not overlap. By applying the \(stylometry\) approach and n-gram features to the author identification problem, an accuracy of about 85% is achieved with the SVM classifier, which is higher than that of the Delta and KNN classifiers.

The Dissimilarity Counter Method (DCM), DCM-Voting, and DCM-Classifier have been applied in [91] to the problem of author identification. Once the representation spaces are selected, similarity measures such as the Euclidean distance, the correlation coefficient, and the cosine similarity can be used to compare the documents; the document author is then identified using one of the above-mentioned approaches (DCM, DCM-Voting, or DCM-Classifier). DCM only uses the similarities between the vector representations of documents in one space to solve a problem p of P. In the other two DCM-based approaches, it is possible to combine different representation spaces. In the DCM-Voting approach, this is done using a voting technique, and in the DCM-Classifier, it is performed through a supervised learning method which requires the definition of predictive features. In the evaluation of the PAN-CLEF 2013 challenge, the DCM-Classifier achieved the best performance only on the Greek corpus with 85%, and the other approaches obtained results that were the best, or equivalent to those of the winner of the competition, for all evaluation measures (F1, precision, and recall) on all the corpora.

The General Impostors Method (GENIM), which took part in the PAN'13 authorship identification competition, has been evaluated in [92]. The basis of this model is a comparison between the given documents and a number of external (impostor) documents, and since there are two stages in the method, the performance had to be measured and the parameters optimized at each step. 25–33 percent of the training documents of each language were used for measuring and optimizing the IM, whereas the rest were used for the evaluation of GENIM. For the IM evaluation set, 3 or 4 documents were used as seed documents to retrieve the web impostors. The test accuracy is 75.3%.

In [93], blocks containing 140, 280, and 500 characters were investigated. The feature set contains conventional features such as syntactic, lexical, and application-specific features, as well as some new features extracted from n-gram analysis. Moreover, the proposed approach has a mechanism for handling issues related to unbalanced datasets. It uses a Support Vector Machine (SVM) for data classification and Information Gain and Mutual Information as FS strategies. The proposed approach was evaluated experimentally using the Enron email and Twitter corpora. The results of this evaluation were very promising, including an Equal Error Rate (EER) ranging between 9.98% and 21.45% for the different block sizes.

In [94], a model for email authorship identification (EAI) is presented using a cluster-based classification approach. The contributions of that paper are as follows: (a) developing a new model for email authorship identification; (b) evaluating the use of additional features together with basic \( {\text{stylometric}}\) features for email authorship identification, as well as content features based on Info-Gain FS. On the Enron dataset, the proposed model achieved accuracies of 94, 89, and 81 percent for 10, 25, and 50 authors, respectively, whereas on a real email dataset constructed by the authors it attained an accuracy of 89.5%.

A large number of studies focus only on enhancing predictive accuracy and do not pay much attention to the intrinsic value of the collected evidence. In [95], a customized associative classification approach, a well-known data mining technique, is applied to the authorship attribution problem. This method models the features of writing style that are unique to a person, measures the associativity level of these features, and generates an intuitive classifier. That research also concluded that a more accurate write print can be obtained by modifying the rule pruning and ranking system described in the popular Classification by Multiple Association Rule (CMAR) algorithm. More convincing evidence can be provided for a court of law by eliminating patterns common among different authors, since this leads to fairly unique and easy-to-understand write prints. Since this customized associative classification method is helpful in solving the problem of e-mail authorship attribution, it can be used as a powerful tool against cybercrime. The effectiveness of the presented approach is verified by the experimental results [95].

An effort is made by the authors in [96] to identify the authors of articles written in Arabic. They introduced a new dataset composed of 12 features and 456 samples from 7 authors. Furthermore, to distinguish the different authors from each other, powerful classification techniques were combined with the proposed dataset in their approach. The obtained results revealed that the proposed dataset was very successful and achieved a classification accuracy of 82% in the hold-out tests. They also conducted experiments with two well-known classifiers, namely the SVM and functional trees (FT), in order to show the efficiency of the proposed feature set. They reported an accuracy of 82% with the FT approach and holdout testing, which confirmed the robustness of the proposed feature set. Moreover, an accuracy of 100% was achieved in one of the classes. They also conducted tests on FT using tenfold cross-validation, and the proposed approach retained its accuracy to some extent.

One of the classifiers that has been extensively used for language processing is the Naive Bayes classifier. Nevertheless, the event model used, which can remarkably affect the classifier performance, is not often mentioned. Naive Bayes (NB) classifiers had never been used for authorship attribution in Arabic; thus, in [97] these classifiers were applied to this problem, taking into consideration various event models such as simple NB, multinomial NB (MNB), multi-variant Bernoulli NB (MBNB), and multi-variant Poisson NB (MPNB). The MBNB probability estimation depends on whether a feature exists or not, whereas in MNB and MPNB the probability estimation depends on the frequency of the feature. The mean and standard deviation of the features form the basis of the probability estimation in the NB model. The performances of these models are evaluated using a large Arabic dataset taken from books written by 10 different authors, and then compared with other methods. The obtained results reveal that MBNB outperforms the other techniques and is able to identify the author of a text with an accuracy of 97.43%. In addition, these results show that MNB and MBNB can be considered a good choice for authorship attribution [97].

In [98], authorship identification methods were applied to messages from Arabic web forums. In this study, syntactic, lexical, structural, and content-specific writing style features were used to identify the authors. Some of the problematic characteristics of the Arabic language were addressed in order to present a model with an acceptable degree of classification accuracy for authorship identification. SVM performed better than C4.5, and compared to English, the overall accuracy for Arabic was lower. These results were consistent with previous research. Finally, as future work, the authors proposed analyzing the differences between the two languages by evaluating the key features as determined by decision trees. Highlighting the linguistic differences between the English and Arabic languages provides further insight into possible techniques for enhancing the performance of authorship discrimination methodologies in an online, multilingual setting. The results showed accuracies of 85.43% and 81.03% for SVM and C4.5, respectively.

In [99], the authors developed an authorship visualization known as Writeprints, which can be used to identify individuals based on their writing style. Unique writing style patterns are created through this visualization; these patterns can be distinguished in a similar way to how fingerprint biometric systems work. Writeprints provides an approach that is based on component analysis and utilizes a dynamic feature-based sliding window algorithm, which makes it very suitable for visualizing authorship across larger groups of messages. The performance of the visualization on messages taken from three different Arabic and English forums was evaluated and compared with the performance of SVM. This comparison indicated that Writeprints shows an excellent classification performance and provides better results than SVM in many instances. They also concluded that the visualization can be used to identify cyber criminals and can help users authenticate fellow online members to prevent cyber fraud. Accuracies of 68.92% and 87.00% were obtained for Writeprints and SVM, respectively.

In [100], approaches were introduced to deal with imbalanced multi-class textual datasets. The main idea behind the approach is to divide the training texts into text samples based on the class size, so that a fairer classification model can be generated. It thus becomes possible to divide majority classes into fewer and longer samples and minority classes into many shorter samples. The authors used text sampling techniques to form a training set with a desirable distribution over the classes. By text sampling, they generated new synthetic data that artificially increased the training size of a class. A series of authorship identification experiments were conducted on different multi-class imbalanced cases belonging to two text corpora in two languages: newspaper reportage in Arabic and newswire stories in English. The properties of the presented techniques were revealed by the results obtained through these experiments. They also tested four methods to deal with the problem of class imbalance [100]:

  • The first method: under-sampling the majority classes based on training texts. The same amount of text, equal to the base, was used, and no modification was applied to the length of each text.

  • The second method: under-sampling the majority classes based on training text lines. All the training texts of a particular author were merged to form one big text. Assuming that xmin represents the size (in text lines) of the shortest big file, the first xmin text lines of each big file were segmented into text samples of length a (in text lines). It is worth noting that there was at least one complete sentence in each text line in both corpora. It was concluded that smaller values of a (such as 2 or 3) lead to better results.

  • The third method: re-balancing the dataset by text samples of varying length. As was mentioned earlier, one big file is generated for each author by concatenating the training texts. The length of each text sample is then xi/k (where k is a predefined parameter). Short text samples belong to minority authors and long text samples belong to majority authors. Therefore, a balanced dataset is generated which consists of k text samples per class. Experiments were conducted for k = 10, 20, and 50. It is noteworthy that each text line of the training corpus is used exactly once in the text samples.

  • The fourth method: re-balancing the dataset through text re-sampling. A big file is generated for each author once again. Assuming that xi represents the text length (in text lines) of the \({\text{ith}}\) author and xmax is the length of the longest file, then for each author k + xmax/xi text samples are generated, each consisting of xi/k text lines. Therefore, a variable number of text samples is generated for each author based on the length of the big file; however, the relationship is now inverted. Longer text samples are generated for the majority classes, but a larger number of short text samples is generated for the minority classes.

Using a dataset extracted from Arabic novels, the authors of [101] derived two sets of function words, AFW54 and AFW65, with 11 words eliminated. These two sets were used to convert several Arabic texts into frequency vectors. They carried out a performance evaluation on these word sets through experiments which used a hybridization of an EA and LDA to generate a classifier, and then fed unseen data to that classifier in order to test it. The obtained performance was consistent with the results of authorship attribution research performed on other languages. It is arguable that AFW54 is the more suitable choice; nevertheless, such a claim cannot be made with any statistical significance. For the cases considered, only a small number of investigations were reported for evaluating the appropriate 'chunk' size. In real-world applications this will probably depend on several factors, but chunks of at least about 1,000 words appear to give a usable characterization of function word usage for Arabic authors. Through this work, they confirmed that the concept of function words translates properly into the Arabic language: various authors use this set of words in various ways, and this enables us to recognize the stylistic features of individual authors and use them to distinguish between different authors [101].

High-dimensional datasets bring about more computational challenges. One of the problems with high-dimensional datasets is that, in most cases, not all features of the data are crucial for the knowledge implicit in the data [85, 102]. Consequently, in most occasions, reducing the dimensionality of the data is desirable. Often, many candidate features for learning are irrelevant and superfluous and degrade the efficiency of the learning algorithm [103, 104]. Learning accuracy and training speed may deteriorate with superfluous features. Therefore, choosing the necessary features in the preprocessing phase is essentially important. In this section, for identifying the author, at the first stage the frequency of words is obtained using the TF-IDF method [105]. At the second stage, each feature is weighted [106]. At the third stage, FS is performed using the metaheuristic algorithms. At the fourth stage, classification is performed via KNN [106]. A sketch of this pipeline is given below.
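The following is a hedged sketch of the four-stage pipeline: TF-IDF term weighting, an externally supplied binary feature mask standing in for the metaheuristic FS step, and a 3-NN classifier. The vectorizer settings and helper names are ours, not the paper's.

```python
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.neighbors import KNeighborsClassifier

def author_id_pipeline(train_texts, train_authors, test_texts, feature_mask=None):
    vec = TfidfVectorizer(max_features=5000)            # stages 1-2: TF-IDF weighting
    X_train = vec.fit_transform(train_texts).toarray()
    X_test = vec.transform(test_texts).toarray()
    if feature_mask is not None:                        # stage 3: selected features
        cols = np.flatnonzero(feature_mask)
        X_train, X_test = X_train[:, cols], X_test[:, cols]
    knn = KNeighborsClassifier(n_neighbors=3)           # stage 4: KNN classification
    knn.fit(X_train, train_authors)
    return knn.predict(X_test)
```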

Furthermore, we used the accuracy as the evaluation measure. This accuracy is calculated as:

$$ Accuracy = \frac{TP + TN}{{TP + TN + FP + FN}}*100 $$
(22)

Here, TP represents the number of authors correctly assigned to the positive class, TN indicates the number of authors correctly assigned to the negative class, FP is the number of authors falsely assigned to the positive class, and FN is the number of authors falsely assigned to the negative class although they were positive.

6.1 Reuter_50_50 dataset

In this subsection, the Proposed Method and the other algorithms are applied to the Reuter_50_50 dataset. The dataset contains 2500 documents and 50 writers (https://archive.ics.uci.edu/ml/datasets/reuter_50_50). The results of the discussed algorithms and the results from other papers are presented in Table 14 and Fig. 4. The results show that the proposed method has a better identification accuracy than the other algorithms. Moreover, BOA and FPA also have better identification accuracy than the remaining algorithms.

Table 14 Comparison of proposed method with other algorithms on Reuter_50_50 datasets
Fig. 4 Result of comparison: a Reuter_50_50 dataset, b PAN dataset, c Enron email dataset, d Arabic scripts

6.2 PAN dataset

These datasets consist of scientific documents in Greek, English, and Spanish, and since 2011 a new dataset has been added to the existing ones every year (https://pan.webis.de). The results of the discussed algorithms and the results from other papers on these datasets are evaluated in Table 15 and Fig. 4. The identification accuracies of the proposed method for PAN11, PAN12, PAN13, PAN14, PAN15, and PAN16 are 84%, 80.9%, 81.3%, 82.12%, 83.25%, and 81.79%, respectively. Moreover, the identification accuracies of the DCM models are lower than those of the other algorithms. The algorithms BOA, ABC, and IWO have better identification accuracies than the algorithms GA, PSO, FPA, and FA.

Table 15 Comparison of proposed method with other algorithms on PAN datasets

6.3 Enron email dataset

This dataset was collected and prepared by the CALO project (a cognitive assistant that learns and organizes). It includes the emails of 150 users, mostly senior managers of Enron (https://www.cs.cmu.edu/~enron/). The results of the proposed method and the results from other papers on the Enron email dataset are presented in Table 16 and Fig. 4. The results show that the accuracy and error rate of the proposed method are 95.04 and 11.68, respectively. The accuracies of the PSO, BOA, and FPA algorithms are 91.02, 93.01, and 90.78, respectively. The accuracy and error rate of the ABC algorithm are 90.02 and 15.2, respectively. Among the other models, the CCM-10 model has a better accuracy, and the lowest accuracies are seen in the Naïve Bayes and Bayes Net models.

Table 16 Comparison of proposed method with other algorithms on enron email dataset

6.4 Arabic scripts

This dataset consists of 30 documents from 10 authors. The authors were chosen from the website (http://www.alwaraq.net), and their names are: Aljahedh, Alghazali, Alfarabi, Almas3ody, Almeqrezi, Altabary, Altow7edy, Ibnaljawzy, Ibnrshd, and Ibnsena. The results of the proposed method and the results from other papers on this dataset are presented in Table 17 and Fig. 4. The identification accuracy of the proposed method is 93.24%, which is better than that of the other models.

Table 17 Comparison of proposed method with other algorithms on Arabic scripts

According to the experimental results, it is concluded that the Proposed Method has a better performance than the other models in terms of identification accuracy. According to Tables 15, 16, and 17, the proposed method is the closest to the optimum on the benchmark functions compared with the algorithms FPA, IWO, BOA, ABC, PSO, GA, and FA. Moreover, the proposed method has a better accuracy in the author identification problem. The accuracy rates of ABC, BOA, and the proposed method are indicated in Table 17. The results reveal that the proposed method outperforms the other models, i.e., the ABC and BOA models: the accuracy of the proposed method is 93.24%, while it is 91.00% for ABC and 92.51% for the BOA model.

7 Conclusion and future works

In this paper, we proposed three states based on hybridizing chaotic maps with the VSA for FS. State2 was compared with State1, State3, and the VSA, and it obtained better values. We also used State2 for FS and text author identification. In this paper, 10 CMs are used to enhance the overall performance and precision of the VSA. The VSA is applied to one of the challenging problems, namely FS. The proposed methods have been evaluated on 24 benchmark datasets. Four precise evaluation criteria are followed in this paper: worst, best, mean, and SD. Similarly, the performance of the Proposed Method is compared with popular and recent algorithms, namely PSO, ABC, BOA, IWO, GA, FA, FPA, and VSA. The experimental results show that State2 outperforms the other algorithms in terms of best and mean fitness.

Moreover, the results showed that the Proposed Method (State2) with the Tent map can drastically enhance the VSA in terms of classification performance, solution stability, number of selected features, and convergence speed, and that the Tent map was the most suitable map. Therefore, the following conclusions can be drawn:

  • The CMs improve the exploration phase because they perturb the search radius, helping trapped solutions to escape from local minima.

  • The CMs allow the proposed method to adaptively adjust exploration and exploitation. In other words, the Proposed Method (State1) encourages the VSA to transition gradually from the exploration stage to the exploitation stage.

Future work will consider the integration of CMs with other metaheuristic algorithms. In addition, the performance of the VSA will be verified on more challenging scientific and real-world engineering problems.