1 Introduction

In the last two decades, many metaheuristic algorithms have been proposed by researchers owing to advantages such as flexibility, the ability to bypass local optima, and independence from gradient information [1]. Metaheuristic algorithms have gained considerable attention and interest because they can solve real-world optimization problems by mathematically simulating physical or biological phenomena, and they have been applied to many problems [2, 3]. These algorithms can be divided into four major categories: evolutionary algorithms (EAs), swarm intelligence-based (SI-based) algorithms, physics- and chemistry-based algorithms, and human-based algorithms. The first category (evolutionary algorithms) contains algorithms inspired by natural evolution. EAs start from a randomly generated population; then, all individuals are evaluated in each generation and new individuals are produced using crossover and mutation operators. This category includes the genetic algorithm (GA) [4], evolution strategy (ES) [5], genetic programming (GP) [6], and the biogeography-based optimizer (BBO) [1].

The second category comprises SI-based algorithms inspired by the social behavior of swarms [7], i.e., collections of living beings in nature. Examples of SI algorithms are particle swarm optimization (PSO) [8], ant colony optimization (ACO) [9], the Harris hawks optimizer (HHO) [10], virus colony search [11], the slime mould algorithm (SMA) [12], hunger games search (HGS) [13], and the Runge–Kutta (RUN) optimizer [14]. The third class includes algorithms that simulate physical or chemical phenomena; examples are simulated annealing (SA) [15] and the gravitational search algorithm (GSA) [16]. The last class includes algorithms inspired by human behavior, such as teaching–learning-based optimization (TLBO) [17] and tabu (taboo) search (TS) [18]. In addition to engineering optimization problems [19, 20], these stochastic methods have been applied and have contributed to more complex problems, attracting many works in science and engineering fields such as medical data classification [21,22,23,24], scheduling problems [25, 26], feature selection [27,28,29], wind speed forecasting [30], and engineering design problems [31,32,33]. Furthermore, the potential of metaheuristics is not limited to such problems; there is still much room for exploration in the fields of the hard maximum satisfiability problem [34, 35], bankruptcy prediction [36, 37], parameter optimization [38,39,40], PID control [41,42,43], detection of foreign fibers in cotton [44, 45], surveillance [46], service ecosystems [47, 48], micro-expression spotting [49, 50], and prediction problems in the educational domain [51, 52].

WOA is a recent algorithm developed in [53] that has gained considerable attention due to its simple code and its high similarity to the grey wolf optimizer (GWO). It can outperform many state-of-the-art algorithms, such as GSA, PSO, and GA. The algorithm is based on the special hunting behavior of humpback whales, called the bubble-net method. WOA has received wide interest since its inception, as it shows good performance on many optimization tasks; consequently, many researchers have proposed modifications. In [54], a binary version of WOA using two transfer functions was proposed and applied to solve the traveling salesman problem (TSP). In [55], Aljarah et al. used WOA to find the optimal connection weights of a neural network. Also, Elaziz et al. [56] developed a hyper-heuristic algorithm that uses DE to improve the initial WOA population. In [57], Emary et al. studied the impact of Lévy flight in WOA and SCA. Likewise, Oliva et al. [58] proposed a new version of WOA using chaotic maps and applied it to estimate photovoltaic cell parameters. In [59], Xiong et al. proposed an improved version of WOA by developing two prey search strategies. In [60], Chen et al. proposed two strategies based on Lévy flight and chaotic local search to achieve a good balance between the core capabilities. Also, the authors in [61] introduced a hybrid version of WOA and SA by embedding SA in WOA as a local search strategy. In [62], Abdel-Basset et al. designed a new version of WOA and used it for cryptanalysis of the Merkle–Hellman cryptosystem. Another hybrid of WOA and GWO, called WGC, was introduced for data clustering [63]. Agrawal et al. [64] embedded quantum operators in WOA and used it for the feature selection problem. The authors in [65] proposed a new version using an opposition-based technique to prevent the basic whale method from getting trapped in local optima. Also, in the works of [66, 67], the proposed approaches were applied to the feature selection problem. In [68], Hemasian-Etefagh et al. tried to prevent the classical WOA from being trapped in local optima by introducing a new version called group WOA (GWOA), in which the population is divided into several groups based on their fitness values. In [69], Hassib et al. proposed a novel classification framework for big data using WOA. To solve job shop scheduling problems (JSSP), the authors of [70] introduced a hybrid algorithm called WOA-LFDE, in which differential evolution (DE) and Lévy flight are hybridized with WOA. Also, in [71], Jiang et al. introduced an enhanced WOA that embodies two approaches: introducing an armed force program and adjusting a beneficial strategy. In [72], Guo et al. used a modified version of WOA based on an adaptive neighborhood strategy to forecast the demand for water resources. Also, in [73], Got et al. introduced an enhanced multi-objective version of WOA, called guided population archive WOA (GPAWOA), to solve multi-objective problems.

WOA has been applied to many medical applications. The authors in [39] developed a chaotic multi-swarm WOA version called CMWOA combined with a support vector machine (SVM) and applied it to perform feature selection for several well-known medical disease problems, such as breast cancer, erythemato-squamous diseases, and diabetes. Also, in [74], Abdel-Basset et al. integrated the basic WOA with TS and employed it to solve the quadratic assignment problem (locating hospital departments). In [75], Tharwat et al. used WOA with an SVM to classify the biotransformed toxicity effects of hepatic drugs. Also, in [76], Zhao et al. combined an SVM kernel function with WOA for colorectal cancer diagnosis.

Gharehchopogh and Gholizadeh listed all WOA variants and applications in detail in a comprehensive survey [77]. Despite the success of the original WOA, many works have shown that its performance may degrade when solving some optimization tasks.

On the other hand, another recent metaheuristic, called virus colony search (VCS), was developed in [11]. VCS simulates the diffusion and infection behavior of viruses when attacking host cells. VCS has been applied to many power optimization problems, such as unit commitment [78], resource allocation [79], and distributed generator placement [80].

In this study, a new enhanced WOA-based algorithm is designed by embedding the core mechanisms of VCS into the main method. It aims to overcome the above limitations by revisiting WOA based on the core components of the Gaussian walk, CMA-ES, and evolution strategy that appear in VCS. This can prevent WOA from getting trapped in local optima by maintaining a better balance between the exploration and exploitation capabilities. To evaluate the resulting framework, 30 benchmark cases from IEEE CEC2017 were employed in addition to four different constrained engineering problems. Besides, the enhanced WOA-based variant was applied to image segmentation, where eight images are utilized and the results are compared with various WOA variants. The attained results show that the new structure alleviates the central shortcomings of WOA, and the proposed VCSWOA achieves a significant performance advantage over its peers.

This paper is organized as follows. Sections 2 and 3 give detailed descriptions and the mathematical models of WOA and VCS, respectively. Sections 4 and 5 present the proposed method and discuss the results. Section 6 concludes the paper.

2 Whale optimization algorithm

In this section, we present the basics of WOA by describing its main components: its inspiration, its mathematical model, and how it handles exploration and exploitation. WOA [53] was introduced by Mirjalili et al. in 2016 and mimics the foraging of humpback whales, which have a special hunting technique called bubble-net feeding, in which the whales swim around their prey along a shrinking, '9'-shaped path. In WOA, the current best solution is assumed to be the target prey (or close to it); the other agents then attempt to update their position vectors toward this best position according to Eqs. (1) and (2).

$$\begin{aligned} \mathbf {D}= & {} |\mathbf {C}.\mathbf {X}^{*}(t)-\mathbf {X}(t)| \end{aligned}$$
(1)
$$\begin{aligned} \mathbf {X}(t+1)= & {} \mathbf {X}^{*} (t)-\mathbf {A}.\mathbf {D} \end{aligned}$$
(2)

where t denotes the iteration counter, \(\mathbf {C}\) and \(\mathbf {A}\) are coefficient vectors, \(\mathbf {X}^{*}\) is the position vector of the best solution obtained so far, and \(\mathbf {X}\) is the position vector of the current agent. The values of \(\mathbf {A}\) and \(\mathbf {C}\) are obtained from the following rules:

$$\begin{aligned} \mathbf {A}= & {} 2.\mathbf {a}.\mathbf {r}-\mathbf {a} \end{aligned}$$
(3)
$$\begin{aligned} \mathbf {C}= & {} 2.\mathbf {r}, \end{aligned}$$
(4)

where a is linearly decreased from 2 to 0 over the iterations and r is a random vector in [0, 1]. To mathematically simulate the exploitation phase, two approaches are used: (1) Shrinking encircling: attained by decreasing the value of a in Eq. (3); note that \(\mathbf {A}\) is then a random value in \([-a,a]\). (2) Spiral updating: this approach first computes the distance between the whale and the prey and then mimics the helix-shaped movement of humpback whales using the spiral model of Eq. (5):

$$\begin{aligned} \mathbf {X}(t+1)=\mathbf {D}^{\prime }.e^{bl}.\cos (2\pi l)+\mathbf {X}^{*}(t), \end{aligned}$$
(5)

where b is a constant that defines the shape of the logarithmic spiral, l is a random number in \([-1,1]\), and \(\mathbf {D}^{\prime }=|\mathbf {X}^{*}(t)-\mathbf {X}(t)|\) is the distance between the whale and the prey (the best solution so far). To select between the spiral movement and the shrinking encircling mechanism, a probability of 50% is assumed as follows:

$$\begin{aligned} \mathbf {X}(t+1)=\left\{ \begin{array}{ll} \mathbf {X^{*}}(t)-\mathbf {A}.\mathbf {D}& if\; p<0.5\\ \mathbf {D}^{\prime }.e^{bl}.\cos (2\pi l)+\mathbf {X^{*}}(t)& if\; p\ge 0.5, \end{array}\right. \end{aligned}$$
(6)

where p is a uniformly distributed random number in [0, 1]. On the other hand, in the exploration (diversification) stage, \(|\mathbf {A}|>1\) is used to force the solution to move away from the current best location and search around a randomly selected agent. Equations (7) and (8) represent the mathematical model of the exploration phase as follows:

$$\begin{aligned} \mathbf {D}= & {} |\mathbf {C}.\mathbf {X}_{rand}-\mathbf {X}| \end{aligned}$$
(7)
$$\begin{aligned} \mathbf {X}(t+1)= & {} X_{rand}-\mathbf {A}.\mathbf {D} \end{aligned}$$
(8)

The general pseudo-code steps of WOA are presented in Algorithm 1.

Algorithm 1 The pseudo-code of WOA
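
To make these update rules concrete, the following minimal Python sketch implements the search loop summarized in Algorithm 1 and Eqs. (1)–(8). The objective function, the bound clipping, and the default parameter values (including b = 1) are illustrative assumptions rather than part of the original pseudo-code.

```python
import numpy as np

def woa(objective, dim, lb, ub, n_agents=30, max_iter=500, b=1.0):
    """Minimal WOA sketch following Eqs. (1)-(8); bounds handling and defaults are assumptions."""
    X = np.random.uniform(lb, ub, (n_agents, dim))        # random initial whale positions
    fitness = np.apply_along_axis(objective, 1, X)
    best = np.argmin(fitness)
    X_best, f_best = X[best].copy(), fitness[best]

    for t in range(max_iter):
        a = 2.0 - 2.0 * t / max_iter                      # a decreases linearly from 2 to 0
        for i in range(n_agents):
            A = 2.0 * a * np.random.rand() - a            # Eq. (3)
            C = 2.0 * np.random.rand()                    # Eq. (4)
            p = np.random.rand()
            l = np.random.uniform(-1.0, 1.0)
            if p < 0.5:
                if abs(A) < 1:                            # exploitation: shrinking encircling, Eqs. (1)-(2)
                    D = np.abs(C * X_best - X[i])
                    X[i] = X_best - A * D
                else:                                     # exploration: search around a random whale, Eqs. (7)-(8)
                    X_rand = X[np.random.randint(n_agents)]
                    D = np.abs(C * X_rand - X[i])
                    X[i] = X_rand - A * D
            else:                                         # spiral update around the best whale, Eq. (5)
                D_prime = np.abs(X_best - X[i])
                X[i] = D_prime * np.exp(b * l) * np.cos(2 * np.pi * l) + X_best
            X[i] = np.clip(X[i], lb, ub)                  # keep the agent inside the bounds (assumption)
            f_i = objective(X[i])
            if f_i < f_best:                              # track the best-so-far solution
                X_best, f_best = X[i].copy(), f_i
    return X_best, f_best
```

For instance, calling `woa(lambda x: np.sum(x**2), dim=10, lb=-100, ub=100)` minimizes the sphere function within the given bounds.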

3 Virus colony search algorithm

Virus colony search (VCS) is a recent nature-inspired population-based algorithm that simulates the infection and diffusion strategies of viruses. VCS mainly depends on three strategies: (1) the Gaussian random walk, (2) CMA-ES, and (3) the evolution strategy.

The population is divided into two groups: \(V_{pop}\), which refers to the virus colony, and \(H_{pop}\), which refers to the host cell colony. Each host cell is infected by one virus; the virus must then obtain nutrients by destroying the host cell in order to reproduce. Finally, only the few best viruses remain in the next generation, and the other viruses are evolved. The following subsections model these steps mathematically.

3.1 Viruses diffusion

A random walk is needed in this phase to simulate the movement of the viruses. A Gaussian random walk (GRW) is used, as given in Eq. (9), since it provides good performance.

$$\begin{aligned} Vpop_{i}=\text {Gaussian}(G^{g}_{best},\tau )+(r_{1}. G^{g}_{best}-r_{2}.Vpop_{i}), \end{aligned}$$
(9)

where i is the individual index, \(i \in \{1,2,3, \ldots ,N\}\), with N the population size, \(r_{1}\) and \(r_{2}\) are random variables in the interval [0, 1], \(G^{g}_{best}\) is the best individual at generation g, and \(\tau\) refers to the standard deviation, which can be calculated as follows:

$$\begin{aligned} \tau = \log (g)/g . (V_{pop_{i}} - G^{g}_{best}). \end{aligned}$$
(10)

In Eq. (9), the term \((r_{1}. G^{g}_{best}-r_{2}.V_{pop_{i}})\) serves as a search direction that helps prevent the search from getting trapped in a local optimum. Also, the term \(\log (g)/g\) is used to decrease the Gaussian jump size over generations, which improves the local search performance.
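
A minimal Python sketch of this diffusion step, under the assumption that `G_best` holds the current global best solution and `Vpop` is a NumPy array of shape (N, dim), could look as follows; the absolute value and the small epsilon that keep the standard deviation non-negative are implementation assumptions left implicit by Eqs. (9) and (10).

```python
import numpy as np

def virus_diffusion(Vpop, G_best, g):
    """Gaussian-walk diffusion sketch of Eqs. (9)-(10); g is the current generation (g >= 1)."""
    new_pop = np.empty_like(Vpop)
    for i in range(Vpop.shape[0]):
        # Eq. (10): jump size shrinks as log(g)/g; |.| + epsilon keeps the std valid (assumption)
        tau = np.abs(np.log(g) / g * (Vpop[i] - G_best)) + 1e-12
        r1, r2 = np.random.rand(), np.random.rand()
        step = np.random.normal(loc=G_best, scale=tau)    # Gaussian(G_best, tau)
        new_pop[i] = step + (r1 * G_best - r2 * Vpop[i])  # Eq. (9)
    return new_pop
```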

3.2 Host cells infection

In this stage, the virus invades the host cell and tries to destroy it until its death. The virus interacts with the host cell by absorbing essential nutrients and metabolizing harmful substances; the host cell is then converted into a new virus. This process is used to improve the exploration capabilities and to promote the exchange of information among individuals. Hence, the covariance matrix adaptation evolution strategy (CMA-ES), a derivative-free stochastic optimization method, is used to model this stage. The main steps to mathematically simulate this stage are as follows:


Step 1: \(H_{pop}\) updating process using Eq. (11).

$$\begin{aligned} Hpop_{i}^{g}= \left( \frac{\sum _{i}^{N}V_{pop_{i}}}{N}\right) +\sigma _{i}^{g} \times N_{i}(0, c_{g}) \end{aligned}$$
(11)

where \(N_{i}(0, c_{g})\) refers to a normal distribution with mean 0 and \(D\times D\) covariance matrix \(c_{g}\), D refers to the problem dimension, and g refers to the current generation.

Step 2: Selection of the best \(\lambda\) individuals from the previous stage as parental vectors. The weighted center of the selected vectors can be calculated as follows:

$$\begin{aligned} X^{g+1}_\mathrm{{mean}}=\sum _{i=1}^{\lambda }w_{i}\, Vpop^{\lambda best}_{i},\qquad w_{i}=\frac{\ln (\lambda +1)-\ln (i)}{\sum _{j=1}^{\lambda }\left( \ln (\lambda + 1)- \ln (j)\right) }, \end{aligned}$$
(12)

where \(\lambda ={\lfloor }{N/2}{\rfloor }\), i refers to the individual index, and \(w_{i}\) refers to the recombination weight. Next, two evolution paths are computed to track the changes of the population mean with an exponential decay of past information.

$$\begin{aligned} p^{g+1}_{\sigma } =(1-c_{\sigma })p^{g}_{\sigma }+\sqrt{c_{\sigma }(2-c_{\sigma })\lambda _{w}}\,\frac{1}{\sigma ^{g}}(c^{g})^{-1/2}(X^{g+1}_\mathrm{{mean}}-X^{g}_\mathrm{{mean}}) \end{aligned}$$
(13)
$$\begin{aligned} p^{g+1}_{c} =(1-c_{c})p^{g}_{c}+h_{\sigma }\sqrt{c_{c}(2-c_{c})\lambda _{w}}\,\frac{1}{\sigma ^{g}}(X^{g+1}_\mathrm{{mean}}-X^{g}_\mathrm{{mean}}) \end{aligned}$$
(14)

where \(\lambda _{w}^{-1}=\sum _{i=1}^{\lambda } w_{i}^{2}\), \(c_{\sigma }=(\lambda _{w}+2)/ (N+\lambda _{w} + 3)\), \(c_{c}=4/(N + 4)\), and \(h_{\sigma }=1\) if \(||p_{\sigma }^{g+1}||\) is large and 0 otherwise.

Step 3: Updating the step size

\(\sigma ^{g+1}\) can be updated using Eq. (15).

$$\begin{aligned} \sigma ^{g+1}=\sigma ^{g} \times \exp \left( \frac{c_{\sigma }}{d_{\sigma }}\left( \frac{||p_{\sigma }^{g+1}||}{E||N(0,I)||}-1\right) \right) \end{aligned}$$
(15)

Also, covariance matrix \(c^{g+1}\) is constructed using Eq. (16).

$$\begin{aligned} \begin{aligned} c^{g+1}&=(1-c_{1}-c_{\lambda })c^{g}+c_{1}p_{c}^{g+1}(p_{c}^{g+1})^\mathrm{{T}}\\&\quad +c_{\lambda }\sum _{i=1}^{\lambda }w_{i}\frac{Vpop_{i}^{\lambda best}-X_\mathrm{{mean}}^{g}}{\sigma ^{g}}.\frac{(Vpop_{i}^{\lambda best} - X_\mathrm{{mean}}^{g})^\mathrm{{T}}}{\sigma ^{g}} \end{aligned} \end{aligned}$$
(16)

where \(d_{\sigma }\), \(c_{1}\), and \(c_{\lambda }\) are computed as follows:

$$\begin{aligned} d_{\sigma }= & {} 1+ c_{\sigma }+2\max {\{0,(\sqrt{\lambda _{w}-1}/\sqrt{N+1})-1}\} \end{aligned}$$
(17)
$$\begin{aligned} c_{1}= & {} \frac{1}{\lambda _{w}}\left( \left( 1-\frac{1}{\lambda _{w}}\right) \min \left\{ 1,\frac{2\lambda _{w}-1}{(N+2)^{2}+\lambda _{w}}\right\} +\frac{1}{\lambda _{w}}\cdot \frac{2}{(N+\sqrt{2})^{2}} \right) \end{aligned}$$
(18)
$$\begin{aligned} c_{\lambda }= & {} (\lambda _{w} - 1)\, c_{1} \end{aligned}$$
(19)
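
The following Python sketch combines Eqs. (11)–(19) into a single infection step. It is only an illustration of how these quantities interact: the `state` dictionary (carrying Xmean, sigma, the covariance matrix C, and the two evolution paths between generations, e.g., initialized with sigma = 0.3·(ub − lb), C = I, and zero paths), the simplified h_sigma term, and the Cholesky-based whitening used in place of \((c^{g})^{-1/2}\) are assumptions, and a practical implementation would follow a standard CMA-ES code.

```python
import numpy as np

def host_infection_step(Vpop, fitness, state):
    """One CMA-ES-style infection step sketching Eqs. (11)-(19); minimization is assumed."""
    N, dim = Vpop.shape
    lam = N // 2                                                   # lambda = floor(N/2)
    Xmean_prev, sigma, C = state["Xmean"], state["sigma"], state["C"]
    p_sigma, p_c = state["p_sigma"], state["p_c"]

    # Eq. (11): sample host cells around the mean of the virus population
    Hpop = Vpop.mean(axis=0) + sigma * np.random.multivariate_normal(np.zeros(dim), C, size=N)

    # Eq. (12): recombination weights and weighted mean of the lambda best viruses
    idx = np.argsort(fitness)[:lam]
    w = np.log(lam + 1) - np.log(np.arange(1, lam + 1))
    w /= w.sum()
    Xmean = w @ Vpop[idx]
    lam_w = 1.0 / np.sum(w**2)

    c_sigma = (lam_w + 2) / (N + lam_w + 3)
    c_c = 4.0 / (N + 4)
    d_sigma = 1 + c_sigma + 2 * max(0.0, np.sqrt((lam_w - 1) / (N + 1)) - 1)   # Eq. (17)
    c1 = (1 / lam_w) * ((1 - 1 / lam_w) * min(1.0, (2 * lam_w - 1) / ((N + 2)**2 + lam_w))
                        + (1 / lam_w) * 2 / (N + np.sqrt(2))**2)               # Eq. (18)
    c_lam = (lam_w - 1) * c1                                                   # Eq. (19)

    # Eqs. (13)-(14): evolution paths (L^{-1} used as a whitening surrogate for (c^g)^{-1/2})
    y = (Xmean - Xmean_prev) / sigma
    L_inv = np.linalg.inv(np.linalg.cholesky(C))
    p_sigma = (1 - c_sigma) * p_sigma + np.sqrt(c_sigma * (2 - c_sigma) * lam_w) * (L_inv @ y)
    h_sigma = 1.0                                                              # simplified indicator
    p_c = (1 - c_c) * p_c + h_sigma * np.sqrt(c_c * (2 - c_c) * lam_w) * y

    # Eq. (16) uses the previous sigma and mean, so build Y before updating sigma
    Y = (Vpop[idx] - Xmean_prev) / sigma
    chi_n = np.sqrt(dim) * (1 - 1 / (4 * dim) + 1 / (21 * dim**2))             # E||N(0, I)||
    sigma *= np.exp((c_sigma / d_sigma) * (np.linalg.norm(p_sigma) / chi_n - 1))   # Eq. (15)
    C = (1 - c1 - c_lam) * C + c1 * np.outer(p_c, p_c) + c_lam * (Y.T * w) @ Y     # Eq. (16)

    state.update(Xmean=Xmean, sigma=sigma, C=C, p_sigma=p_sigma, p_c=p_c)
    return Hpop, state
```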

3.3 Immune response

According to the influence of the host cell immune system, only the best-performing viruses retain their properties in the next generation, while the others are killed by the immune system. Hence, the following steps are used to model the virus evolution.

Step 1: Performance rank evaluation

The survival probability \(Pr_{rank(i)}\) of the ith-ranked virus can be calculated as follows.

$$\begin{aligned} Pr_{rank(i)}=\frac{(N-i+1)}{N} \end{aligned}$$
(20)

Step 2: Evolution of individuals

$$\begin{aligned} \begin{aligned} \left\{ \begin{array}{ll} Vpop_{i,j}=Vpop_{k,j}-rand \times \\ (Vpop_{h,j} - Vpop_{i,j})&{} r > Pr_{rank(i)}\\ \\ Vpop_{i,j}=Vpop_{i,j}&{} {\text {otherwise}} \end{array} \right. \end{aligned} \end{aligned}$$
(21)

where the three indices k, i, and h are chosen randomly from \(\{1, 2, 3, \ldots , N\}\) such that \(i \ne k \ne h\), \(j \in \{1, 2, 3, \ldots , d\}\), and rand and r are random values \(\in\) [0, 1].
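
A compact Python sketch of this immune response, assuming minimization and a NumPy population array, is given below; the ranking convention (rank 1 for the best virus) follows Eq. (20).

```python
import numpy as np

def immune_response(Vpop, fitness):
    """Rank-based evolution of Eqs. (20)-(21); minimization is assumed."""
    N, dim = Vpop.shape
    order = np.argsort(fitness)                   # best virus first
    rank = np.empty(N, dtype=int)
    rank[order] = np.arange(1, N + 1)             # rank 1 for the best virus
    Pr = (N - rank + 1) / N                       # Eq. (20): survival probability per virus
    new_pop = Vpop.copy()
    for i in range(N):
        for j in range(dim):
            if np.random.rand() > Pr[i]:          # Eq. (21): weak viruses are more likely to evolve
                candidates = [x for x in range(N) if x != i]
                k, h = np.random.choice(candidates, size=2, replace=False)
                new_pop[i, j] = Vpop[k, j] - np.random.rand() * (Vpop[h, j] - Vpop[i, j])
    return new_pop
```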

4 Proposed algorithm

In this section, the structure of the proposed WOA-based method is explained in detail, as shown in Fig. 1. The basic WOA has some core limitations, especially when solving complex problems, mainly multimodal and high-dimensional functions. The main limitations of WOA are its tendency to fall into local optima and its slow convergence.

VCSWOA aims to overcome these limitations by revisiting WOA based on the core components of the Gaussian walk, CMA-ES, and evolution strategy that appear in VCS. These ideas enhance the convergence speed and the local optima avoidance of the WOA method. Here, the components of the VCS algorithm perform intensification moves that make the WOA algorithm more capable of avoiding local optima, which is reflected in improved exploitation abilities. On the other hand, the conventional operators of WOA handle the exploratory patterns needed for a well-organized search across the regions of the search space. In this way, a well-harmonized balance between the exploitation and exploration procedures can be reached.

The pseudo-code of VCSWOA is shown in Algorithm 2, and it works as follows: an initial whale population is generated randomly. Then, the three phases of the Gaussian walk, CMA-ES, and evolution strategy, i.e., virus diffusion, host cell infection, and immune response in VCS, are performed to further evolve the immature population. After that, the position of each search agent is updated based on the values of p and |A|.

Algorithm 2 The pseudo-code of the proposed VCSWOA
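
The following high-level Python sketch illustrates one possible reading of this loop: the VCS phases first refine the population, and the WOA rules of Eqs. (1)–(8) then update every agent based on p and |A|. The `vcs_phases` callable is a hypothetical placeholder standing for the combination of the diffusion, infection, and immune-response steps of Sects. 3.1–3.3, and the bound clipping and greedy best-solution update are implementation assumptions.

```python
import numpy as np

def vcswoa(objective, dim, lb, ub, vcs_phases, n_agents=30, max_iter=500, b=1.0):
    """High-level sketch of the hybrid loop described above.

    `vcs_phases` is a hypothetical callable (X, fitness, X_best, t) -> X that applies
    the diffusion, infection and immune-response phases of Sects. 3.1-3.3."""
    X = np.random.uniform(lb, ub, (n_agents, dim))
    fitness = np.apply_along_axis(objective, 1, X)
    i_best = np.argmin(fitness)
    X_best, f_best = X[i_best].copy(), fitness[i_best]

    for t in range(1, max_iter + 1):
        # VCS part: evolve the population around promising regions
        X = np.clip(vcs_phases(X, fitness, X_best, t), lb, ub)
        fitness = np.apply_along_axis(objective, 1, X)
        if fitness.min() < f_best:
            X_best, f_best = X[np.argmin(fitness)].copy(), fitness.min()

        # WOA part: update each agent based on p and |A| (Eqs. (1)-(8))
        a = 2.0 - 2.0 * t / max_iter
        for i in range(n_agents):
            A = 2.0 * a * np.random.rand() - a
            C = 2.0 * np.random.rand()
            p, l = np.random.rand(), np.random.uniform(-1.0, 1.0)
            if p < 0.5:
                ref = X_best if abs(A) < 1 else X[np.random.randint(n_agents)]
                X[i] = ref - A * np.abs(C * ref - X[i])
            else:
                X[i] = np.abs(X_best - X[i]) * np.exp(b * l) * np.cos(2 * np.pi * l) + X_best
            X[i] = np.clip(X[i], lb, ub)
        fitness = np.apply_along_axis(objective, 1, X)
        if fitness.min() < f_best:
            X_best, f_best = X[np.argmin(fitness)].copy(), fitness.min()
    return X_best, f_best
```

For example, using only the diffusion sketch from Sect. 3.1, one could call `vcswoa(f, dim, lb, ub, vcs_phases=lambda X, fit, best, t: virus_diffusion(X, best, t))`.
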
Fig. 1 The flowchart of the proposed method

5 Experiment

In this section, several experiments are performed to demonstrate the efficiency of the proposed algorithm on three types of tasks: benchmark functions, engineering design problems, and image segmentation problems.

5.1 Benchmark functions

Thirty functions from the IEEE CEC2017 benchmark have been used. Table 1 defines these functions and their types, including unimodal, multimodal, hybrid, and composite. VCSWOA has been compared with eight other WOA variants, namely: chaotic WOA (CWOA) [81], opposition learning-based WOA (OBWOA) [82], A-C parametric WOA (ACWOA) [83], enhanced associative learning-based exploratory WOA (BMWOA), improved WOA (IWOA) [84], balanced WOA with Lévy flight and chaotic local search (BWOA) [60], multi-strategy boosted mutative WOA (CCMWOA) [], and Lévy flight-based WOA (LWOA) [57]. The parameter settings for each algorithm are given in Table 2. The number of individuals, the number of dimensions, and the maximum number of function evaluations (MaxFEs) are given in Table 3. The same conditions were used for all algorithms to ensure fair comparisons, following common practice in the artificial intelligence community [85, 86].

Table 4 shows the experimental results in terms of the average (mean), standard deviation (std), best (min), and worst (max) values. From this table, it can be noticed that VCSWOA ranks first on all unimodal functions (F1–F3) in terms of both the average and standard deviation. For the multimodal functions, VCSWOA ranks first in average on five functions (F4, F6, F8, F9, and F10) and second on the other three. For the hybrid and composite functions, VCSWOA achieves the best value on 7 functions and the second best on 8 others. Also, Table 5 shows the Wilcoxon signed-rank test [87] results, in which VCSWOA is considered superior to a compared algorithm when the p value is smaller than 5%. Figure 2 shows the convergence curves for ten selected functions.
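
For reference, the pairwise comparison reported in Table 5 corresponds to the standard Wilcoxon signed-rank test applied to the per-function results of two algorithms; the snippet below illustrates the computation on synthetic placeholder data, not on the paper's actual results.

```python
import numpy as np
from scipy.stats import wilcoxon

# Synthetic placeholder data: mean errors of two algorithms on the 30 CEC2017 functions
rng = np.random.default_rng(0)
vcswoa_means = rng.random(30)
competitor_means = vcswoa_means + rng.normal(0.05, 0.02, 30)

stat, p_value = wilcoxon(vcswoa_means, competitor_means)   # paired, two-sided test
print(f"Wilcoxon statistic = {stat:.3f}, p value = {p_value:.4f}")
print("significant at the 5% level" if p_value < 0.05 else "not significant at the 5% level")
```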

Table 1 CEC2017 benchmark functions
Table 2 The parameter settings for the algorithms
Table 3 The parameter settings
Table 4 The comparison results of all algorithms over 30 functions
Fig. 2 Convergence curves of unimodal and multimodal functions

Table 5 The calculated p values from the signed-rank test

5.2 Engineering problems

In this subsection, four different engineering problems are used: the pressure vessel design problem, the welded beam design problem, the tension/compression spring design problem, and the cantilever beam design problem.

5.2.1 Pressure vessel design problem

The pressure vessel design problem is one of the widely used engineering design problems; it aims to minimize the material cost of a pressure vessel. It consists of four parameters: shell thickness \(T_{s}\), head thickness \(T_{h}\), radius R, and cylindrical section length L. The mathematical model can be formulated as:

Minimize: \(f(x)=0.6224{{x}_{1}}{{x}_{3}}{{x}_{4}}+1.7781{{x}_{2}}x_{3}^{2}+3.1661x_{1}^{2}{{x}_{4}}+19.84x_{1}^{2}{{x}_{3}}\)

Subject to: \({{g}_{1}}\left( x \right) =-{{x}_{1}}+0.0193{{x}_{3}}\le 0\)

\({{g}_{2}}\left( x \right) =-{{x}_{2}}+0.00954{{x}_{3}}\le 0\)

\({{g}_{3}}\left( x \right) =-\pi x_{3}^{2}{{x}_{4}}-\left( {4}/{3}\; \right) \pi x_{3}^{3}+1,296,000\le 0\)

\({{g}_{4}}\left( x \right) ={{x}_{4}}-240\le 0\)

Variable range: \(0\le {{x}_{i}}\le 100,\quad \quad i=1,2\)

\(0\le {{x}_{i}}\le 200,\quad \quad i=3,4\)
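
When such constrained problems are handed to a metaheuristic, a common approach is to fold the constraints into the objective with a penalty term. The sketch below encodes the pressure vessel model above in this way; the static penalty factor is an illustrative choice and is not taken from the paper, and the other three engineering problems can be wrapped analogously.

```python
import numpy as np

def pressure_vessel_cost(x, penalty_factor=1e6):
    """Objective of Sect. 5.2.1 with a simple static penalty (penalty_factor is an assumption)."""
    x1, x2, x3, x4 = x                                      # Ts, Th, R, L
    cost = (0.6224 * x1 * x3 * x4 + 1.7781 * x2 * x3**2
            + 3.1661 * x1**2 * x4 + 19.84 * x1**2 * x3)
    g = [
        -x1 + 0.0193 * x3,                                  # g1(x) <= 0
        -x2 + 0.00954 * x3,                                 # g2(x) <= 0
        -np.pi * x3**2 * x4 - (4.0 / 3.0) * np.pi * x3**3 + 1_296_000,   # g3(x) <= 0
        x4 - 240.0,                                         # g4(x) <= 0
    ]
    violation = sum(max(0.0, gi)**2 for gi in g)            # only violated constraints contribute
    return cost + penalty_factor * violation
```

The resulting `pressure_vessel_cost` function can be passed directly to an optimizer such as the `woa` or `vcswoa` sketches given earlier.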

Table 6 shows the results of VCSWOA compared with SMA, WOA, GWO, MFO, ACO, HPSO, and BA. It is clear that the proposed algorithm achieves the lowest cost.

Table 6 Optimization results for pressure vessel design problem

5.2.2 Welded beam design problem

This problem has four parameters: weld thickness (h), bar length (l), bar height (t), and bar thickness (b). Its mathematical formulation is shown below:

Minimize:

\(f(x)=1.10471\,x_{1}^{2}{{x}_{2}}+0.04811\,{{x}_{3}}{{x}_{4}}\left( 14.0+{{x}_{2}} \right)\)

Subject to:

\({{g}_{1}}\left( x \right) =\tau -13,600\le 0\)

\({{g}_{2}}\left( x \right) =\sigma -30,000\le 0\)

\({{g}_{3}}\left( x \right) ={{x}_{1}}-{{x}_{4}}\le 0\)

\({{g}_{4}}\left( x \right) =6000-p\le 0\)

Variable Range

\(0.125\le {{x}_{1}}\le 5\)

\(0.1\le {{x}_{2}}\le 10\)

\(0.1\le {{x}_{3}}\le 10\)

\(0.125\le {{x}_{4}}\le 5\)

Table 7 Optimization results for welded beam design problem

The results in Table 7 show that VCSWOA reaches the solution closest to the optimum compared with many metaheuristic algorithms.

5.2.3 Tension/compression spring design problem

The third problem is the tension/compression spring design problem, which has three parameters: coil diameter (D), wire diameter (d), and number of active coils (N). The mathematical formulation is shown below:

Minimize:

\(f(x)=\left( {{x}_{3}}+2 \right) {{x}_{2}}x_{1}^{2}\)

Subject to:

\({{g}_{1}}\left( x \right) =1-\frac{x_{2}^{3}{{x}_{3}}}{71{,}785\,x_{1}^{4}}\le 0\)

\({{g}_{2}}\left( x \right) =\frac{4x_{2}^{2}-{{x}_{1}}{{x}_{2}}}{12{,}566\left( {{x}_{2}}x_{1}^{3}-x_{1}^{4} \right) }+\frac{1}{5108\,x_{1}^{2}}-1\le 0\)

\({{g}_{3}}\left( x \right) =1-\frac{140.45{{x}_{1}}}{x_{2}^{2}{{x}_{3}}}\le 0\)

\({{g}_{4}}\left( x \right) =\frac{{{x}_{2}}+{{x}_{1}}}{1.5}-1\le 0\)

Variable Range

\(0.05\le {{x}_{1}}\le 2.00\)

\(0.25\le {{x}_{2}}\le 1.30\)

\(2.00\le {{x}_{3}}\le 15.00\)

The statistical results for this problem are shown in Table 8, in which the proposed algorithm is compared with GA, WOA, MVO, GSA, PSO, and MFO; it can be noticed that VCSWOA obtains the best result.

Table 8 Optimization results for the Tension/compression design problem

5.2.4 Cantilever beam design problem

The last engineering problem introduced in this subsection is the cantilever beam design problem, which consists of five hollow square cross-sections. It aims to minimize the mass of the cantilever beam. The problem formulation is shown as follows:

Minimize:

\(f(x)=0.6224(x_{1}+x_{2}+x_{3}+x_{4}+x_{5})\)

Subject to:

\(G(x)=61/x_{1}^{3}+37/x_{2}^{3}+19/x_{3}^{3}+7/x_{4}^{3}+1/x_{5}^{3}\le 1\)

Variable Range

\(0.1\le {{x}_{i}}\le 100,\qquad i=1,2,3,4,5\)

Table 9 Optimization results for cantilever beam problem

Table 9 compares VCSWOA with several algorithms: MFO, SOS, CS, MMA, and GCA. It can be seen that VCSWOA achieves the best result. Based on these results, the potential of the proposed WOA-based method is not limited to these cases, and it can be applied to more complex problems such as image classification [95, 96].

5.3 Image segmentation

In the literature, many techniques exist for image segmentation, such as Otsu's method [97] and Kapur's entropy [98], which divide the image histogram into different groups based on threshold values. Figure 3 shows the flowchart of the multilevel image segmentation (MIS) process for image 241004 from the Berkeley dataset. Here, to validate the VCSWOA algorithm, eight images from BSDS500 are used, namely: 291000, 38092, 86068, 170057, 61060, 175032, 223061, and 19021. Figure 4 shows these images and their 2D histograms. Several metaheuristic algorithms are used to compare their results with VCSWOA: WOA, BA, CS, CBA, SCA, BLPSO, IGWO, IWOA, and SCADE. Finally, the peak signal-to-noise ratio (PSNR) [99], structural similarity index (SSIM) [100], and feature similarity index (FSIM) [101] are used to evaluate the segmentation results. PSNR, SSIM, and FSIM are calculated from the following equations:

$$\begin{aligned} \text {PSNR}=20 \log _{10} \left( \frac{255}{\sqrt{\frac{\sum _{i=1}^{N}\sum _{j=1}^{M}(I_{i,j}-I_{\text {seg},i,j})^{2}}{N\cdot M}}}\right) \end{aligned}$$
(22)

where I and \(I_{\text {seg}}\) refer to the original and segmented images, respectively, \(I_{i,j}\) is the gray level of the (i, j)th pixel, and N and M are the image dimensions.

$$\begin{aligned} \mathrm{{SSIM}}(I, I_\mathrm{{seg}})=\frac{(2\mu _{I}\mu _{I_\mathrm{{seg}}}+c_{1})(2\sigma _{I,I_\mathrm{{seg}}}+c_{2})}{(\mu _{I}^{2}+\mu _{I_\mathrm{{seg}}}^{2}+c_{1})(\sigma _{I}^{2}+\sigma _{I_\mathrm{{seg}}}^{2}+c_{2})} \end{aligned}$$
(23)

where \(\mu _{I}\) and \(\mu _{I_\mathrm{{seg}}}\) refer to the mean intensities of I and \(I_\mathrm{{seg}}\), respectively, \(\sigma _{I}\) and \(\sigma _{I_\mathrm{{seg}}}\) refer to their standard deviations, and \(\sigma _{I,I_\mathrm{{seg}}}\) is the covariance between I and \(I_\mathrm{{seg}}\).
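
For completeness, the two metrics defined in Eqs. (22) and (23) can be computed as in the following sketch; the images are assumed to be 8-bit grayscale NumPy arrays of equal size, and the constants c1 and c2 follow the usual (k·255)^2 defaults, which the paper does not specify.

```python
import numpy as np

def psnr(I, I_seg):
    """PSNR of Eq. (22) for 8-bit grayscale images of equal shape."""
    mse = np.mean((I.astype(float) - I_seg.astype(float))**2)
    return float("inf") if mse == 0 else 20 * np.log10(255.0 / np.sqrt(mse))

def ssim_global(I, I_seg, c1=(0.01 * 255)**2, c2=(0.03 * 255)**2):
    """Single-window SSIM of Eq. (23); c1 and c2 are the usual defaults (assumption)."""
    I, I_seg = I.astype(float), I_seg.astype(float)
    mu1, mu2 = I.mean(), I_seg.mean()
    var1, var2 = I.var(), I_seg.var()
    cov = ((I - mu1) * (I_seg - mu2)).mean()
    return ((2 * mu1 * mu2 + c1) * (2 * cov + c2)) / ((mu1**2 + mu2**2 + c1) * (var1 + var2 + c2))
```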

$$\begin{aligned} \mathrm{{FSIM}}=\frac{\sum _{x \in \Omega }S_{L}(x) . \mathrm{{PC}}_{m}(x)}{\sum _{x \in \Omega } \mathrm{{PC}}_{m}(x)} \end{aligned}$$
(24)

where \(\Omega\) refers to the spatial domain of the whole image and \(S_{L}(x)\) and \(\mathrm{{PC}}_{m}\) can be calculated as follows:

$$\begin{aligned} S_{L}(x)=[S_\mathrm{{PC}}(x)]^{\alpha }[S_{G}(x)]^{\beta } \end{aligned}$$
(25)

where \(\alpha\) and \(\beta\) are parameters that weight the relative importance of the phase congruency (PC) and gradient magnitude (GM) terms; \(S_{G}(x)\) and \(S_\mathrm{{PC}}(x)\) are calculated as follows:

$$\begin{aligned} S_{G}(x)= & {} \frac{2 G_{1}(x) . G_{2}(x) + T_{2}}{G_{1}^{2}(x) + G_{2}^{2}(x) + T_{2}} \end{aligned}$$
(26)
$$\begin{aligned} S_\mathrm{{PC}}(x)= & {} \frac{2\mathrm{{PC}}_{1}(x).\mathrm{{PC}}_{2}(x)+ T_{1}}{\mathrm{{PC}}_{1}^{2}(x)+\mathrm{{PC}}_{2}^{2}(x)+ T_{1}} \end{aligned}$$
(27)
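
The thresholds themselves are obtained by maximizing a histogram-based criterion; since the convergence curves in Fig. 19 refer to Kapur's entropy, the sketch below gives a minimal Kapur-type objective [98] that the compared metaheuristics could maximize (or minimize after negation). The 256-bin normalized grayscale histogram is an assumption.

```python
import numpy as np

def kapur_entropy(thresholds, hist):
    """Kapur-type entropy criterion [98] for multilevel thresholding (to be maximized).

    `hist` is the normalized 256-bin gray-level histogram of the image and
    `thresholds` a sequence of threshold values in (0, 255)."""
    edges = [0] + sorted(int(t) for t in thresholds) + [256]
    total = 0.0
    for lo, hi in zip(edges[:-1], edges[1:]):
        w = hist[lo:hi].sum()                      # probability mass of this class
        if w <= 0:
            continue
        p = hist[lo:hi] / w
        p = p[p > 0]
        total += -np.sum(p * np.log(p))            # entropy of the class
    return total
```
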
Fig. 3 Flowchart of MIS for image 241004
Fig. 4 Three-dimensional view of the 2D histograms of images 19021 and 291000

Furthermore, Figs. 5 and 6 show the average PSNR for each threshold level and over all threshold levels, respectively. Figures 7 and 8 show the average SSIM for each threshold level and over all threshold levels, respectively. Figures 9 and 10 show the average FSIM for each threshold level and over all threshold levels, respectively. In addition, Tables 10, 11, 12, 13, 14, and 15 and Figs. 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, and 21 show the results of VCSWOA compared with the other metaheuristic algorithms.

Also, Tables 16, 17, and 18 show the rankings of the compared algorithms. It can be noticed that the proposed algorithm ranks first for almost all images.

According to our findings, the developed WOA-based technique obtains enhanced results, based on its core exploitative patterns, in engineering and image processing tasks. Hence, we suggest applying our WOA variant to problems such as the evaluation of human lower limb motions [102], lunar impact crater detection and age estimation [103], social recommendation and QoS-aware service composition [104,105,106], shape registration [107], and regression tasks [108]. Also, its continuous and binary variants can be applied to gate resource allocation [109, 110] and shape analysis [111, 112]. Furthermore, this WOA variant should be applied to more real-world problems to investigate its full explorative features in cases such as brain function prediction [113, 114], covert communication systems [115,116,117], epidemic prevention and control [118, 119], large-scale network analysis [120], energy storage planning and scheduling [121], medical diagnosis [95, 122,123,124], pedestrian dead reckoning [125], image dehazing [126,127,128], and feature selection [129,130,131].

Table 10 The PSNR comparison results of VCSWOA and other methods
Table 11 The PSNR comparison results of VCSWOA and other methods
Table 12 The SSIM comparison results of VCSWOA and other methods
Table 13 The SSIM comparison results of VCSWOA and other methods
Table 14 The FSIM comparison results of VCSWOA and other methods
Table 15 The FSIM comparison results of VCSWOA and other methods
Table 16 The PSNR comparison results of VCSWOA and other methods
Table 17 The SSIM comparison results of VCSWOA and other methods
Table 18 The FSIM comparison results of VCSWOA and other methods
Fig. 5 Average of PSNR at each threshold level
Fig. 6 Average of PSNR for all threshold levels
Fig. 7 Average of SSIM at each threshold level
Fig. 8 Average of SSIM for all threshold levels
Fig. 9 Average of FSIM at each threshold level
Fig. 10 Average of FSIM for all threshold levels
Fig. 11 Threshold values of 291000 obtained by each algorithm at level 6
Fig. 12 Threshold values of 223061 obtained by each algorithm at level 6
Fig. 13 Threshold values of 175032 obtained by each algorithm at level 6
Fig. 14 Threshold values of 170057 obtained by each algorithm at level 6
Fig. 15 Threshold values of 86068 obtained by each algorithm at level 6
Fig. 16 Threshold values of 61060 obtained by each algorithm at level 6
Fig. 17 Threshold values of 38092 obtained by each algorithm at level 6
Fig. 18 Threshold values of 19021 obtained by each algorithm at level 6
Fig. 19 Convergence curves of Kapur's entropy at threshold value 18
Fig. 20 Segmented results of 61060 obtained by each algorithm at threshold value 18
Fig. 21 Samples of the segmented images

6 Conclusion and future works

As the original WOA suffers from some core shortcomings, such as immature convergence and stagnation, we proposed an enhanced method, VCSWOA, that combines the active cores of the WOA and VCS optimizers in a new structure. A set of functions from the IEEE CEC2017 competition and four different engineering problems were utilized to validate the efficacy of the developed VCSWOA. Also, VCSWOA was verified on image segmentation problems over many threshold values. We observed that VCSWOA achieves better results than other WOA-based algorithms, namely CWOA, OBWOA, ACWOA, BMWOA, IWOA, BWOA, and CCMWOA. The proposed approach was also applied to several well-known image segmentation cases. The results indicate that the convergence speed and the quality of the search have improved significantly, based on the better balance between the essential diversification and intensification trends.

For future work, we intend to develop binary and multi-objective versions of the WOA-based method. Applying this variant to the training of neural networks is another valuable direction that is in line with the higher exploratory power of the VCSWOA.