New mutation strategies of differential evolution based on clearing niche mechanism

Li, Yanan; Guo, Haixiang; Liu, Xiao; Li, Yijing; Pan, Wenwen; Gong, Bing; Pang, Shaoning

doi:10.1007/s00500-016-2318-4

New mutation strategies of differential evolution based on clearing niche mechanism

Focus
Published: 29 August 2016

Volume 21, pages 5939–5974, (2017)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Soft Computing Aims and scope Submit manuscript

New mutation strategies of differential evolution based on clearing niche mechanism

Download PDF

Yanan Li ORCID: orcid.org/0000-0002-1915-9690^1,2,
Haixiang Guo^1,2,3,4,
Xiao Liu^1,2,
Yijing Li^1,2,
Wenwen Pan^1,2,
Bing Gong⁵ &
…
Shaoning Pang⁶

633 Accesses
10 Citations
Explore all metrics

Abstract

Although differential evolution (DE) algorithms have been widely proposed for tackling various of problems, the trade-off among population diversity, global and local exploration ability, and convergence rate is hard to maintain with the existing strategies. From this respective, this paper presents some new mutation strategies of DE by applying the clearing niche mechanism to the existing mutation strategies. Insteading of using random, best or target individuals as base vector, the niche individuals are utilized in these strategies. As the base vector is from a subpopulation, which is made up of the best individuals in each niche, the base vector can be guided by the global or local best ones. This mechanism is beneficial to the balance among population diversity, search capability, and convergence rate of DE, since it can both enhance the population diversity and search capability. Extensive experimental results indicate that the proposed strategies based on clearing niche mechanism can effectively enhance DE’s performance.

Graphical Abstract

An adaptive mutation strategy correction framework for differential evolution

Article 08 February 2023

Dual mutations collaboration mechanism with elites guiding and inferiors eliminating techniques for differential evolution

Article 09 November 2021

Refining differential evolution with mutation rate and neighborhood weight local search

Article 23 November 2023

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

The differential evolution (DE) (Storn and Price 1995, 1996) algorithm was proposed by Price and Storn in 1995 and has been a very competitive form of evolutionary computing afterward. As a simple powerful search technique, DE is always employed for solving complex continuous nonlinear functions. With a random population by initializing solutions, the DE algorithm employs mutation and crossover operators to generate new candidate solutions, and utilizes a simple selection operator to determine whether the offspring should replace their parents in the next generation. Compared with most other evolutionary algorithms (EAs), DE is simpler and much easier to be implemented. Moreover, the gross performance of DE in terms of accuracy, fast convergence speed and robustness makes it as an attractive algorithm to be applied on various real-world optimization problems. In addition, the number of control parameters in DE is very few (mutation control parameter, also called scaling factor , crossover control parameter, also called crossover rate and population size in classical DE). Therefore, the DE algorithm has gained much attention with successful applications in data mining (Zhu et al. 2012; Tvrdík and Křivý 2015), scheduling (Mokhtari and Salmasnia 2015), construction engineering (Ho-Huu et al. 2015), pattern recognition (Secmen and Tasgetiren 2013), signal processing (Sheniha 2013), chemical engineering (Sharma and Rangaiah 2013), power system (Zhang et al. 2015), image processing (Ali et al. 2014), and in other domains (Al-Dabbagh et al. 2014; Rakshit and Konar 2015; Das and Prasad 2015).

Nevertheless, the performance (Liu and Lampinen 2002) of the DE algorithm is sensitive to the mutation strategy, which exists many different DE trial vector generation strategies and respective control parameters such as the population size (NP), crossover rate (Cr) and the scale factor (F). In various search phases of the evolution process, these strategies often possess different searching capabilities. And, the best settings of the control parameters vary for different optimization problems, and for different requirements on consumption time and accuracy when the optimization problem is same. Therefore, it is necessary to find the most appropriate strategy and its corresponding parameters. To achieve this, however, a process of trial-and-error search is needed to be performed, which definitely will suffer a high computation cost. Computational costs may be induced by the fact that the population of DE is evolved through different regions in the search space, within which different strategies and respective different parameter settings (Qin et al. 2009). Recently, various offspring generation strategies and parameter adaptation mechanisms have been developed to enhance the reliability and robustness of DE. For example, Brest and Mernik (2008) presented the jDE algorithm, which owned self-adapting parameters and by encoding the parameters into each individual and adapting them by means of evolution, and obtained the mutation strategy as DE/rand/1. Qin and Suganthan (2005) presented a self-adaptive variant of DE (SaDE), where trial vector generation strategies were gradually self-adapted by learning from their prior experiences in generating promising solutions. Wang et al. (2011) proposed a systematic framework for combining different trial vector generation strategies, called composite DE (CoDE), in which three well-studied offspring generation strategies are coupled with three-parameter settings randomly to generate trial vectors in the following way. Mallipeddi et al. (2011) proposed an ensemble of mutation strategies and parameter values for DE (EPSDE). Mutation strategies from mutation strategies pool can involve corresponding parameters from control parameters’ values pool to produce their own offspring. Then, the optimized offspring can be obtained by further competing. Zhang and Sanderson (2009) presented JADE algorithm, which implemented a new mutation strategy DE/target-to-pbest/1 and updated control parameters in an adaptive manner. Based on JADE, Brown et al. (2015) proposed a new DE with small population, namely ${\upmu }\hbox {JADE}$. In ${\upmu }\hbox {JADE}$, a new mutation strategy DE/current-by-rand-to-pbest/1 is introduced. Kundu et al. (2014) presented a modified semi-adaptive DE, namely MSeDE. In MSeDE, a new mutation scheme DE/current-to-constr_best/1 and a new crossover scheme p-BCX are used. Yu et al. (2014) presents an adaptive DE algorithm, namely ADE. In ADE, a new mutation strategy DE/lbest/1 and a two-level adaptive parameter control scheme are used. The new strategy is a variant of DE/best/1 strategy, which multiple locally best individuals instead of one globally and the two-level adaptive parameter control scheme includes population-level and individual-level parameter control. Mohamed (2015) presented an improved DE algorithm, namely IDE. In IDE, a new triangular mutation strategy based on the convex combination vector of the triplet, which is defined by three randomly chosen vectors and the difference vector between the best and the worst individuals among the three randomly selected vectors is introduced. Liu et al. (2014) proposed a random-based differential evolution with neighborhood mutation, namely NRDE. In NRDE, two mutation schemes are used. The mutation schemes are random-based mutation scheme and neighborhood mutation scheme, the best vector of which will be replace the target vector. Han et al. (2013) presented a differential evolution with local information, namely DELI. In DELI, considering both global information and local information, a new mutation operation is applied to generate a mutated individual. In order to get an appropriate combination of strategies and control parameters for different problems, many other adaption techniques have been developed (Tang et al. 2015; Wang et al. 2014; Cai and Wang 2015; Mallipeddi and Lee 2015). Although different partial adaptation schemes have been proposed to overcome the trial-and-error procedure, many strategies are hard to maintain the balance between the global exploration and the local exploitation. Here, we demonstrate the superior performance of the proposed mutation strategy with various DE algorithms based on niche on maintaining he balance between the global exploration and the local exploitation.

In the paper, the clearing niche method (Petrowski 1996) is integrated with mutation strategies to enhance population diversity, improve the search ability, and accelerate the convergence rate of DE algorithms. The niche techniques (Petrowski 1996; Mahfoud 1995; Yin and Germay 1993; Petrowski and Genet 1999; Sareni and Krahenbuhl 1998) are regarded as effective methods to maintain the balance between both population diversity and the search domain. They aim at gathering the individuals on several peaks of fitness function in the population according to genetic likeness, and then permit DE to investigate those peaks in parallel. The individual with a high fitness in the niche is keeping its fitness, while the others in the niche are changed to reduce their fitness values sharply. Hence, the individuals in the population may be dispersed into the whole search space. Thus, some diversity can be maintained effectively during the generations in the population. Several niche techniques have been proposed, such as crowding methods (Mahfoud 1995), clustering-based methods (Yin and Germay 1993), speciation tree methods (Petrowski and Genet 1999), fitness-sharing methods (Sareni and Krahenbuhl 1998), and clearing methods (Petrowski 1996). Also, some niche-based DE algorithms have been presented. Thomsen (2004) presented a DE algorithm based on crowding and fitness-sharing scheme to tackle multimodal optimization, namely CrowdingDE and SharingDE. Different from conventional DE, CrowdingDE modifies the selection operation, in which the offspring replaces the most similar individual among a (the crowding factor) subset of the population. In the selection operation of SharingDE, all parents and offsprings are added to the population enlarging the population, and all the individuals are rescaled using the sharing function, then the individuals are sorted with the new fitness and the worst half of the population will be removed. Li (2005) extended DE with the notion of speciation for solving multimodal optimization problems, namely SDE. SDE locates multiple global optima simultaneously through the adaptive formation of multiple species. Each species is evolved by its own DE process, which tries to successively improve itself. Epitropakis et al. (2012) proposed two new DE mutation strategies, namely DE/nrand/1 and DE/nrand/2. In DE/nrand/1 and DE/nrand/2, the local information from the current population is incorporated into the mutation schemes, when each individual is evolved by applying its nearest neighbor individual as a base vector. Based on DE/nrand/1, Epitropakis et al. (2013) proposed a new niching DE algorithm with dynamic archive to overcome the population size influence and produce good performance almost independently of its population size, namely dADE/nrand/1 algorithm, which involves in incorporation between a control parameter adaptation technique and an external dynamic archive along with a re-initialization mechanism. The control parameter adaptation technique, proposed in the context of JADE algorithm (Zhang and Sanderson 2009), is designed to efficiently adapt the control parameters of the algorithm. Meanwhile, the external dynamic archive along with a re-initialization mechanism aims to alleviate the problem, that is, have to tune the population size and allow the algorithm to maintain good performance regardless of the population size value. The control parameter adaptation technique incorporates a dynamic archive proposed in Zhai and Li (2011), aiming to record good solutions found along with a re-initialization procedure to continue searching for additional good solutions in unexplored regions of the search space. Biswas et al. (2014) presents a DE variant with parent centric mutation that makes use of normalized search neighborhood and integrates with the proximity-based crowding technique, namely PNPCDE. The PNPCDE does not make use of the problem-dependent niching parameters (like niche radius), which are hardly determined the values. The mutation operator helps to maintain the population diversity at an optimum level by using well-defined local neighborhoods. Zhang et al. (2015) presented a DE with dynamic niche radius strategy, namely DNRDE. In DNRDE, the niche radius is adjusted dynamically to make the algorithm navigate from global exploration to local exploitation by a new two-stage annealing schedule. At first stage, exploration dominates the search process, and the radius decrease dynamically. When the niche radius reaches to a cutoff value, it will be stable. At second stage, exploitation takes over to enhance the quality of the acquired optima.

In this paper, the clearing niche method is used in the mutation strategy. Compared with other niche techniques, such as speciation tree and sharing fitness methods, the clearing niche method may maintain the population diversity effectively with a lower population size, and is also simpler to implement. However, in contrast to the speciation tree method, the clearing niche method needs a problem-dependent parameter, namely, the niche radius. In such case, the DE algorithm will be sensitive to niche radius, since its performance will be altered by changing the setting of niche radius, this impact of radius changes on algorithms will be demonstrated by numerical experiment in this paper.

The paper is organized as follows. In Sect. 2, we introduce the basic differential evolution and several well performance DE variants. In Sect. 3, we describe the proposed mutation strategy in details, and we introduce the change mutation strategies in four state-of-the-art DE variants. In Sect. 4, the paper lists the functions, and their corresponding simulated diagrams. In Sect. 5, numerical experiments are presented. In Sect. 6, the concluding remarks are contained.

2 Differential evolution algorithms

In this section, we describe the basic differential evolution and several variants of DE.

2.1 Brief description of basic differential evolution

There are several strategies of DE that are proposed in the literature (Zaharie 2009), one of which DE/rand/1/bin is used widely. Accordingly, we choose DE/rand/1/bin as example to introduce DE. Just like other Evolution Algorithms (EAs), DE uses mutation and crossover to generate new individuals. DE has four basic operations involving in initialization, mutation, crossover, and selection. The whole flow chart of DE is shown in Fig. 1. In DE-literature, a parent vector from the current generation is called target vector, a mutant vector obtained through the differential mutation operation is known as donor vector, and finally an offspring formed by recombining the donor with the target vector is called trial vector. The details of DE operations are described as follows.

(1) Coding. DE is a global optimization algorithm, and individuals in population are encoded using real number.

(2) Individual. NP denotes size of the population in DE. The ith individual at Gth generation is denoted by $X_{i,G} =[{x_{1,i,G} ,x_{2,i,G}, x_{3,i,G}, \ldots , x_{D,i,G}}]$, where D is dimension.

(3) Initializing population. The initial population (at $G=0)$ should cover the entire search space as much as possible by uniformly randomizing individuals within the search space constrained by the prescribed minimum and maximum bounds: $x_{j,\mathrm{min}} \in X_\mathrm{min} =\{{x_{1,\mathrm{min}} ,x_\mathrm{2,min} ,\ldots ,x_{D,\mathrm{min}}}\}$ and $x_{j,\mathrm{max}}\in X_{\mathrm{max}} =\{{x_{1,\mathrm{max}}, x_{2,\mathrm{max}}, \ldots ,x_{D,\mathrm{max}}}\}$. Hence the jth component of the ith individual should be initialized as $x_{j,i,0} =x_{j,\mathrm{min}} +{rand}_{i,j}(0,1)\times (x_{j,\mathrm{max}}-x_{j,\mathrm{min}})$.

(4) Mutation. Differential idea is embodied into mutation operation. According to the strategy DE/rand/1/bin, the general process of mutation is expressed by Eq. (1):

$$\begin{aligned} V_{i,G} =X_{r_1 ,G}+F\times \left( {X_{r_{2} ,G}-X_{r_{3} ,G}}\right) \end{aligned}$$

(1)

where i is the ith individual vector of current generation. $X_{r_1 ,G}$, $X_{r_{2}, G}$ and $X_{r_{3}, G}$ are other three individuals, which are from current generation, where $i\ne r_1 \ne r_{2}\ne r_{3}$. $V_{k,G}$ is donor vector. $F\in [0,1]$ is mutation control parameter.

There are usually following differential evolution strategies when crossover operation is bin:

DE/rand/1/bin: $V_{i,G}=X_{r_1 ,G} +F\times (X_{r_{2} ,G}-X_{r_{3}, G})$
DE/best/1/bin: $V_{i,G} =X_{\mathrm{best},G} +F\times (X_{r_1 ,G}-X_{r_{2}, G})$
DE/target-to-best/1/bin: $V_{i,G} =X_{i,G} +F\times (X_{\mathrm{best},G} -X_{i,G})+F\times (X_{r_1 ,G} -X_{r_{2} ,G})$
DE/best/2/bin: $V_{i,G} =X_{\mathrm{best},G} +F\times (X_{r_1,G} -X_{r_{2}, G})+F\times (X_{r_{3} ,G} -X_{r_{4} ,G})$
DE/rand/2/bin: $V_{i,G} =X_{r_1 ,G} +F\times (X_{r_{2} ,G} -X_{r_{3},G})+F\times (X_{r_{4} ,G} -X_{r_{5} ,G})$

where $X_{\mathrm{best},G}$ means the best individual in the Gth generation. $X_{i,G}$ means the ith individual in the Gth generation. $X_{r_1, G}$, $X_{r_{2}, G} $, $X_{r_{3} ,G} $, $X_{r_4 ,G} $, and $X_{r_{5}, G}$ are other five individuals, which are from current generation, where $i\ne r_1 \ne r_{2} \ne r_{3} \ne r_4 \ne r_{5} $.

(5) Crossover. Crossover probability is $Cr\in [0,1]$. Crossover means to swap the dimensions between the donor vector and the target vector controlled by crossover parameter Cr. Binomial crossover and exponential crossover are two different crossover strategies. Mahfoud (1995) analyzed the influence of crossover on the behavior of DE. The numerical experiments illustrate that main difference between them is the different distributions of the number of mutated components. Theoretical analysis shows that the behavior of exponential crossover variants is more sensitive to the problem dimension than binomial crossover variants’. Usually the binomial crossover is accepted, which is described as Eq. (2):

$$\begin{aligned} u_{j,i,G} =\left\{ {{\begin{array}{ll} v_{j,i,G}, &{}\quad \hbox {if}\left( rand_{i,j}(0,1)\le Cr\, or\, j=j_{rand}\right) \\ x_{j,i,G}, &{}\quad \hbox {otherwise}\\ \end{array}}}\right. \end{aligned}$$

(2)

where j is dimension, i is individual, G is generation. $v_{j,i,G} $ is coming from donor vector $V_{k,G} $, $x_{j,i,G}$ is coming from target vector $X_{k,G}. j_{rand} \in [1,2,\ldots , D]$ is a randomly chosen index.

(6) Selection. The offspring or trial vector $X_{i,G+1}$ can be obtained through comparing the fitness value of trial vector $U_{i,G}$ and target vector $X_{i,G}$ according to Eqs. (3) and (4).

$$\begin{aligned} X_{i,G+1}= & {} U_{i,G}\, \quad if\, f\left( {U_{i,G} } \right) \le f\left( {X_{i,G}}\right) \end{aligned}$$

(3)

$$\begin{aligned} X_{i,G+1}= & {} X_{i,G}\, \quad if\, f\left( {U_{i,G} } \right) >f\left( {X_{i,G} } \right) \end{aligned}$$

(4)

2.2 Some variants of DE

(1)
The jDE algorithm

Brest and Mernik (2008) proposed the jDE algorithm, in which the control parameters F and Cr are encoded into the individual $X_{i,G} =\langle \vec {x}_{i,G}, F_{i,G}, Cr_{i,G}\rangle $ and adjusted by two new arguments $\tau _{1}$ and $\tau _{2}$. They are calculated independently, as shown in Eqs. (5), (6)

$$\begin{aligned}&F_{i,G+1}=\left\{ {{\begin{array}{ll} F_{l,G} +rand_{1}\times F_{u,G},&{}\quad \hbox {if } rand_{2}<\tau _{1}\\ F_{i,G}, &{} \quad \hbox {otherwise} \\ \end{array}}}\right. \end{aligned}$$

(5)

$$\begin{aligned}&Cr_{i,G+1}=\left\{ {{\begin{array}{ll} rand_{3}, &{}\quad \hbox {if } rand_{4}<\tau _{2}\\ Cr_{i,G}, &{}\quad \hbox {otherwise} \\ \end{array}}}\right. \end{aligned}$$

(6)

where $rand_k$, $k\in \{1,2,3,4\}$ are uniformly distributed random values belonging to the range [0, 1]; $\tau _{1}$ and $\tau _{2}$ are constant values that represent the probabilities of parameters being adjusted; $F_{u,G}$ and $F_{l,G}$ are also constant values, which denote the upper and lower bounds of the parameters, respectively. The newly generated $F_{i,G+1}$ and $Cr_{i,G+1}$ are procured before the mutation is implemented. Thus, the scheme influences the mutation, crossover and selection operations of the new vector.

(2)
The JADE algorithm

Zhang and Sanderson (2009) proposed the JADE algorithm, in which a novel mutation strategy and an optional external archive are utilized to provide information of progress direction. This DE/target-to-best strategy uses multiple best solutions to balance the greediness of the mutation and the diversity of the population, which is generated using Eq. (7):

(7)

where $X_{best,G}^{p}$ is the individual that is randomly selected from the top 100p % of the current population, with $p\in (0, 1]$. $X_{i,G}$, $X_{\mathrm{best},G}^{p}$ and $X_{r1,G}$ are chosen from the current population P. $\tilde{X}_{r2,G}$ is randomly selected from the union, $P\cup A$, while A, an archive, is employed to store the recently explored inferior solutions. The archive operation is made very simplely to avoid significant computation overhead. The archive is initiated to be empty. Then, after each generation, the parent solutions that fail in the selection process are added to the archive. If the archive size exceeds a certain threshold, say NP, then some solutions are randomly removed from the archive to keep the archive size at NP. $F_{i,G}$ denotes the scaling factor and $Cr_{i,G}$ denotes the crossover rate associated with the ith individual and they are updated dynamically in each generation, as shown in Eqs. (8), (9).

$$\begin{aligned}&F_{i,G}=randc_{i}\left( u_F ,0.1\right) \end{aligned}$$

(8)

$$\begin{aligned}&Cr_{i,G}=randn_{i}\left( u_{Cr}, 0.1\right) \end{aligned}$$

(9)

$F_{i,G} $ and $Cr_{i,G}$ are generated according to a Normal distribution and a Cauchy distribution with associated mean values $u_{F}$ and $u_{Cr}$. The proposed two location parameters are initialized to be 0.5 and then updated at the end of each generation according to Eqs. (10), (11).

$$\begin{aligned}&u_F=\left( 1-c\right) \times u_F +c\times \mathrm{mean}_{L}\left( S_{F}\right) \end{aligned}$$

(10)

$$\begin{aligned}&u_{Cr}=\left( 1-c\right) \times u_{Cr}+c\times \mathrm{mean}_{A}\left( S_{Cr}\right) \end{aligned}$$

(11)

where c is a positive constant in the range(0,1); $S_F$ and $S_{Cr} $ denote the set of all successful mutation/crossover rates; $\mathrm{mean}_{A}(\cdot )$ indicates the usual arithmetic mean and $\mathrm{mean}_{L}(\cdot )$ returns the Lehmer mean shown as Eq. (12).

$$\begin{aligned} \mathrm{mean}_{L}\left( S_{F}\right) =\frac{\sum \nolimits _{i=1}^{\left| S_{F}\right| } F_{i}^{2}}{\sum \nolimits _{i=1}^{\left| S_{F}\right| }F_i} \end{aligned}$$

(12)

How to get archive A, $S_{F}$ and $S_{Cr}$.

In the selection process, A, $S_F $ and $S_{Cr}$ will be got.

$$\begin{aligned}&\hbox {If }\quad f(X_{i,G})\le f(U_{i,G})\quad X_{i,G+1}=X_{i,G}\\&\hbox {Else }\quad X_{i,G+1}=U_{i,G}; X_{i,G}\rightarrow A;Cr_{i}\rightarrow S_{Cr} ,F_i \rightarrow S_{F} \end{aligned}$$

(3)
The SaDE algorithm

Qin and Suganthan (2005) presented the SaDE, where one trial vector generation strategy was chosen from the candidate pool(“DE/rand/1”, “DE/rand/2”,“DE/target-to-best/2” and “DE/target-to-rand/1”), according to the probability learned from its success rate in generating promising solutions within a certain number of previous generations, called the learning period(LP). More specifically, these probabilities are initially equal and then gradually self-adapted upon Eq. (13)

$$\begin{aligned} P_{k,G}=\frac{S_{k,G}}{\sum \nolimits _{k=1}^K S_{k,G}} \end{aligned}$$

(13)

where $P_{k,G}$, $k=1,2,\ldots ,K$, denotes the probability of applying the kth strategy. Here K is the total number of strategies contained in the pool. $S_{k,G}$ is the success rate of the trial vector, which is generated by the kth strategy and successfully enters the next generation according to Eq. (14):

$$\begin{aligned} S_{k,G} =\frac{\sum \nolimits _{g=G-LP}^{G-1} ns_{k,g}}{\sum \nolimits _{g=G-LP}^{G-1} ns_{k,g}+\sum \nolimits _{g=G-LP}^{G-1} nf_{k,g}}+\varepsilon \end{aligned}$$

(14)

where $ns_{k,g}$ and $nf_{k,g}$ record the number of trial vectors generated by the kth strategy that are retained or discarded in the selection operation in the last LP generations. The small constant value $\varepsilon =0.01$ is used to avoid the possible null success rates. At each generation, for each solution in the current population, the parameters $F_{i,k}$ and $Cr_{i,k}$ are independently calculated upon Eqs. (15), (16).

$$\begin{aligned}&F_{i,k}=randn_i \left( {0.5,0.3}\right) \end{aligned}$$

(15)

$$\begin{aligned}&Cr_{i,k}=randn_i \left( {Crm_k ,0.1}\right) \end{aligned}$$

(16)

where coefficients are respectively generated for each individual by sampling their values from a normal distribution. Nevertheless, the mean value of Cr ($Crm_{k}$) is gradually self-adapted on the basis of a success rule.

(4)
The EPSDE algorithm

Mallipeddi et al. (2011) provided the EPSDE involving a pool of mutation strategies along with various combinations of parameters, which are employed for competing in order to produce successful offspring population. More concretely, the pool of strategies is formed with three schemes with diverse characteristics. The pool of Cr values is taken in the range [0.1, 0.9] in steps of 0.1, and the pool of F values is assigned in the range [0.4, 0.9] in steps of 0.1 as follows:

A pool of mutation strategies includes DE/best/2, DE/rand/1 and DE/target-to-rand/1;

A pool of F values includes $F=[0.4,0.5,0.6,0.7,0.8,0.9]$
A pool of Cr values includes $Cr=[0.1,0.2,0.3,0.4,0.5,0.6,0.7,0.8,0.9].$

In EPSDE, each individual of the initial population is randomly adopted with a mutation strategy and associated parameter values taken from the respective pools. When the trial vector performs better than the target vector, the combination of the mutation strategy and parameter values can be survived in the next generation. And the combination is also stored. Afterward, the target vector should be randomly reinitialized with a new mutation strategy from the respective pools or from the successful combinations stored with equal probability when the trial vector performs poorer.

3 Differential evolution algorithm based on niche

In this section, we propose some new mutation strategies which are based on clearing niche mechanism. It will be explained in the following sections.

3.1 The niche operation based on clearing mechanism

The main idea of the niche algorithm based on clearing mechanism is that the population is divided into a number of niches. And each subpopulation contains a dominant individual: the one that has the best fitness. If the dissimilarity between an individual and the dominant one in a given subpopulation is less than a threshold ${\updelta }$: the clearing radius, this individual, then, belongs to this subpopulation. The basic clearing algorithm preserves the fitness of the dominant individual, but resets the fitness of all the other individuals of the same subpopulation. Consequently, the clearing procedure makes the whole resource of a niche belong to a single individual: the winner. The winner takes all rather than sharing resources with the other individuals of the same niche, and other individuals as losers will take a punishment mechanism, this mechanism differs from the fitness-sharing methods. It is also possible to generalize the clearing algorithm by accepting several winners in each niche. The capacity of a niche is defined as the maximum number of winners that this niche can accept. We assume that the population size is NP, the individuals which perform better in each niche are winners, in contrast, perform worse are loser. Finally, process of flow is shown in Fig. 2.

3.2 Mutation strategies based on clearing niche

We all know that the mutation strategies play an important role in the search capability and convergence rate of DE. However, many mutation strategies are hard to maintain a balance among a good population diversity, a good global exploration ability, a good local exploitation ability, and a fast convergence rate. Clearly, from the equation of the basic mutation strategy DE/rand/1/bin, it can be seen that three vectors are randomly chosen, one of which is the base vector. As a result, DE/rand/1/bin is able to maintain the population diversity and the global search ability, but it cannot guarantee the local search ability and the convergence rate. From the equation of the basic mutation strategy DE/best/1/bin, it can be observed that the base vector is the globally best vector and the other two vectors are randomly chosen. In this way, all the vectors are guided by the best vector. Such greedy strategy is helpful for the local search ability and the convergence rate. However, the greedy strategy may lose its population diversity and global search ability. In order to maintain a balance among the population diversity, the global exploration ability, the local exploitation ability, and the convergence rate, we present some new mutation strategies, which combine with the clearing niche mechanism.

In the proposed strategies, the population is divided into some niches according to the sort of the fitness values. Each niche contains several winners whose fitness values are better than others. And all the winners will make up a new subpopulation. Note that the number of niches may be change and the number of winners in each niche will be kept unchanged during the evolution of algorithms. As aforementioned, the mutation strategy DE/best/1 obtains a best vector as the base vector. In the new mutation strategies, the base vector is selected from the group which the subpopulation instead of the entire population. Therefore, all the vectors are guided by the subpopulation which is made of several locally best vectors rather than the randomly vector or the single globally best vector. At the same time, the difference vectors involved in the mutation strategies are selected from the entire population. In such way can maintain a balance among the population diversity, the global exploration ability, the local exploitation ability, and the convergence rate.

Differing from the existing clearing niche mechanisms, we adopt a new idea of dealing with the distance. Here, it calculates the distances from the best individual and other individuals in current status respectively by using Euclidean distance. Moreover, a different normalization method is proposed to calculate the relative distance based on maximum and minimum distances for each niche. In addition, different niches vary maximum distance and minimum distance. Consequently, the clearing radius is the same whereas the ranges of niches are different. Hence, the number of each niche is not only related to individual density but also related to the maximum distance and minimum distance.

For example, if the population size is set as 50, the dimension is set as 2, the clearing radius is set as 0.3. And the fitness is the minimum value of sum of squares. We can get the two-dimensional drawing of population. Then, we can see the range of each niche in Fig. 3. The figure can be found that the range of each niche is different.

When we combine the clearing niche mechanism with several classical mutation strategies, we set a changed solution. In the changed solution, the target individual will have priority, and the best individual will be considered next. At last, if the mutation strategy do not contain the target or best individual, the rand individual will be changed which does not multiply the scaling factor F. After modifying, the usual differential evolution strategies DE/rand/1, DE/best/1, DE/target-to-best/1, DE/best/2, and DE/rand/2 can be seen in the following.

DE/rand/1/bin:

$$\begin{aligned} V_{i,G} =X_{r1,G} +F\times \left( {X_{r2,G} -X_{r3,G} } \right) \end{aligned}$$

(17)

DE/best/1/bin:

$$\begin{aligned} V_{i,G} =X_{\mathrm{best},G} +F\times \left( {X_{r2,G} -X_{r3,G} } \right) \end{aligned}$$

(18)

DE/rand/1 based on clearing niche mechanism, we name it DE/clear niche/1. And DE/best/1 based on clearing niche mechanism, we also name it DE/ clear niche /1. As illustrated in Fig. 4, in DE/clear niche/1, a mutation vector is generated in the following manner:

$$\begin{aligned} V_{i,G} =X_{c,G} +F\times \left( {X_{r2,G} -X_{r3,G} } \right) \end{aligned}$$

(19)

where $X_{c,G}$ is randomly chosen individual from subpopulation which preserves the dominant individual by clearing niche mechanism. $X_{best,G}$ is the best individual in the current generation. $X_{r1,G} \quad X_{r2,G} $, and $X_{r3,G}$ are randomly chosen three individuals from the current generation, where $r1\ne r2\ne r3\ne i$. G is the current generation. $V_{i,G} $ is the ith donor vector. F is the scaling factor.

DE/target-to-rand/1/bin:

$$\begin{aligned} V_{i,G} =X_{i,G} {+}F\times \left( {X_{r3,G} {-}X_{i,G} } \right) {+}F\times \left( {X_{r1,G} -X_{r2,G} } \right) \nonumber \\ \end{aligned}$$

(20)

DE/target-to-best/1 based on clearing niche mechanism, we name it DE/ clear niche -to-best/1. In DE/ clear niche-to-best/1, a mutation vector is generated in the following manner:

$$\begin{aligned} V_{i,G} =X_{c,G} +F\times \left( {X_{r3,G} -X_{c,G} +X_{r1,G} -X_{r2,G} } \right) \nonumber \\ \end{aligned}$$

(21)

where $X_{c,G} $ is randomly chosen individual from subpopulation which preserves the dominant individual by clearing niche mechanism. $X_{r1,G} \quad X_{r2,G} $, and $X_{r3,G} $ are randomly chosen three individuals from the current generation, where $r1\ne r2\ne r3\ne i$. G is the current generation. $V_{i,G} $ is the ith donor vector. F is the scaling factor.

DE/best/2/bin:

$$\begin{aligned}&V_{i,G}\nonumber \\&=X_{best,G} {+}F\times \left( {X_{r1,G} -X_{r2,G}} \right) {+}F\times \left( X_{r3,G} {-}X_{r4,G}\right) \nonumber \\ \end{aligned}$$

(22)

DE/rand/2/bin:

$$\begin{aligned} V_{i,G} =X_{r1,G} +F\times \left( {X_{r2,G} -X_{r3,G} } \right) +F\times \left( {X_{r4,G} -X_{r5,G} } \right) \end{aligned}$$

(23)

DE/best/2 based on clearing niche mechanism, we name it DE/ clear niche /2. And DE/rand/2 based on clearing niche mechanism, we also name it DE/ clear niche /2. In DE/ clear niche /2, a mutation vector is generated in the following manner:

$$\begin{aligned} V_{i,G} =X_{c,G} +F\times \left( {X_{r1,G} -X_{r2,G} +X_{r3,G} -X_{r4,G} } \right) \nonumber \\ \end{aligned}$$

(24)

where $X_{c,G} $ is randomly chosen individual from subpopulation which preserves the dominant individual by clearing niche mechanism. $X_{best,G} $ is the best individual in the current generation. $X_{r1,G} , X_{r2,G} , X_{r3,G} , X_{r4,G}$, and $X_{r5,G}$ are randomly chosen five individuals from the current generation, where $r1\ne r2\ne r3\ne r4\ne r5\ne i$. G is the current generation. $V_{i,G} $ is the ith donor vector. F is the scaling factor.

We can see the flowchart of the new DE combined with clearing niche mechanism in Fig. 5 when the mutation strategy is DE/rand/1.

3.3 DE variants based on clearing niche

We can see the flowchart of DE with the new mutation strategy based on clearing niche mechanism in Fig. 5. And when the new mutation strategy based on clearing niche mechanism is applied to variants of DE, the corresponding mutation strategy of DE will change. It can be seen in the following section.

(1)
The jDE algorithm

In jDE algorithm, the mutation strategy is DE/rand/1. We can see change of the mutation strategy as following.

The initial mutation strategy:

$$\begin{aligned} V_{i,G} =X_{r1,G} +F\times \left( {X_{r2,G} -X_{r3,G} } \right) \end{aligned}$$

(17)

The mutation strategy based on clearing niche mechanism:

$$\begin{aligned} V_{i,G} =X_{c,G} +F\times \left( {X_{r1,G} -X_{r2,G} } \right) \end{aligned}$$

(19)

(2)
The JADE algorithm

In JADE algorithm, the mutation strategy is DE/target-to-pbest/1. We can see change of the mutation strategy as following.

The initial mutation strategy:

$$\begin{aligned} V_{i,G} =X_{i,G} +F\times \left( {X_{\mathrm{best},G}^p -X_{i,G} +X_{r1,G} -\tilde{X} _{r2,G} } \right) \end{aligned}$$

(25)

The mutation strategy based on clearing niche mechanism:

$$\begin{aligned} V_{i,G} =X_{c,G}+F\times \left( {X_{\mathrm{best},G}^p -X_{c,G} +X_{r1,G} -\tilde{X} _{r2,G}}\right) \end{aligned}$$

(26)

(3)
The SaDE algorithm

In SaDE algorithm, the mutation strategies are DE/rand/1, DE/rand/2, DE/target-to-best/2, DE/target-to-rand/1. We can see change of the mutation strategy as following.

The initial mutation strategy:

$$\begin{aligned} V_{i,G}= & {} X_{r1,G} +F\times \left( {X_{r2,G} -X_{r3,G} } \right) \end{aligned}$$

(17)

$$\begin{aligned} V_{i,G}= & {} X_{r1,G} +F\times \left( {X_{r2,G} -X_{r3,G} +X_{r4,G} +X_{r5,G} } \right) \nonumber \\ \end{aligned}$$

(23)

$$\begin{aligned} V_{i,G}= & {} X_{i,G} +F\times \left( X_{best,G} -X_{i,G}\right. \nonumber \\&\left. +X_{r1,G}-X_{r2,G} +X_{r3,G} -X_{r4,G}\right) \end{aligned}$$

(27)

$$\begin{aligned} V_{i,G}= & {} X_{i,G} +F\times \left( {X_{r3,G} -X_{i,G} +X_{r1,G} -X_{r2,G}}\right) \nonumber \\ \end{aligned}$$

(20)

The mutation strategy based on clearing niche mechanism:

$$\begin{aligned} V_{i,G} =X_{c,G} +F\times \left( {X_{r1,G} -X_{r2,G} } \right) \end{aligned}$$

(19)

$$\begin{aligned} V_{i,G} =X_{c,G} +F\times \left( {X_{r2,G} -X_{r3,G} +X_{r4,G} +X_{r5,G} } \right) \quad \end{aligned}$$

(24)

$$\begin{aligned} V_{i,G}= & {} X_{i,G} +F\times \left( X_{c,G} -X_{i,G} +X_{r1,G}\right. \nonumber \\&\left. -X_{r2,G} +X_{r3,G} -X_{r4,G}\right) \end{aligned}$$

(28)

$$\begin{aligned} V_{i,G} =X_{c,G} +F\times \left( {X_{r3,G} -X_{c,G} +X_{r1,G} -X_{r2,G} } \right) \quad \end{aligned}$$

(21)

(4)
The EPSDE algorithm

In EPSDE algorithm, the mutation strategies are DE/rand/1, DE/best/2, DE/target-to-rand/1. We can see changes of the mutation strategy as following.

The initial mutation strategy:

$$\begin{aligned} V_{i,G} =X_{r1,G} +F\times \left( {X_{r2,G} -X_{r3,G} } \right) \end{aligned}$$

(17)

$$\begin{aligned} V_{i,G} =X_{best,G} +F\times \left( {X_{r1,G} -X_{r2,G} +X_{r3,G} -X_{r4,G} } \right) \nonumber \\ \end{aligned}$$

(22)

$$\begin{aligned} V_{i,G} =X_{i,G} +F\times \left( {X_{r3,G} -X_{i,G} +X_{r1,G} -X_{r2,G}}\right) \nonumber \\ \end{aligned}$$

(27)

The mutation strategy based on clearing niche mechanism:

$$\begin{aligned} V_{i,G} =X_{c,G} +F\times \left( {X_{r1,G} -X_{r2,G}}\right) \end{aligned}$$

(19)

$$\begin{aligned} V_{i,G} =X_{c,G} +F\times \left( {X_{r1,G} -X_{r2,G} +X_{r3,G}-X_{r4,G}}\right) \nonumber \\ \end{aligned}$$

(24)

$$\begin{aligned} V_{i,G} =X_{c,G} +F\times \left( {X_{r3,G} -X_{c,G} +X_{r1,G} -X_{r2,G}}\right) \nonumber \\ \end{aligned}$$

(28)

4 Test functions

4.1 Minimization optimization benchmark functions

This section lists the global minimization benchmark functions which are used to evaluate the performance of DE variants. In the section, 17 global minimization benchmark functions are chosen as test functions. The 17 test functions $(f_1 -f_{17})$ are dimension-wise scalable (Liang et al. 2005).

(1) Shifted Sphere function, defined as

$$\begin{aligned} f_{1}(x)=\sum \limits _{i=1}^D z_i^2 ,\quad z=x-o \end{aligned}$$

(29)

With $o=(o_1 ,o_2 ,\ldots ,o_{D})$ is the shifted global optimum, global optimum is $x^{*}=o$ and $f(x^{*})=0$ for $-100\le x_i \le 100$. Three-dimensional graph corresponding to this function is shown in Fig. 6.

(2) Shifted Schwefel’s Problem 1.2, defined as

$$\begin{aligned} f_2 (x)=\sum \limits _{i=1}^{D}\left( \sum \limits _{j=1}^i z_{j}\right) ^{2},\quad z=x-o \end{aligned}$$

(30)

With $o=\left( o_{1}, o_{2},\ldots ,o_{D}\right) $ is the shifted global optimum, global optimum is $x^{*}=o$ and $f\left( {x^{*}}\right) =0$ for $-100\le x_i \le 100$. Three-dimensional graph corresponding to this function is shown in Fig. 6.

(3) Shifted Rotated High Conditioned Elliptic function, defined as

$$\begin{aligned} f_3(x )=\sum \limits _{i=1}^D \left( {10^{6}} \right) ^{\frac{i-1}{D-1}}z_i^2, \quad z=\left( {x-o}\right) *M \end{aligned}$$

(31)

With $o=\left( {o_1 ,o_2 ,\ldots , o_D}\right) $ is the shifted global optimum, M is a orthogonal rotation matrix, global optimum is $x^{*}=o$ and $f\left( x^{*}\right) =0$ for $-100\le x_i \le 100$. Three-dimensional graph corresponding to this function is shown in Fig. 6.

(4) Shifted Schwefel’s Problem 1.2 with Noise in Fitness, defined as

$$\begin{aligned} f_4 (x){=}\left( {\sum \limits _{i=1}^D\left( {\sum \limits _{j=1}^i z_j } \right) ^{2}}\right) *\left( {1+0.4*\left| {N\left( 0,1\right) }\right| } \right) , z=x-o \end{aligned}$$

(32)

With $o=\left( {o_1 ,o_2 ,\ldots ,o_D}\right) $ is the shifted global optimum, global optimum is $x^{*}=o$ and $f\left( {x^{*}} \right) =0$ for $-100\le x_i \le 100$. Three-dimensional graph corresponding to this function is shown in Fig. 6.

(5) Shifted Rosenbrock’s Function, defined as

$$\begin{aligned} f_5 (x)=\sum \limits _{i=1}^{D-1}\left( {100*\left( {z_i^2 -z_{i+1}}\right) ^{2}+\left( {z_i -1}\right) ^{2}}\right) , z=x-o \end{aligned}$$

(33)

With $o=\left( {o_1 ,o_2 ,\ldots ,o_D}\right) $ is the shifted global optimum, global optimum is $x^{*}=o$ and $f\left( {x^{*}} \right) =0$ for $-100\le x_i \le 100$. Three-dimensional graph corresponding to this function is shown in Fig. 6.

(6) Shifted Rotated Griewank’s Function without Bounds, defined as

$$\begin{aligned} f_6 (x)=\sum \limits _{i=1}^D \frac{z_i^2 }{4000}-\prod \limits _{i=1}^D \cos \left( {\frac{z_i }{\sqrt{i}}} \right) +1, \quad z=\left( {x-o} \right) *M \end{aligned}$$

(34)

With $o=\left( {o_1 ,o_2 ,\ldots ,o_D } \right) $ is the shifted global optimum, M is a orthogonal rotation matrix, global optimum is $x^{*}=o$ and $f\left( x^{*}\right) =0$ for $0\le x_{i} \le 600$. Three-dimensional graph corresponding to this function is shown in Fig. 6.

(7) Shifted Rotated Ackley’s Function with Global Optimum on Bounds, defined as

$$\begin{aligned} f_7 (x)= & {} -20*\hbox {exp}\left( {-0.2*\sqrt{\frac{1}{D}\sum \nolimits _{i=1}^D z_i^2 }} \right) \nonumber \\&-\hbox {exp}\left( {\frac{1}{D}\sum \limits _{i=1}^D \cos \left( {2*\pi *z_i}\right) } \right) +20+e, \nonumber \\ z= & {} \left( x-o\right) *M \end{aligned}$$

(35)

With $o=\left( {o_1 ,o_2 ,\ldots ,o_D}\right) $ is the shifted global optimum, M is a orthogonal rotation matrix, global optimum is $x^{*}=o$ and $f\left( x^{*}\right) =0$ for $-32\le x_i\le 32$. Three-dimensional graph corresponding to this function is shown in Fig. 6.

(8) Shifted Rastrigin’s Function, defined as

$$\begin{aligned} f_8 (x)=\sum \limits _{i=1}^D \left( {z_i^2 -10*\cos \left( {2*\pi *z_i}\right) +10}\right) , \quad z=x-o \end{aligned}$$

(36)

With $o=\left( {o_1 ,o_2 ,\ldots ,o_D}\right) $ is the shifted global optimum, global optimum is $x^{*}=o$ and $f\left( {x^{*}} \right) =0$ for $-5\le x_i \le 5$. Three-dimensional graph corresponding to this function is shown in Fig. 6.

(9) Schwefel’s Problem2.13, defined as

$$\begin{aligned} f_9= & {} \sum \limits _{i=1}^D \left( {A_i -B_i (x)} \right) ^{2}\nonumber \\ A_i= & {} \sum \limits _{j=1}^D \left( {a_{ij} \sin \alpha _j +b_{ij} \cos \alpha _j}\right) , \\ B_i (x)= & {} \sum \limits _{j=1}^D \left( {a_{ij} \sin x_j +b_{ij} \cos x_j } \right) \nonumber \end{aligned}$$

(37)

With ${\upalpha }=\left[ {\alpha _1 ,\alpha _2 ,\ldots ,\alpha _{D}}\right] $, $\alpha _j \in \left[ {-\pi ,\pi } \right] $; With global optimum $x^{*}=\alpha $ and $f\left( {x^{*}} \right) =0$ for $-\pi \le x_i \le \pi $. Three-dimensional graph corresponding to this function is shown in Fig. 6.

(10) Shifted Expanded Griewank’s plus Rosenbrock’s Function, defined as

$$\begin{aligned} f_{10}(x)= & {} f_7 \left( {f_2\left( {z_1 ,z_2}\right) }\right) +f_7 \left( {f_2 \left( {z_2 ,z_3}\right) }\right) +\cdots \nonumber \\&+f_7\left( {f_2 \left( {z_{D-1} ,z_D}\right) }\right) +f_7 \left( {f_2\left( {z_D ,z_1}\right) }\right) \nonumber \\ z= & {} x-o+1 \end{aligned}$$

(38)

With $o=\left( {o_1 ,o_2 ,\ldots ,o_D } \right) $ is the shifted global optimum, global optimum is $x^{*}=o$ and $f\left( {x^{*}} \right) =0$ for $-3\le x_i \le 1$. Three-dimensional graph corresponding to this function is shown in Fig. 6.

(11) Shifted Rotated Expanded Scaffer’s F6 Function, defined as

$$\begin{aligned}&F\left( {x,y}\right) =0.5+\frac{\left( {\sin ^{2}\left( {\sqrt{x^{2}+y^{2}}}\right) -0.5} \right) }{\left( {1+0.001\times \left( {x^{2}+y^{2}}\right) } \right) ^{2}} \end{aligned}$$

(39)

$$\begin{aligned}&f_{11}=F\left( {z_1 ,z_2 } \right) +F\left( {z_2 ,z_3 } \right) +\cdots +F\left( {z_{D-1} ,z_D } \right) \nonumber \\&~~~~~~~~~~+F\left( {z_D ,z_1 }\right) , \quad z=\left( {x-o} \right) *M \end{aligned}$$

(40)

With $o=\left( {o_1 ,o_2 ,\ldots ,o_D } \right) $ is the shifted global optimum, global optimum is $x^{*}=o$ and $f\left( {x^{*}} \right) =0$ for $-100\le x_i \le 100$. Three-dimensional graph corresponding to this function is shown in Fig. 6.

(12) Hybrid Composition Function, defined as

The functions $f_{12}$ (CF1), $f_{13} $ (CF7), $f_{14}$ (CF8), $f_{15} $ (CF9), $f_{16} $ (CF10) and $f_{17}$ (CF11) are composed by using 10 different functions respectively. Their global optimums are easy to find once the global basins are found. The details of constructing such functions are presented in Liang et al. (2005). And three-dimensional graph corresponding to these six functions is shown in Fig. 6.

Among the above 17 benchmark problems, functions $f_1-f_4 $ are unimodal and functions $f_5-f_9 $ are basic multimodal functions, $f_{10} $ and $f_{11} $ are expanded multimodal functions, and $f_{12}-f_{17} $ are hybrid composition functions. The optimum value, position of the global optima, and initialization ranges for these 17 benchmark problems are provided in Table 1.

Table 1 Global optimum and initialization ranges for the benchmark functions

New mutation strategies of differential evolution based on clearing niche mechanism

Abstract

Graphical Abstract

Similar content being viewed by others

An adaptive mutation strategy correction framework for differential evolution

Dual mutations collaboration mechanism with elites guiding and inferiors eliminating techniques for differential evolution

Refining differential evolution with mutation rate and neighborhood weight local search

Explore related subjects

1 Introduction

2 Differential evolution algorithms

2.1 Brief description of basic differential evolution

2.2 Some variants of DE

3 Differential evolution algorithm based on niche

3.1 The niche operation based on clearing mechanism

3.2 Mutation strategies based on clearing niche

3.3 DE variants based on clearing niche

4 Test functions

4.1 Minimization optimization benchmark functions

4.2 Multimodal optimization benchmark functions

5 Experimental studies

5.1 Accuracy study

5.1.1 Experimental setup

5.1.2 Comparison with basic DE algorithms

5.1.3 Comparison with advanced DE variants

5.1.4 Discussion about the choices the clearing radius for DE algorithms with new mutation strategies

5.2 Multimodal study

5.2.1 Experimental setup

5.2.2 Evaluation criterions

5.2.3 Result and analysis

5.3 Study for EED problem

5.3.1 Problem formulation

5.3.2 Experimental setup

5.3.3 Result and analysis

6 Conclusion

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Human and animals participants

Informed consent

Additional information

Appendix

Appendix

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation