1 Introduction

Memetic Computing (MC) is a subject in computational science that studies algorithmic structures composed of heterogeneous operators, (Neri et al. 2011b). At an abstract level, every set of multiple operators coordinated within a certain structure for solving a given problem can be seen as an MC approach. Although this definition may appear excessively broad (Ong et al. 2010; Neri and Cotta 2012), in our view the description of algorithms as structured collections of operators (memes) has both conceptual and practical implications.

The concept of MC originates with the definition of the Memetic Algorithm (MA), (see Moscato and Norman 1989; Moscato 1989). The first MAs were simple Genetic Algorithms (GAs) hybridized with a local search component for tackling the Travelling Salesman Problem (TSP). Although the idea of hybridizing algorithms was not totally new, (see e.g. Goldberg 1989), the visionary representation of the transmission of knowledge amongst subcomponents of an algorithm inspired a long-lasting discussion in the computer science community. An ex-post formalization of the definition of MA was given in Hart et al. (2004), where an MA is stated to be an algorithm composed of an evolutionary framework and one (or more) local search components activated within the generation cycle of the external framework. Several MAs (according to this definition) have been successfully used in various fields of applied science and engineering.

For example, in Joshi and Sanderson (1999) an ad-hoc MA based on a Differential Evolution (DE) framework has been proposed for solving the multi-sensor fusion problem, while in Rogalsky and Derksen (2000) another DE-based MA is designed to tackle an aerodynamic design problem. In Zamuda et al. (2011) a DE scheme for plant model reconstruction is proposed. A memetic solution for studying a material structure is given in Fan et al. (2007). In Caponio et al. (2007), Neri and Mininno (2010) domain-specific MAs are proposed for solving control engineering problems related to electric motors and robotics. In Ong and Keane (2004) an aerodynamic design problem is considered. MC approaches addressing biological and medical problems are also very popular, (see e.g. Abbass 2002; Neri et al. 2007a, b). MAs for scheduling and planning problems have also been proposed, (see e.g. Hasan et al. 2009; Lim et al. 2008; Tan et al. 2007).

In recent years, MAs have become very popular in multiple contexts. As highlighted in Neri and Cotta (2012), one important reason behind the MA success is the diffusion of the No Free Lunch Theorems (NFLTs) (Wolpert and Macready 1997). These theorems prove that the average performance of any pair of algorithms \(A\) and \(B\) across all possible problems is identical. Strictly speaking, the proofs of the NFLTs (Auger and Teytaud 2007) are given for discrete problems and under the hypothesis that both algorithms \(A\) and \(B\) are non-revisiting, i.e. the algorithms do not perform the fitness evaluation of the same candidate solution more than once during the optimization run. Although a rigorous verification of these hypotheses is often not realistic, computer scientists accepted the idea that there is no universal optimizer: if an algorithm performs well on a certain class of problems, then it necessarily pays for that with degraded performance on the set of all remaining problems. As a direct consequence, each problem should be analysed and a proper algorithm that addresses the specific features of that problem should be designed. Considering the non-specificity of the general structure of an MA, practitioners soon realized that proper hybridizations aimed at addressing specific problem features (such as noise, multi-modality, etc.) would help them solve problems that appeared hard for traditional paradigms.

Memetic Computing (MC) extends the concept of MA by taking into account algorithms that are not population-based, (see e.g. Neri et al. 2011a), or that employ external support operators such as machine learning components, (see e.g. Handoko et al. 2010). In other words, MC is an umbrella term that includes many modern algorithms which are composed of multiple operators but would not fit within the MA definition given in Hart et al. (2004). From a philosophical viewpoint, however, the real breakthrough of MC is the view of algorithms as structures whose building blocks are operators: this idea opens, for example, the exciting perspective of automatic algorithm generation, (see Neri et al. 2011b; Ong et al. 2010; Meuth et al. 2009).

The presence of multiple components and coordination schemes usually makes MC approaches rather complex and demanding in terms of computational overhead. For example, paper (Molina et al. 2010a) proposes an MA that, although very efficient, requires the usage of multiple covariance matrices, thus resulting in a very high computational overhead and memory consumption, especially in high dimensions. In Montes de Oca et al. (2009) a Particle Swarm Optimization (PSO) that includes multiple enhancing mechanisms collected from other variants in the literature is proposed. In Vrugt et al. (2009); Peng et al. (2010) optimization algorithms composed of multiple popular meta-heuristics are proposed. The coordinated employment of multiple algorithms, and more generally multiple strategies, is encompassed within the idea of ensemble (Mallipeddi et al. 2010, 2011), where different strategies concur, by means of a self-adaptive or randomized mechanism, to the optimization of the same fitness function. A similar approach is proposed in Zamuda and Brest (2012), where multiple strategies are combined with a population size reduction in order to tackle industrial problems. In Nguyen et al. (2009a) multiple algorithms are again considered, while their coordination is performed by means of a success probability criterion, following a principle similar to meta-Lamarckian learning (Ong and Keane 2004). In Nguyen et al. (2009b) multiple algorithmic components are coordinated by means of the structural mapping of the population.

Although in some cases complex algorithmic implementations can lead to successful results, unnecessary complexity during the design phase should be strictly avoided for the following four major reasons.

  1. Algorithms composed of multiple parts and containing many parameters can be hard to control. The setting of many parameters whose values heavily affect the performance (for a given problem) is often a complicated issue, since the optimal setting of each parameter is likely to depend on the setting of the other parameters, (see Eiben and Smit 2011).

  2. Complex algorithms can be hard to understand in terms of their functioning. More specifically, some algorithms, despite their good performance on some problems, are so complex that it is nearly impossible to interpret their functioning and understand the reasons behind their success. If there is no proper understanding of the working principles of an algorithm, the scheme risks being highly specialized and failing unexpectedly when the problem changes. While specialization is not necessarily a drawback of an algorithm, the lack of understanding of the algorithmic working principles, and thus the difficulty of taking efficient countermeasures to adapt an algorithm to a new situation (e.g. a new dimensionality value), is definitely a limitation of complex schemes.

  3. Some complex algorithms include computationally expensive components. This may result in a high computational overhead which may depend super-linearly on the problem dimensionality. For example, algorithms that make use of covariance or distance matrices are characterized by a computational overhead that grows quadratically with the dimensionality of the problem. These algorithms may be unacceptably expensive in large scale optimization problems.

  4. In other cases, modern algorithms require machine learning structures, archives, and learning components for the supervision of the operators. In such cases, the algorithm can be expensive in terms of memory consumption, thus being impractical for those problems that are characterized by limited hardware, such as micro-controllers and embedded systems. Obviously, some algorithms can present both a high computational overhead and a high memory requirement.

Motivated by these reasons, paper (Iacca et al. 2012a) introduces the concept of Ockham’s Razor in MC, stating that unnecessarily complex structures should be avoided, as properly designed, simple algorithms can perform as well as complex ones. In addition, in Iacca et al. (2012a) an implementation of a novel algorithm, namely Three Stage Optimal Memetic Exploration (3SOME), is proposed. The 3SOME algorithm is a fairly simple scheme that makes use of three operators to progressively perturb a single solution. Despite its simplicity, 3SOME displays a competitive performance with respect to other MC approaches and modern population-based algorithms. This successful implementation has been further studied and improved. Paper (Neri et al. 2012) compares the simplistic meme coordination scheme of 3SOME with meta-Lamarckian learning (Ong and Keane 2004) and concludes that the simplistic 3SOME coordination is not worse than the adaptive learning. Papers (Poikolainen et al. 2012a; Caraffini et al. 2012a) propose the integration of components for handling non-separability within the 3SOME framework. A marginally improved version of the original 3SOME is presented in Poikolainen et al. (2012b), where a modified operator enhances the exploitation features of the algorithm. Finally, paper (Caraffini et al. 2012b) studies the 3SOME algorithmic structure by comparing 3SOME variants holding the same structure but encompassing different operators. As a result, algorithms with the same structure but different operators appeared to display an overall similar performance over various problems.

It is worth mentioning that well before the formulation of the Ockham’s Razor principle in algorithmic contexts and the diffusion of the MC terminology, simple structures perturbing single solutions by means of diverse operators had already been proposed in the literature. For example, in Yao et al. (1999) a modified Evolutionary Programming (EP) scheme is proposed that cooperatively-competitively makes use of both Gaussian and Cauchy distributions in order to generate new trial individuals. An evolution of this approach is presented in Lee and Yao (2004), where the EP is empowered by the Lévy distribution within the mutation operator.

The present paper, by following the Ockham’s Razor principle presented in Iacca et al. (2012a) and the considerations on the importance of algorithmic structures reported in Caraffini et al. (2012b), proposes a novel algorithmic implementation that further attempts to be a simple and efficient alternative to modern complex algorithms. The proposed algorithm, namely re-sampled inheritance search (RIS), makes use of only two operators that progressively perturb a single solution, combined in a simplified structure with respect to that studied in Caraffini et al. (2012b). One of these operators is a modified version of a local search component used in Tseng and Chen (2008), while the second is a re-sampling mechanism followed by an exponential crossover implemented in the fashion of DE (Price et al. 2005).

In addition, this paper applies the proposed algorithm to the control problem of a helicopter robot. In this real-world application, the autonomous nature of the hardware would impose an on-board implementation of the optimization algorithm. The system would therefore benefit from a simple algorithm characterized by a modest memory requirement and computational overhead. This real-world example highlights how the proposed algorithm, although a simple scheme, is capable of displaying a performance competitive with that of modern complex algorithms.

The remainder of this article is organized in the following way. Section 2 introduces the RIS components, their coordination, and explains the motivation behind the design choices. Section 3 presents, on a diverse set of test problems belonging to four popular benchmarks, the RIS performance with respect to that of modern algorithms. Section 4 describes the application of the proposed algorithm to a real-world engineering problem in the field of mobile robotics. Finally, Sect. 5 gives the conclusions of this study.

2 Re-sampling inheritance search

Without loss of generality, in the following we refer to the minimization problem of an objective function \(f(\mathbf{x})\), where the candidate solution \(\mathbf{x}\) is a vector of \(n\) design variables (or genes) in a hyper-box decision space \(\mathbf{D}=[\mathbf{a},\mathbf{b}]\), with \(\mathbf{a}\) and \(\mathbf{b}\) being the lower and upper bound vectors, respectively. Let us indicate with \(x[i]\) the \(i\mathrm{th}\) element of the vector \(\mathbf{x}\). At an abstract level, an optimization algorithm can be seen as a mathematical procedure that progressively perturbs one or more candidate solutions in order to detect the optimum of the objective function. Let us indicate with \(\mathbf{x}_e\) (where \(e\) stands for “elite”) the best solution (or population of best solutions) detected at a given moment of the search, and with \(\mathbf{x}_t\) the trial solution(s), i.e. the candidate solution(s) perturbed by an operator (or a set of operators).

The proposed Re-sampling Inheritance Search (RIS) is an extremely simple algorithm that makes use of two operators to perturb a single solution. The proposed algorithm randomly samples an initial solution \(\mathbf{x}_e\) within the decision space \(\mathbf{D}\). The two operators proposed in the following two sub-sections are applied in order to perturb \(\mathbf{x}_e\).

2.1 Re-sampling with inheritance

This operator, at first, randomly generates a solution \(\mathbf{x}_t\) within the decision space \(\mathbf{D}\). Then, a perturbation of \(\mathbf{x}_e\) is performed by means of the exponential crossover in the fashion of Differential Evolution (Neri and Tirronen 2010; Zaharie 2009). More specifically, one gene of \(\mathbf{x}_e\) is randomly selected. This gene replaces the corresponding gene of the trial solution \(\mathbf{x}_t\). Then, a sequence of random numbers between \(0\) and \(1\) is generated. As long as \(rand\left( 0,1\right) \le Cr\), where the crossover rate \(Cr\) is a parameter affecting the number of transferred genes (see below), the design variables of the elite \(\mathbf{x}_e\) are copied into the corresponding positions of the trial solution \(\mathbf{x}_t\), starting from the initially selected gene. As soon as \(rand\left( 0,1\right) > Cr\), the copy process is interrupted and all the remaining design variables of the offspring are those initially sampled (belonging to the original \(\mathbf{x}_t\)). The individual is handled as a cyclic buffer, i.e. when the \(n\mathrm{th}\) variable is reached during the copy process, the next one to be copied is the first one. When the trial solution \(\mathbf{x}_t\) has been generated, its fitness is compared with that of \(\mathbf{x}_e\). If the newly generated solution outperforms the elite, an elite replacement occurs. The pseudo-code displaying the working principles of re-sampling with inheritance is shown in Fig. 1.

Fig. 1
figure 1

Pseudo-code of re-sampling with inheritance
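
As an illustration, the operator can be sketched in a few lines of Python (function and variable names are ours; the pseudo-code in Fig. 1 remains the authoritative description):

```python
import numpy as np

def resample_with_inheritance(x_e, a, b, Cr, rng):
    """Re-sample a new point in D and let it inherit a block of
    consecutive genes from the elite x_e (exponential crossover)."""
    n = len(x_e)
    x_t = rng.uniform(a, b)          # brand new point within D
    i = rng.integers(n)              # one gene is always inherited
    x_t[i] = x_e[i]
    j = (i + 1) % n                  # the individual is a cyclic buffer
    while rng.random() <= Cr and j != i:
        x_t[j] = x_e[j]              # keep copying while rand(0,1) <= Cr
        j = (j + 1) % n
    return x_t
```

The elite replacement step of Fig. 1 is left to the caller, once the fitness of \(\mathbf{x}_t\) has been evaluated.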

As shown in Neri et al. (2011a), it can easily be observed that, for a given value of \(Cr\), the effect of the exponential crossover changes with the dimensionality of the problem. For low-dimensional problems, the trial solution would inherit most of the genes from the elite, while for higher dimensionalities only a small portion of \(\mathbf{x}_e\) would be copied into \(\mathbf{x}_t\). In order to avoid this problem and make the crossover action independent of the dimensionality of the problem, the following quantity, namely the inheritance factor, is fixed:

$$\begin{aligned} \alpha _e \approx \frac{n_e}{n} \end{aligned}$$
(1)

where \(n_e\) is the number of genes we expect to copy from \(\mathbf{x}_e\) into \(\mathbf{x}_t\) in addition to the first gene, which is deterministically copied. The probability that \(n_e\) genes are copied is \(Cr^{n_e}=Cr^{n \alpha _e}\). In order to control the expected amount of copied genes, i.e. to have about \(n_e\) genes copied into the offspring with probability \(0.5\), we impose that:

$$\begin{aligned} Cr^{n\alpha _e}=0.5. \end{aligned}$$
(2)

It can easily be seen that, for a chosen \(\alpha _e\), the crossover rate can be set on the basis of the dimensionality as follows:

$$\begin{aligned} Cr = \frac{1}{\sqrt[n\alpha _e]{2}}. \end{aligned}$$
(3)

By means of formula (3), the expected quantity of information inherited from \(\mathbf{x}_e\) by \(\mathbf{x}_t\) is thus controlled.
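
As a numerical illustration (ours, not from the original study), the following lines evaluate Eq. (3) for the setting \(\alpha _e=0.5\) used later in Sect. 3:

```python
def crossover_rate(alpha_e, n):
    # Eq. (3): Cr = 1 over the (n * alpha_e)-th root of 2,
    # i.e. Cr = 2 ** (-1 / (n * alpha_e))
    return 0.5 ** (1.0 / (n * alpha_e))

# Cr ~= 0.955 in 30 dimensions and Cr ~= 0.9986 in 1000 dimensions:
# the crossover rate grows with n so that about alpha_e * n genes
# are inherited with probability 0.5 regardless of dimensionality.
print(crossover_rate(0.5, 30), crossover_rate(0.5, 1000))
```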

2.2 Exploitative local search

This operator is a local search algorithm which perturbs a single solution along its \(n\) axes, i.e. it separately perturbs each design variable. Other search operators that separately perturb each variable have been extensively proposed in the literature. Some examples that, unlike the operator in the present paper, make use of randomization are given in Zhou et al. (2008), Ji and Klinowski (2006). The meme proposed here can be seen as a modification of a classical hill-descent algorithm and employs the perturbation logic proposed in Tseng and Chen (2008).

The implementation of this operator requires an additional solution, which will be here referred to as \(\mathbf{x}_s\). The trial solution \(\mathbf{x}_t\) generated by the first operator is perturbed by computing, for each variable \(i\):

$$\begin{aligned} x_s[i]=x_t[i]-\rho [i], \end{aligned}$$
(4)

where \(\varvec{\rho }\) is an \(n\)-dimensional exploratory radius vector. The elements of \(\varvec{\rho }\) are reinitialized to a predetermined initial value whenever the local search is activated. Subsequently, if \(\mathbf{x}_s\) outperforms \(\mathbf{x}_t\), the trial solution \(\mathbf{x}_t\) is updated (the values of \(\mathbf{x}_s\) are copied into it), otherwise a half step in the opposite direction is performed:

$$\begin{aligned} x_s[i]=x_t[i]+\frac{\rho [i]}{2}. \end{aligned}$$
(5)

Again, \(\mathbf{x}_s\) replaces \(\mathbf{x}_t\) if it outperforms it. After all the variables have been perturbed, the elite \(\mathbf{x}_e\) is replaced by \(\mathbf{x}_t\) if the latter outperforms it. If the elite is not updated, i.e. the exploration was unsuccessful, the radius \(\varvec{\rho }\) is halved for all variables. The exploration is then repeated for all the design variables until a precision criterion is met. In particular, the operator is stopped when the 2-norm of \(\varvec{\rho }\), normalized per each element by the corresponding search interval, is smaller than a fixed threshold, as follows:

$$\begin{aligned} \sqrt{\sum \limits _{i=1}^n \left( \frac{\rho [i]}{b[i]-a[i]}\right) ^2}<\varepsilon \end{aligned}$$
(6)

where \(\left( b[i]-a[i]\right) \) is the width of the decision space \(\mathbf{D}\) along the \(i\mathrm{th}\) dimension and \(\varepsilon \) is a pre-arranged constant. For the sake of clarity, Fig. 2 displays, in a pseudo-code, the working principles of the exploitative local search.

Fig. 2
figure 2

Pseudo-code of the exploitative local search
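
Under our reading of the description above, the operator can be sketched as follows (names are ours, the budget check is omitted for brevity, and Fig. 2 remains the authoritative description):

```python
import numpy as np

def exploitative_local_search(x_t, f_t, f_e, f, a, b, eps=1e-6, rho0=0.4):
    """Deterministic perturbation along the n axes, Eqs. (4)-(6).
    f_t and f_e are the fitness values of x_t and of the elite."""
    n = len(x_t)
    rho = rho0 * (b - a)                         # re-initialized radius
    while np.linalg.norm(rho / (b - a)) >= eps:  # stop rule, Eq. (6)
        for i in range(n):
            x_s = x_t.copy()
            x_s[i] = x_t[i] - rho[i]             # full step, Eq. (4)
            f_s = f(x_s)
            if f_s >= f_t:                       # half step back, Eq. (5)
                x_s[i] = x_t[i] + rho[i] / 2.0
                f_s = f(x_s)
            if f_s < f_t:                        # x_s replaces x_t
                x_t, f_t = x_s, f_s
        if f_t < f_e:        # sweep would update the elite: keep radius
            f_e = f_t
        else:
            rho = rho / 2.0  # unsuccessful sweep: halve the radius
    return x_t, f_t
```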

As a further remark, RIS applies a toroidal management of the bounds. This means that if, along the dimension \(i\), the design variable \(x[i]\) exceeds a bound by a value \(\zeta \), it is reinserted from the other end of the interval at a distance \(\zeta \) from the edge, i.e. given an interval \(\left[ a,b\right] \), if \(x[i]=b+\zeta \) it takes the value \(a+\zeta \).
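
A compact sketch of this correction (our own formulation) is:

```python
def toroidal(x, a, b):
    # If x exceeds b by zeta, (x - a) mod (b - a) equals zeta,
    # so the variable re-enters the interval at a + zeta.
    return a + (x - a) % (b - a)
```

For instance, with \(\left[ a,b\right] =\left[ 0,10\right] \), a value of \(12\) is mapped to \(2\), while \(-3\) is mapped to \(7\).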

2.3 Algorithmic structure and philosophy

The combination of the two memes composing the RIS is arranged straightforwardly. More specifically, a solution is first processed by the re-sampling with inheritance and then the resulting solution \(\mathbf{x}_t\) is processed by the exploitative local search. The elite solution \(\mathbf{x}_e\), which is possibly the solution \(\mathbf{x}_t\) processed by the exploitative local search, is then given back to the re-sampling with inheritance for further improvement. A pseudo-code description of the RIS algorithmic structure is reported in Fig. 4 and graphically depicted in Fig. 3.

Fig. 3
figure 3

Graphical representation of the RIS structure

Fig. 4
figure 4

Pseudo-code of the RIS
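
Putting together the two sketches given above, a possible rendering of the loop of Figs. 3 and 4 is the following (the evaluation-counting wrapper and the default values are ours, and the budget check is coarse, since the local search runs to its precision criterion):

```python
import numpy as np

def ris(f, a, b, budget, alpha_e=0.5, eps=1e-6, rho0=0.4, seed=0):
    """RIS main loop: re-sampling with inheritance followed by the
    exploitative local search, repeated until the budget is spent."""
    rng = np.random.default_rng(seed)
    evals = [0]
    def fc(x):                                  # evaluation-counting wrapper
        evals[0] += 1
        return f(x)
    n = len(a)
    Cr = 0.5 ** (1.0 / (n * alpha_e))           # Eq. (3)
    x_e = rng.uniform(a, b)                     # random initial elite
    f_e = fc(x_e)
    while evals[0] < budget:
        x_t = resample_with_inheritance(x_e, a, b, Cr, rng)
        f_t = fc(x_t)
        if f_t < f_e:                           # elite replacement (Fig. 1)
            x_e, f_e = x_t.copy(), f_t
        x_t, f_t = exploitative_local_search(x_t, f_t, f_e, fc,
                                             a, b, eps, rho0)
        if f_t < f_e:                           # elite replacement (Fig. 2)
            x_e, f_e = x_t.copy(), f_t
    return x_e, f_e
```

For instance, `ris(lambda x: float(np.sum(x ** 2)), -5.0 * np.ones(10), 5.0 * np.ones(10), budget=50000)` minimizes a ten-dimensional sphere function.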

The re-sampling mechanism is supposed to generate a solution far away from the current elite, while the local search exploits the area of the decision space suggested by the re-sampling operator. In this sense, the proposed RIS is nothing but a simple multi-start local search. However, the proposed scheme is conceived as a global optimization algorithm in the fashion of MC. The inheritance mechanism ensures that a part of the genotype of the most promising candidate solution is used to improve upon its performance, (see Iacca et al. 2012a). Although the re-sampling is an operation that is performed only occasionally and thus has a limited budget devoted to it (with respect to that allotted to the local search), the transmission of some variables from a promising solution to a newly sampled point appears to have a significant impact on the global performance of the algorithm, see results in Sect. 3.1 and Table 13. Over a large experimental setup, it has been observed that the RIS version with inheritance is always at least as good as the version without inheritance.

Figure 5 shows the search mechanism of the RIS in a bi-dimensional case. Dashed lines show the search moves performed by the re-sampling mechanism with inheritance, while solid lines represent the search logic of the exploitative local search. The re-sampling mechanism in general performs diagonal moves. The movements along the axes are due to the fact that the representation is bi-dimensional and thus only one variable is perturbed in the trial solution. Obviously, in multi-dimensional cases a portion of the elite is inherited by \(\mathbf{x}_t\), while the moves are performed diagonally.

Fig. 5
figure 5

Graphical representation of the RIS functioning

As a fundamental remark, the proposed RIS has been designed by following a bottom-up strategy, (see Iacca et al. 2012a), i.e. building up the algorithmic structure from scratch and adding one operator at a time until a good performance is achieved. The resulting algorithm is indeed extremely simple and appears to be in line with the Ockham’s Razor principle for MC structures formulated in Iacca et al. (2012a).

3 Numerical results

The proposed RIS algorithm has been run with the following parameter setting. The inheritance factor \(\alpha _e\), Eq. (1), has been set equal to \(0.5\). Regarding the local searcher, the initial search radius \(\rho [i]\), see Eqs. (4) and (5), has been set, as in Tseng and Chen (2008), equal to \(0.4 \times \left( b[i]-a[i]\right) \), i.e. 40 % of the domain width along each variable \(i\), while the stop threshold \(\varepsilon \) has been set equal to \(10^{-6}\). This configuration of the parameters ensures a good performance over a considerable variety of test problems. In particular, all the algorithms under study have been run over:

  • The CEC2005 benchmark described in Suganthan et al. (2005) in \(30\) dimensions (\(25\) test problems)

  • The BBOB2010 benchmark described in Hansen et al. (2010) in \(100\) dimensions (\(24\) test problems)

  • The CEC2008 benchmark described in Tang et al. (2007) in \(1000\) dimensions (\(7\) test problems)

  • The CEC2010 benchmark described in Tang et al. (2010) in \(1000\) dimensions (\(20\) test problems)

Thus, \(76\) test problems have been considered in this study. For each algorithm in this paper (see the following subsections) \(100\) runs have been performed. Each run has been continued for \(5000\,{\times }\, n\) fitness evaluations, where \(n\) is the dimensionality of the problem. For each test problem and each algorithm, the average final fitness value \(\pm \) standard deviation over the \(100\) available runs has been computed. In order to strengthen the statistical significance of the results, for each test problem the Wilcoxon Rank-Sum test (Wilcoxon 1945) has also been applied, with a confidence level of \(0.95\).
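
For reproducibility, this statistical comparison can be carried out, for example, with `scipy.stats.ranksums`; the snippet below (with placeholder data, not the actual experimental results) illustrates the decision rule used in the tables:

```python
import numpy as np
from scipy.stats import ranksums

rng = np.random.default_rng(1)
ris_runs = rng.normal(10.0, 1.0, 100)    # placeholder final fitness values
other_runs = rng.normal(10.5, 1.0, 100)  # placeholder competitor values

stat, p_value = ranksums(ris_runs, other_runs)
# With a confidence level of 0.95 the null hypothesis is rejected
# when p < 0.05; for minimization, a negative statistic means the
# first sample (RIS) tends to reach lower, i.e. better, fitness.
if p_value >= 0.05:
    print("=")      # statistically equivalent performance
elif stat < 0:
    print("+")      # RIS significantly better
else:
    print("-")      # RIS significantly worse
```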

The following algorithms, with their respective parameter settings, have been considered for comparison against RIS.

  • Three Stage Optimal Memetic Exploration (3SOME) proposed in Iacca et al. (2012a) with inheritance factor \(\alpha _e=0.05\), middle distance exploration hyper-cube size \(\delta \) equal to 20 % of the total decision space width, coefficient of generated points at each activation of the middle distance exploration \(k=4\), short distance exploration radius \(\rho =0.4\) and local budget fixed to \(150\) iterations.

  • Comprehensive Learning Particle Swarm Optimizer (CLPSO) proposed in Liang et al. (2006) with population size equal to \(60\) individuals.

  • Adaptive Differential Evolution (JADE) proposed in Zhang and Sanderson (2009) with population size equal to \(60\) individuals, group size factor \(p=0.05\) and parameters adaptation rate factor \(c=0.1\).

  • Cooperatively Coevolving Particle Swarms Optimizer (CCPSO2) proposed in Li and Yao (2012) with population size equal to \(30\) individuals, Cauchy/Gaussian-sampling selection probability \(p=0.5\) and set of potential group sizes \(S=\{2, 5, 10\}\), \(S=\{2, 5, 10, 50, 100\}\), \(S=\{2, 5, 10, 50, 100, 250\}\) for experiments in \(30\), \(100\) and \(1000\) dimensions, respectively.

  • Memetic Algorithm with CMA-ES Chains (MA-CMA-Chains) proposed in Molina et al. (2010a) with population size equal to \(60\) individuals, probability of updating a chromosome by mutation equal to \(0.125\), local/global search ratio \(r_{L\over G}=0.5\), BLX-\(\alpha \) crossover with \(\alpha =0.5\), \(n_{ass}\) parameter for Negative Assortative Mating set to \(3\), LS intensity stretch \(I_{str}=500\) and threshold \(\delta ^{\min }_{LS}=10^{-8}\).

  • Modified Differential Evolution with p-Best Crossover (MDE-pBX) proposed in Islam et al. (2012) with population size equal to \(100\) individuals and group size \(q\) equal to 15 % of the population size.

  • Parallel Memetic Structure (PMS) proposed in Caraffini et al. (2013) with inheritance factor \(\alpha _e\) set equal to \(0.95\), initial search radius \(\rho \) equal to \(0.4\) and computational budget for the first local searcher set to \(150\) iterations. Regarding the second local searcher, \(h\) is initialised as a vector of \(0.1\), \(\alpha =2\), \(\beta =0.5\), while \(\varepsilon \) has been set equal to \(10^{-5}\).

  • compact Differential Evolution (cDE) proposed in Mininno et al. (2011) with rand/1 mutation and exponential crossover, cDE/rand/1/exp, virtual population size equal to \(300\), scale factor \(F = 0.5\), and proportion of genes undergoing exponential crossover \(\alpha _m = 0.25\), (see Neri et al. 2011a).

It must be remarked that MA-CMA-Chains employs multiple covariance matrices, (see Hansen et al. 2003). Thus, its memory requirement and computational overhead dramatically grow with the problem dimensionality. In order to tackle large scale problems, Lozano et al. (2011) suggested a tailored version which uses an efficient local search for high dimensional domains, namely:

  • Memetic Algorithm with Subgrouping Solis Wets Chains (MA-SSW-Chains), originally proposed in Molina et al. (2010b), with population size equal to \(100\) individuals, probability of updating a chromosome by mutation equal to \(0.125\), local/global search ratio \(r_{L\over G}=0.5\), BLX-\(\alpha \) crossover with \(\alpha =0.5\), \(n_{ass}\) parameter for Negative Assortative Mating set to \(3\), LS intensity stretch \(I_{str}=500\) and threshold \(\delta ^{min}_{LS}=0\).

Thus, for the experiments in \(1000\) dimensions, MA-SSW-Chains has been used instead of MA-CMA-Chains.

The parameter setting for all the algorithms in this subsection has been carried out by using the parameters suggested in the respective original articles. As for CLPSO and JADE, the population size was set in the original papers depending on the problem dimensionality (without a general rule). In this study, a tuning of the population size values has been performed, resulting in a size of \(60\) individuals which turned out to be the most suitable compromise in terms of overall performance. This value is in accordance with the study of Lozano et al. (2011) for setting the Differential Evolution population size.

Experimental results have been divided into groups. Tables 1, 2, 3, and 4 show the comparison against 3SOME, CLPSO, and JADE for the four benchmarks under consideration. Tables 5, 6, 7, and 8 show the comparison against CCPSO2, MA-CMA-Chains (MA-SSW-Chains), and MDE-pBX. Tables 9, 10, 11, and 12 show the comparison against cDE and PMS. The tables in this study display the average final fitness value over the \(100\) available runs and the corresponding standard deviation. The results of the Wilcoxon test are also reported in terms of pair-wise comparisons. The symbols “\(=\)” and “\(+\)” (“\(-\)”) indicate, respectively, a statistically equivalent performance and a better (worse) performance of RIS compared with the algorithm in the column label. The best results are highlighted in bold.

Table 1 Average fitness \(\pm \) SD and Wilcoxon Rank-Sum test (reference \(=\) RIS) for RIS against its predecessor 3SOME and popular meta-heuristics on CEC2005 (Suganthan et al. 2005) in \(30\) dimensions
Table 2 Average fitness \(\pm \) SD and Wilcoxon Rank-Sum test (reference \(=\) RIS) for RIS against its predecessor 3SOME and popular meta-heuristics on BBOB2010 (Hansen et al. 2010) in \(100\) dimensions
Table 3 Average fitness \(\pm \) SD and Wilcoxon Rank-Sum test (reference \(=\) RIS) for RIS against its predecessor 3SOME and popular meta-heuristics on CEC2008 (Tang et al. 2007) in \(1000\) dimensions
Table 4 Average fitness \(\pm \) SD and Wilcoxon Rank-Sum test (reference \(=\) RIS) for RIS against its predecessor 3SOME and popular meta-heuristics on CEC2010 (Tang et al. 2010) in \(1000\) dimensions
Table 5 Average fitness \(\pm \) SD and Wilcoxon Rank-Sum Test (reference \(=\) RIS) for RIS against state-of-the-art algorithms on CEC 2005 (Suganthan et al. 2005) in \(30\) dimensions
Table 6 Average fitness \(\pm \) SD and Wilcoxon Rank-Sum test (reference \(=\) RIS) for RIS against state-of-the-art algorithms on BBOB 2010 (Hansen et al. 2010) in \(100\) dimensions
Table 7 Average fitness \(\pm \) SD and Wilcoxon Rank-Sum test (reference \(=\) RIS) for RIS against state-of-the-art algorithms on CEC 2008 (Tang et al. 2007) in \(1000\) dimensions
Table 8 Average fitness \(\pm \) SD and Wilcoxon Rank-Sum test (reference \(=\) RIS) for RIS against state-of-the-art algorithms on CEC2010 (Tang et al. 2010) in \(1000\) dimensions
Table 9 Average fitness \(\pm \) SD and Wilcoxon Rank-Sum test (reference \(=\) RIS) for RIS against other state-of-the-art single solution algorithms on CEC2005 (Suganthan et al. 2005) in \(30\) dimensions
Table 10 Average fitness \(\pm \) SD and Wilcoxon Rank-Sum test (reference \(=\) RIS) for RIS against other state-of-the-art single solution algorithms on BBOB2010 (Hansen et al. 2010) in \(100\) dimensions
Table 11 Average fitness \(\pm \) SD and Wilcoxon Rank-Sum test (reference \(=\) RIS) for RIS against other state-of-the-art single solution algorithms on CEC2008 (Tang et al. 2007) in \(1000\) dimensions
Table 12 Average fitness \(\pm \) SD and Wilcoxon Rank-Sum test (reference \(=\) RIS) for RIS against other state-of-the-art single solution algorithms on CEC2010 (Tang et al. 2010) in \(1000\) dimensions

Despite its simple structure, the RIS algorithm outperforms both the popular meta-heuristics (CLPSO and JADE) under consideration. In particular, RIS overtakes CLPSO and JADE in \(30\) and \(100\) dimensions, while in \(1000\) dimensions CLPSO is competitive over the testbed in Tang et al. (2010). With reference to Table 1, it can be seen that for the testbed in Suganthan et al. (2005) the RIS algorithm is outperformed by all the other algorithms only on function \(f_4\) which, being subject to a Gaussian noise (\(\mathcal{N }(0,1)\)), is more suited for a population-based algorithm (Arnold and Beyer 2003). It can also be observed that RIS achieves the best performance in \(9\) cases, 3SOME also in \(9\) cases, JADE in \(8\) cases, while CLPSO in only one case. Nonetheless, regarding problems \(f_1\), \(f_2\), \(f_9\), \(f_{18}\), \(f_{19}\), and \(f_{20}\), despite the fact that 3SOME is possibly slightly more robust, RIS and 3SOME detect very similar solutions. On the contrary, in most of the cases where the RIS algorithm appears to be promising against the 3SOME scheme, there is an important margin of difference in terms of final fitness value. Equally relevant results are displayed in Table 2, for the testbed in Hansen et al. (2010) in \(100\) dimensions, and in Table 3 for the testbed in Tang et al. (2007) in \(1000\) dimensions. In particular, Table 3 highlights an extremely good behavior of the proposed algorithm over large-scale separable problems. In fact, RIS widely outperforms JADE on a regular basis and is significantly outperformed by CLPSO in only one case (see function \(f_4\)). With respect to the 3SOME algorithm, RIS appears to detect better solutions in most of the analysed cases. It should be remarked that RIS makes use of two of the operators contained in the 3SOME framework but combines them according to a slightly different logic/structure. In this sense, RIS can be seen as a simpler algorithm with respect to its predecessor, thus further confirming the concept of Ockham’s Razor in MC. The comparison against MDE-pBX shows that RIS displays a better performance for all the groups of dimensionality values considered in this article. Regarding the comparison against CCPSO2, RIS tends to outperform it in \(30\) and \(100\) dimensions. For large scale problems, CCPSO2 displays a good performance and slightly outperforms the proposed RIS. A reversed situation occurs for the MA-CMA-Chains algorithm. The latter is very efficient in \(30\) and \(100\) dimensions, where it clearly outperforms RIS. On the other hand, this trend is not confirmed in high dimensions, since RIS statistically outperforms MA-SSW-Chains on all the problems in \(1000\) dimensions.

Numerical results reported in Tables 9, 10, 11, and 12 show that RIS outperforms cDE on most of the problems under analysis. On the other hand, PMS appears to be a challenging competitor. The RIS and PMS algorithms are both based on a memetic philosophy and both employ the Ockham’s Razor principle as a bottom line for the design. However, while RIS is extremely simple, as it samples a new point and exploits the resulting search direction by means of a local search, PMS is more powerful and sophisticated since it makes use of two complementary search strategies, (see Caraffini et al. 2013). Although PMS is an excellent framework, it pays for its very good performance with a higher computational complexity and memory requirement with respect to RIS, see Sect. 3.3.

3.1 The effect of the inheritance

As mentioned above, the inheritance mechanism within the re-sampling appears to beneficially affect the performance of the algorithm. A direct comparison has been performed between RIS and a variant which is identical to it except that it does not employ the inheritance (the re-sampling occurs by simply generating another point within \(\mathbf{D}\)). This RIS variant without inheritance is called Re-sampling Search (RS). In order to illustrate the advantages of the inheritance, Table 13 displays the results of RIS and RS for the CEC2005 benchmark proposed in Suganthan et al. (2005). It can easily be observed that RIS is at least as good as its variant without inheritance and, in some cases, appears significantly more promising. Similar results have been obtained for the other benchmarks/dimensionality values under consideration in this article but, for the sake of brevity, only the results in Table 13 are shown.

Table 13 Average fitness \(\pm \) SD and Wilcoxon Rank-Sum test on the fitness (reference \(=\) RIS) for RIS against its variant without inheritance, RS, on CEC 2005 (Suganthan et al. 2005) in \(30\) dimensions

For the sake of completeness Figs. 6, 7, 8, and 9 show the average (over \(100\) runs) performance trends for four optimization problems amongst the \(76\) under consideration.

Fig. 6
figure 6

Performance trend for \(f_{25}\) from CEC2005

Fig. 7
figure 7

Performance trend for \(f_{24}\) from BBOB2010

Fig. 8
figure 8

Performance trend for \(f_{3}\) from CEC2008

Fig. 9
figure 9

Performance trend for \(f_{11}\) from CEC2010

3.2 Statistical ranking by means of Holm-Bonferroni procedure

In addition to the results presented above, the ranking among all the algorithms considered in this article has been performed by means of the Holm-Bonferroni procedure, (see Holm 1979; Garcia et al. 2008), for the \(10\) algorithms under study (MA-CMA-Chains and MA-SSW-Chains have been considered as a unique algorithm for performing this test, indicated as MACh) and the \(76\) problems under consideration. The Holm-Bonferroni procedure consists of the following. Considering the results in the tables above, the \(10\) algorithms under analysis have been ranked on the basis of their average performance calculated over the \(76\) test problems. More specifically, a score \(R_i\) for \(i = 1,\dots ,N_A\) (where \(N_A\) is the number of algorithms under analysis, \(N_A = 10\) in our case) has been assigned in the following way: for each problem, a score of \(10\) is assigned to the algorithm displaying the best performance, \(9\) is assigned to the second best, \(8\) to the third, and so on. The algorithm displaying the worst performance scores \(1\). For each algorithm, the scores obtained on each problem are summed up and averaged over the number of test problems (\(76\) in our case). On the basis of these scores the algorithms are sorted (ranked). With the calculated \(R_i\) values, RIS has been taken as the reference algorithm. Indicating with \(R_0\) the rank of RIS, and with \(R_j\) for \(j = 1,\ldots ,N_A-1\) the rank of one of the remaining nine algorithms, the values \(z_j\) have been calculated as

$$\begin{aligned} z_j = \frac{R_j - R_0}{\sqrt{\frac{N_A(N_A+1)}{6N_{TP}}}} \end{aligned}$$
(7)

where \(N_{TP}\) is the number of test problems under consideration (\(N_{TP} = 76\) in our case). By means of the \(z_j\) values, the corresponding cumulative normal distribution values \(p_j\) have been calculated. These \(p_j\) values have then been compared with the corresponding \(\delta /j\), where \(\delta \) is the level of confidence, set to \(0.05\) in our case. Table 14 displays the ranks, \(z_j\) values, \(p_j\) values, and corresponding \(\delta /j\) obtained in this way. The rank of RIS is shown in parenthesis. Moreover, it is indicated whether the null-hypothesis (that the two algorithms have indistinguishable performances) is “Rejected”, i.e. RIS statistically outperforms the algorithm under consideration, or “Accepted” if the distribution of values can be considered the same (there is no out-performance).
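
For illustration, the procedure can be sketched as follows (helper and variable names are ours; the average scores \(R_i\) would be computed from the tables above):

```python
import numpy as np
from scipy.stats import norm

def holm_bonferroni(names, scores, ref="RIS", delta=0.05, n_tp=76):
    """Holm-Bonferroni procedure of Eq. (7): z_j from the average
    scores, p_j from the normal CDF, compared against delta/j."""
    n_a = len(names)
    se = np.sqrt(n_a * (n_a + 1) / (6.0 * n_tp))
    r0 = scores[names.index(ref)]
    rows = [(nm, norm.cdf((r - r0) / se))        # (name, p_j)
            for nm, r in zip(names, scores) if nm != ref]
    rows.sort(key=lambda t: t[1])                # smallest p_j first
    for j, (nm, p) in enumerate(rows, start=1):
        verdict = "Rejected" if p < delta / j else "Accepted"
        print(f"{nm}: p_j={p:.2e}  delta/j={delta / j:.2e}  {verdict}")
```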

Table 14 Holm-Bonferroni procedure on the algorithms under consideration, reference algorithm RIS (Rank \(= 6.46e+00\))

As shown in Table 14, the proposed RIS is ranked third amongst all the algorithms considered in this study. The best two algorithms, PMS and CCPSO2, are characterized by the same ranking, which is very close to the RIS ranking. The RS algorithm, which is as simple as RIS, is ranked right after RIS. The MACh, MDE-pBX, 3SOME, and CLPSO algorithms also display a respectable performance. As a general consideration, the algorithms exhibiting the highest ranks are much simpler than other modern algorithms. This finding confirms, in accordance with the Ockham’s Razor in MC, that the performance of simple, properly designed algorithms can be as good as (or even better than) the performance of modern complex algorithms composed of multiple and computationally expensive components.

3.3 Memory and computational overhead

In order to summarize the differences, in terms of computational requirements, among the algorithms under investigation, Table 15 displays the main features and required memory slots (candidate solutions saved in memory) of each of them. \(N_p\) indicates the population size.

Table 15 Components and memory requirement of the algorithms under consideration

Figure 10 displays the average (over \(30\) runs) computational overhead of the algorithms under examination depending on the problem dimensionality \(n\). Each run has been continued for \(10000\) fitness evaluations. By the computational overhead of an algorithm we mean here the calculation time of a run without the time required to perform the fitness evaluations.

Fig. 10
figure 10

Average computational complexity (overhead vs. dimensionality) of the algorithms under consideration

It can easily be observed that RIS (as well as 3SOME) is much less demanding in terms of memory usage with respect to the population-based algorithms. In addition, RIS is characterized by a more modest computational overhead than that of all the other algorithms. Most importantly, the overhead of RIS grows with the dimensionality more slowly than the overhead of the other algorithms. It is interesting to note that the overhead of RIS is much lower even than the overhead of its predecessor 3SOME. In addition, as shown in Fig. 10, the algorithms characterized by the most modest overhead, i.e. RIS and CCPSO2, are also those that exhibit the best performance. This fact confirms that the computational cost is not directly correlated with the algorithmic performance.

Although a proof of algorithmic convergence of RIS is not given in this paper, a few further considerations are offered here to better understand its algorithmic functioning in the light of the displayed results. Since RIS, besides the re-sampling mechanism, attempts to improve upon the solution by an exploitative local search that performs movements along the axes, it is suitable for tackling separable problems. It can be observed from the numerical results that RIS succeeds at solving several of the separable problems considered, such as \(f_1\) and \(f_2\) from the CEC2005 testbed. In other separable cases, although RIS does not solve the problem, it still achieves a better fitness value than that achieved by all the other algorithms. According to our interpretation, the RIS success is due to the efficiency of the search along the axes. In the non-separable cases, especially multi-modal ones, PMS often displays a better performance than that of RIS. In our opinion, these results can be explained by considering that PMS combines two diverse search operators: the first moves along the axes, while the second employs the exploration logic of the Rosenbrock algorithm, (see Rosenbrock 1960), and thus performs diagonal moves within the decision space, (see Caraffini et al. 2013). Although an operator that uses diagonal moves while following the variations of the gradient appears to have some impact on the robustness of the algorithm, its cost is an \(n\, \times \, n\) matrix and a non-negligible overhead, see Table 15 and Fig. 10. In addition, the simple combination of moves along the axes and the re-sampling mechanism of RIS appears to lead to a respectable performance also for non-separable problems, as shown in Iacca et al. (2012a). In this light, RIS can be a good scheme for those applications where hardware limitations (cost and space) as well as real-time requirements impose a fast algorithmic response and a modest memory usage.

3.4 Tuning of \({\varvec{\rho }}\)

On the basis of our experience, the vector parameter \(\varvec{\rho }\) plays a crucial role in the RIS performance. In order to justify our choice and systematically offer a hint on the setting of this parameter, the following procedure has been designed. We considered four test problems, i.e. \(f_1,\,f_6,\,f_{13}\), and \(f_{15}\) from CEC2005. We chose these problems in order to have a diverse reduced testbed that includes separable and non-separable problems as well as uni-modal and multi-modal problems. For these optimization problems, RIS has been run with various \(\rho \) values. Since the problems are defined in hyper-cubical spaces, \({\varvec{\rho }}\) is a vector of identical numbers. Thus, this vector parameter is here indicated as a scalar \(\rho \). For \(\rho \) values equal to 0.1, 0.2, 0.3, 0.4, and 0.5, respectively, RIS has been run 30 times for \(5000 \times n\) fitness evaluations. Table 16 shows the average final error for the problems under consideration.

Table 16 Average fitness \(\pm \) SD and Wilcoxon Rank-Sum test (reference \(=\) RIS(0.4)) for RIS with various \(\rho \) values on CEC2005 (Suganthan et al. 2005) in \(30\) dimensions

Numerical results show that small \(\rho \) values can be inadequate, as each local search activation can be excessively exploitative. Since a proper global search is missing within the RIS framework, a large initial radius appears to be beneficial to start the local search. On the other hand, an excessively large radius, such as \(0.5\), is likely to initially generate solutions outside the decision space (the bounds are then handled by the toroidal technique described above). Although a proper setting of \(\rho \) obviously depends on the problem, as the numerical results show, the setting of \(0.4\) appears to be a reasonable compromise that offers a reliable algorithmic behaviour.

4 Application case: tuning of a control system for a helicopter robot

The proposed RIS has also been tested on a real-world problem, i.e. the tuning of a yaw (heading) controller of an autonomous small indoor helicopter. The problem under investigation consists of the design of a control system that allows the tail of a helicopter to keep a desired position, namely the set-point, and to go back to this position when a disturbance occurs (e.g. wind). For this application a standard integral-state limited Proportional, Integral, and Derivative (PID) control system, (see Wescott 2000), has been selected, while RIS has been used to tune the parameters of the control system.

Our hardware setup consists of an autonomous small indoor helicopter, namely the Flyper. This device is characterized by a rotor span of \(34\) cm and a total weight of \(191\) g. Four actuators enabling the control of all six degrees of freedom are integrated into the system. Two motors which independently control the speed of each rotor, giving combined control over altitude and yaw, are present on-board. The two engines and rotors of this dual coaxial rotor helicopter spin in opposite directions, thus creating opposite torque effects that cancel each other out. If one rotor’s speed is reduced whilst the other’s speed is increased by an identical amount, the heading changes whilst the amount of lift is maintained. As part of the embedded system, a digital compass is used to determine the current heading. The sensor is connected to a micro-controller which handles all on-board computation, sensor inputs, motor outputs, and the serial communication used to transfer information to and from a base station.

In order to properly tune the control system, the dual rotor helicopter is attached to a ball-bearing supported turn table, restricted to turn up to \(90^{\circ }\) and \(-90^{\circ }\) from its middle position. A fan is used to cool down the helicopter’s motors and the embedded system in between controller fitness evaluations. Figure 11 gives a graphical representation of the controlled system, the base station running the algorithms, and the communication between them.

Fig. 11
figure 11

Experimental setup of evolutionary heading controller tuning

The main disadvantage of helicopter-based robotic platforms is that they are highly nonlinear and unstable. As a consequence, the helicopter tends to be very sensitive to external disturbances (Bagnell and Schneider 2001), and therefore difficult to control. For this reason, the control system for an autonomous helicopter must be able to quickly compute the control response and promptly react to any disturbance.

PID control systems are characterized by an overall control response equal to:

$$\begin{aligned} PID_{out} = P_{out} + I_{out} + D_{out} \end{aligned}$$
(8)

where \(P_{out},\,I_{out}\), and \(D_{out}\) are the individual control method outputs. The input to the individual methods is the error \(e\) at time \(t\), which is defined as

$$\begin{aligned} e(t) = s - m(t) \end{aligned}$$
(9)

where \(s\) is the set-point and \(m\) the actual angle measurement at time \(t\). The standard form of the individual control method outputs is given by:

$$\begin{aligned} P_{out}&= K_p e(t) \end{aligned}$$
(10)
$$\begin{aligned} I_{out}&= K_i \int _0^t \! e(t) \mathrm{d}t \end{aligned}$$
(11)
$$\begin{aligned} D_{out}&= K_d \frac{\mathrm{d}e(t)}{\mathrm{d}t} \end{aligned}$$
(12)

where \(K_p,\,K_i\), and \(K_d\) are the proportional, integral, and derivative gains respectively.

The integral-state limited PID controller is a modern variant of the traditional PID controller that limits the integral state variable to a lower and an upper bound, \(I_l\) and \(I_u\) respectively, (see Wescott 2000). If the bound values are properly selected, the introduction of these limits mitigates the undesired overshoots caused by integrator windup.
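
For illustration, one control cycle of such a controller can be sketched as follows (our own formulation, which assumes the integral state is clamped after accumulation; Wescott (2000) describes the actual variant; the default cycle time of 0.1 s matches the 100 control cycles per 10 s reported below):

```python
def pid_step(e, state, Kp, Ki, Kd, I_l, I_u, dt=0.1):
    """One cycle of the integral-state limited PID, Eqs. (8)-(12).
    `state` carries the integral state and the previous error."""
    # accumulate and clamp the integral state to [I_l, I_u]
    state["I"] = min(max(state["I"] + e * dt, I_l), I_u)
    P_out = Kp * e                               # Eq. (10)
    I_out = Ki * state["I"]                      # Eq. (11)
    D_out = Kd * (e - state["e_prev"]) / dt      # Eq. (12)
    state["e_prev"] = e
    return P_out + I_out + D_out                 # Eq. (8)
```

Starting from `state = {"I": 0.0, "e_prev": 0.0}`, the function would be invoked once per control cycle with the error \(e(t) = s - m(t)\) of Eq. (9).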

Optimization algorithms have been widely used in the literature to tune PID control systems, (see e.g. Fleming and Purshouse 2002; Passow et al. 2008; Iacca et al. 2012b). In the majority of cases, a simulation model of the corresponding system is used to evaluate the fitness of each candidate solution (this kind of optimization is also known as off-line), (see De Moura Oliveira 2005). In order to build such a model, an extensive knowledge of the system and a thorough parameter identification are obviously required. Since, due to the presence of disturbances and unforeseeable events, some physical systems can be very hard to model, the optimization may preferably be performed on the actual plant (on-line optimization), with the fitness values measured as the result of one or more experiments, (see Caponio et al. 2010). The difference in terms of control response of a PID controller in off-line and on-line optimization is reported in Caponio et al. (2007). In the present case, a realistic model taking into account all the dynamics of the helicopter can be extremely hard to build, as explained in Cai et al. (2010). In this work, rather than optimizing the controller using a simulation of the system (which would likely be unreliable), the robot itself is used for the optimization and evaluation of its controller. It must be remarked that a very good optimization algorithm run over an inaccurate model would detect a control system that would likely not be efficient on the actual robot. Moreover, on-line optimization implicitly performs a system identification which can subsequently be used to build an accurate and realistic simulator.

Each fitness evaluation takes about 20 s, with 20 additional seconds to cool down the system. The fan is switched off while candidate solutions are evaluated. For each fitness evaluation the following operations have been performed. At first the motors are switched on in the proximity of the set-point, which is set to zero. Then, a perturbation of \(+90^{\circ }\) is performed (the helicopter tail is forced to one of the extreme positions) and 100 control cycles (equivalent to 10 s) are allowed to the control system for its reaction. Subsequently, the symmetrical perturbation of \(-90^{\circ }\) is performed, followed by 10 s of waiting time to allow the control response. Then for 20 s the helicopter is cooled down by the fan. The fitness to be minimized is the quadratic position error related to the helicopter tail. More formally, our optimization problem consists of finding those values of \(K_p\), \(K_i\), and \(K_d\) from Eqs. (10), (11), and (12), as well as the values of the lower and upper bounds \(I_l\) and \(I_u\), such that the following error function is minimized:

$$\begin{aligned} f\left( K_p, K_i, K_d, I_l, I_u \right) = \sum \limits _{t=1}^{100}{(h_t - s)^2} + \sum \limits _{t=101}^{200}{(h_t - s)^2} \end{aligned}$$
(13)

where \(t\) is the time in control cycles, \(h_t\) is the aircraft heading at time \(t\) (measured by the compass sensor), and \(s\) is the desired set-point. From \(t=1\) to \(t=100\) (the above-mentioned 10 s) the reaction of the control system to a \(+90^{\circ }\) perturbation is tested, while from \(t=101\) to \(t=200\) the reaction to a \(-90^{\circ }\) perturbation is considered. Between the two perturbations, the control system is stopped and its performance is not evaluated. The two terms of the sum in Eq. (13) have been intentionally kept apart in order to emphasize the separation of the two perturbation actions. The decision space is a hyper-rectangle identified by the Cartesian product of the intervals given by the range of variability of each parameter. More specifically, since \(K_p \in \left[ 0,2\right] ,\,K_i \in \left[ 0,1\right] ,\,I_l \in \left[ -400,0\right] ,\,I_u \in \left[ 0,400\right] \), and \(K_d \in \left[ 0,4\right] \), the decision space is \(\mathbf{D}=\left[ 0,2\right] \times \left[ 0,1\right] \times \left[ -400,0\right] \times \left[ 0,400\right] \times \left[ 0,4\right] \).
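
In code, the objective of Eq. (13) reduces to a summed squared error over the two recorded reaction windows (a sketch with our own names; `headings` would collect the 200 compass measurements):

```python
def control_fitness(headings, s=0.0):
    """Eq. (13): squared heading error over the reactions to the
    +90 and -90 degree perturbations (100 control cycles each)."""
    assert len(headings) == 200
    return (sum((h - s) ** 2 for h in headings[:100]) +
            sum((h - s) ** 2 for h in headings[100:]))
```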

4.1 Optimization results

In order to minimize the fitness function in Eq. (13), RIS has been run for 600 fitness evaluations with the parameter setting reported in Sect. 3. The performance of the RIS algorithm has been compared against one of the most promising optimization algorithms considered in this study according to the Holm-Bonferroni ranking (see Table 14), i.e. CCPSO2, and against its predecessor 3SOME. As for 3SOME, the same parameter setting shown in Sect. 3 has been used. Regarding CCPSO2, variable decompositions of size 1 and 5 (the only two possible values) have been allowed. The final results in terms of average values and standard deviation, as well as the Wilcoxon test, are reported in Table 17. The average performance trend of the three algorithms considered for this real-world application is displayed in Fig. 12.

Fig. 12
figure 12

Performance trend of RIS, 3SOME, and CCPSO2 for the helicopter control problem

Table 17 Average final fitness value obtained \(\pm \) SD and Wilcoxon test outcome (reference: RIS) on the real-world helicopter control problem

The best fitness value on a single run has been achieved by RIS. Table 18 displays the best parameters detected by each algorithm and the corresponding fitness values. Finally, Fig. 13 shows the control response associated with the parameters reported in Table 18. More specifically, the time evolution of the aircraft heading \(h_t\) corresponding to the three best control parameter sets is plotted. It can be observed how the helicopter heading returns to oscillate around the set-point after each perturbation.

Fig. 13
figure 13

Control signal related to the best solutions detected by RIS, 3SOME, and CCPSO2 for the helicopter control problem

Table 18 Best PID parameters obtained on the real-world experiments and associated fitness values

Experimental results thus show that RIS outperforms the 3SOME and CCPSO2 algorithms, confirming the value of the proposed approach. It must be remarked that, due to its modest computational overhead and memory requirement, RIS can easily be implemented directly within the micro-controller, thus avoiding the necessity of using an external computer. The embedded implementation of the optimization algorithm within the micro-controller would allow a real-time parameter tuning within the helicopter hardware, thus making it completely autonomous. In the future, efficient implementations of simple yet highly performing algorithms would allow an in-flight optimization.

5 Conclusion

This paper proposes a simple memetic approach for continuous optimization problems. The proposed algorithm (RIS) is composed of two memes, the first one sampling solutions within the entire decision space, the second performing a deterministic local search to exploit the solutions suggested by the first. Thus, the proposed algorithm can be seen as a local search which incorporates a re-starting technique. However, this re-start is not performed in a purely random way. On the contrary, it combines a certain degree of randomization with part of the genotype of the most promising solution (inheritance). The presence of inheritance appeared to be beneficial within the proposed scheme, as it allowed the algorithm to perform as well as or better than its variant which does not make use of it.

An extensive comparison with state-of-the-art algorithms (including complex schemes) over various problems showed that the proposed algorithm, despite its simplicity, is extremely efficient. The statistical ranking highlights that the proposed RIS displays a performance competitive with those of modern and much more sophisticated approaches. This finding confirms the validity of the Ockham’s Razor in Memetic Computing. In addition, RIS appears to be especially efficient at tackling large scale problems. This result can be seen as a consequence of the fact that exploitative approaches are likely more successful than exploratory ones in high dimensions. The real-world example reported in this study further confirms the efficiency of the proposed approach in industrial problems. Most importantly, due to its minimalistic structure, RIS is characterized by a modest memory requirement and computational overhead. These features, together with its high performance, make the RIS algorithm an appealing candidate for addressing those real-world problems that, like the one considered in this study, would benefit from an embedded and real-time implementation.