1 Introduction

Many algorithmic approaches have been proposed to solve complex optimisation problems, including constrained optimisation problems (COPs). Several approaches have been proposed to tackle the constraints in such problems. Much of this research has focused on applying differential evolution (DE) [12], particle swarm optimisation (PSO) [2] and evolution strategies (ES) [11] to numerical optimisation problems. To deal with the constraints, these algorithms have been combined with techniques such as penalty functions, special operators (which treat the constraints and the objective function separately) and decoder-based methods. We refer the reader to [7] for a survey of constraint handling techniques in evolutionary computation.

Many approaches have been used to compare and evaluate evolutionary algorithms. One is to determine which algorithm performs better on a set of continuous problems using benchmark sets [3, 5]. Recently, there has been increasing interest in analysing which problem features make a problem hard to solve. Initial studies in continuous optimisation have been carried out in [6]. Furthermore, techniques have been developed that generate a range of problem instances from easy to hard. The features of these instances are then analysed in order to find which of them make the problems hard or easy to solve. Generating instances that span the full range from easy to hard ensures that the knowledge obtained from this analysis is reliable.

Although no single feature by itself makes a problem hard to solve, constraints are assumed to play a major role in the difficulty of constrained continuous problems. An evolving approach for analysing constraint features and their effect on COP difficulty is discussed in [8, 9]. The idea is to evolve constrained problem instances (using an evolutionary algorithm) in order to identify the constraint features that contribute most to problem difficulty.

In this paper, we first use a single-objective evolutionary algorithm to generate COP instances that are hard and easy for the DE, ES and PSO algorithms. We then solve the instances generated for one algorithm with each of the other algorithms. The results show that instances evolved to be hard for one algorithm remain hard for the others. To gain better insight, we use a multi-objective evolving approach to generate instances that are hard for one algorithm but still easy for the others. By analysing how an algorithm fails in conditions where the others perform well, we can derive its strengths and weaknesses with respect to constraint features. Our study identifies the constraint features that make problems hard for one algorithm while keeping them easy for the others.

The remainder of this paper is organised as follows: In Sect. 2 we introduce the concept of COPs and discuss the evolver (the single- and multi-objective evolutionary approaches) and the solver algorithms (DE, ES and PSO) used in our experiments. In Sect. 3 we analyse the performance of the algorithms on each other's hard and easy instances (generated by the single-objective evolver). Section 4 presents the multi-objective approach that generates instances which are hard for one algorithm but easy for the others, together with an analysis of the linear and quadratic constraint features that cause this. Finally, we conclude with some remarks.

2 Preliminaries

2.1 Constrained Continuous Optimisation Problems

In this study, constrained continuous optimisation problems with inequality and equality constraints are investigated. These problems are optimisation problems where a function f(x) should be optimised with respect to a given set of constraints.

Single-objective functions \(f:S \rightarrow \mathbb{R}\) with \(S \subseteq \mathbb{R}^n\) are considered in this research. The constraints define a feasible subset \(F \subseteq S\) of the search space S, and the aim is to find \(x \in F\) which minimises f. Formally, we state the problem as follows:

$$\begin{aligned} \text{minimize} &\quad f(x), \quad x = (x_1,\ldots,x_n) \in \mathbb{R}^n \\ \text{subject to} &\quad g_i(x) \le 0 \quad \forall i \in \{1,\ldots,q\} \\ &\quad h_j(x) = 0 \quad \forall j \in \{q+1,\ldots,p\} \end{aligned}$$
(1)

where \(x = (x_1,x_2,\dots,x_n)\) is an n-dimensional vector and \(x \in F\). The inequality constraints \(g_{i}(x)\) and equality constraints \(h_{j}(x)\) can be linear or nonlinear. As is common practice, each equality constraint is replaced by \(|h_j(x)| \le \varepsilon\) with \(\varepsilon = 10^{-4}\) [5]. The search space S is bounded by

$$\begin{aligned} l_i \le x_i \le u_i,\quad \quad 1 \le i \le n \end{aligned}$$
(2)

where \(l_i\) and \(u_i\) denote the lower and upper bounds of the ith variable, \(1\le i \le n\). In this paper, we focus on the ability of linear and quadratic constraints to make a problem hard or easy, and we discuss the features of these constraints and their effect on problem difficulty. The constraints are of the following form:

$$\begin{aligned} \text {linear constraint} \quad g(x) = b + a_{1}x_{1} + \ldots + a_{n}x_n \end{aligned}$$
(3)
$$\begin{aligned} \text{quadratic constraint} \quad g(x) = b + a_{1}x_{1}^{2} + a_{2}x_{1} + \ldots + a_{2n-1}x_n^{2} + a_{2n}x_n \end{aligned}$$
(4)

or a combination of them, where \(x_{1},x_{2},\dots,x_{n}\) are the variables from Eq. 1 and \(a_{1},a_{2},\dots\) are coefficients within the lower (\(l_{i}\)) and upper (\(u_{i}\)) bounds. We assume univariate quadratic terms so that each \(x_{i}\) (with exponent 2) can be analysed independently; univariate quadratic constraints are also the more common form in recent benchmarks [5]. In order to include the optimum of the objective function in the feasible region, we set \(b\le 0\) (we assume the optimum of the objective function is at zero).
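For illustration, the following minimal sketch (with hypothetical helper names) shows how constraints of this form can be represented and how feasibility can be checked, with equality constraints relaxed by \(\varepsilon\) as described above:

```python
import numpy as np

EPS = 1e-4  # tolerance used to relax equality constraints, |h_j(x)| <= EPS

def linear_constraint(x, a, b):
    # g(x) = b + a_1*x_1 + ... + a_n*x_n   (Eq. 3)
    return b + np.dot(a, x)

def quadratic_constraint(x, a_quad, a_lin, b):
    # g(x) = b + sum_i (a_quad_i * x_i^2 + a_lin_i * x_i)   (Eq. 4, univariate terms)
    return b + np.dot(a_quad, x**2) + np.dot(a_lin, x)

def is_feasible(x, inequality_constraints, equality_constraints=()):
    # feasible iff every g_i(x) <= 0 and every |h_j(x)| <= EPS
    return (all(g(x) <= 0 for g in inequality_constraints) and
            all(abs(h(x)) <= EPS for h in equality_constraints))
```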

2.2 Algorithms

We now introduce the algorithms for constrained continuous optimisation that are subject to our investigation.

One of the most prominent evolutionary algorithms for COPs is the \(\varepsilon \)-constrained differential evolution with an archive and gradient-based mutation (\(\varepsilon \)DEag). The algorithm was the winner of the 2010 CEC competition on continuous COPs [5]. \(\varepsilon \)DEag uses the \(\varepsilon \)-constrained method to transform an algorithm for unconstrained problems into one for constrained problems. It adopts an \(\varepsilon \)-level comparison to order candidate solutions: a lexicographic order in which the constraint violation \(\phi (x)\) has priority and precedes the function value f(x). For more details we refer the reader to [13].
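As a rough illustration of the \(\varepsilon \)-level comparison (a simplified sketch of the idea, not the \(\varepsilon \)DEag implementation from [13]), two solutions with objective values f and constraint violations \(\phi \) could be compared as follows:

```python
def epsilon_less_than(f1, phi1, f2, phi2, eps_level):
    """Epsilon-level comparison: return True if solution 1 is preferred.
    When both constraint violations are within eps_level (or are equal),
    the objective values decide; otherwise the smaller violation wins."""
    if (phi1 <= eps_level and phi2 <= eps_level) or phi1 == phi2:
        return f1 < f2
    return phi1 < phi2
```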

The second algorithm we use in this paper is a \((1+1)\) CMA-ES for constrained optimisation [1]. It is based on the \((1+1)\) CMA-ES of [4], a variant of the \((1+1)\)-ES which adapts the covariance matrix of its offspring distribution in addition to its global step size. The idea behind the constraint handling approach of this algorithm is to obtain approximations to the directions of the constraint normal vectors in the vicinity of the current solution by low-pass filtering the steps that violate the respective constraints, and to reduce the variance of the offspring distribution in these directions. Incorporating this constraint handling approach into the \((1+1)\) CMA-ES yields an algorithm that is significantly more efficient than other constrained evolutionary algorithms. Moreover, the algorithm is not sensitive to rotations of the problem search space. We refer the reader to [1] for more details and the implementation.
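The following is a rough, simplified sketch of this constraint-handling step as we understand it from [1]; the parameter names (c_cov, beta) and values are illustrative, and the real algorithm includes further components (step-size adaptation, success-based covariance updates) omitted here:

```python
import numpy as np

def constrained_mutation_step(x, sigma, A, v, constraints, rng,
                              c_cov=0.1, beta=0.1):
    """One (1+1)-ES mutation with a constraint-handling covariance update:
    steps that violate a constraint are accumulated in a low-pass filtered
    vector v[j] (approximating the constraint normal direction), and the
    variance of the offspring distribution is reduced along those directions."""
    n = len(x)
    z = rng.standard_normal(n)
    y = x + sigma * (A @ z)                            # candidate offspring
    violated = [j for j, g in enumerate(constraints) if g(y) > 0]
    if not violated:
        return y, A, v, True                           # feasible: evaluate f(y) outside
    for j in violated:
        v[j] = (1 - c_cov) * v[j] + c_cov * (A @ z)    # low-pass filter of violating steps
    reduction = np.zeros((n, n))
    for j in violated:
        w = np.linalg.solve(A, v[j])                   # w_j = A^{-1} v_j
        reduction += np.outer(v[j], w) / (w @ w)
    A = A - (beta / len(violated)) * reduction         # shrink variance along these directions
    return x, A, v, False                              # infeasible offspring is rejected
```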

The third algorithm used in our investigation is a particle swarm optimisation algorithm (HMPSO). It applies a parallel search operator that divides the current swarm into several sub-swarms. In each sub-swarm, all particles follow the local best (the fittest particle), which pulls them towards fitter positions. Since the sub-swarms are located around different optima (in parallel), the algorithm is more likely to locate multiple optima, which improves its diversity. We refer the reader to [15] for the detailed algorithm and implementation.
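A hypothetical sketch of the sub-swarm idea follows (the actual partitioning and update rules of HMPSO [15] differ in their details):

```python
import numpy as np

def split_into_subswarms(fitness, subswarm_size, rng):
    """Partition the swarm into sub-swarms and pick a local best per sub-swarm.
    Hypothetical (random) partitioning for illustration; within each sub-swarm
    every particle is attracted to its local best, so different sub-swarms can
    converge to different optima in parallel."""
    indices = rng.permutation(len(fitness))
    subswarms = [indices[i:i + subswarm_size]
                 for i in range(0, len(indices), subswarm_size)]
    local_bests = [min(group, key=lambda i: fitness[i]) for group in subswarms]
    return subswarms, local_bests
```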

2.3 Features of Constraints

In this paper we analyse the constraint features of the generated problem instances. These features capture relationships between the constraint coefficients, such as their standard deviation, the angle between constraint hyperplanes, the feasibility ratio in the vicinity of the optimum, the number of constraints and the shortest distance from a constraint hyperplane to the optimum. The details of these features are discussed in [8].
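For illustration, a few of these features could be computed as in the following sketch (hypothetical helper functions; the exact feature definitions are given in [8]):

```python
import numpy as np

def coefficient_std(a):
    # spread of the coefficient values of one constraint
    return float(np.std(a))

def angle_between_hyperplanes(a1, a2):
    # angle (degrees) between the normal vectors of two linear constraint hyperplanes
    cos = np.dot(a1, a2) / (np.linalg.norm(a1) * np.linalg.norm(a2))
    return float(np.degrees(np.arccos(np.clip(cos, -1.0, 1.0))))

def shortest_distance_to_optimum(a, b, optimum):
    # distance from the optimum to the hyperplane b + a . x = 0
    return abs(b + np.dot(a, optimum)) / np.linalg.norm(a)
```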

3 Single-Objective Investigations

We first consider the different algorithms and compare their relative performance on each other's generated hard and easy instances. We use the single-objective evolver to generate hard and easy instances for each algorithm; the detailed procedure and the results for DE instances are discussed in [8]. For this experiment, we perform 30 independent runs generating easy and hard instances for the PSO and ES solvers. That is, the single-objective evolver generates instances that are hard or easy for only one type of algorithm (PSO, ES or DE). The function evaluation number (FEN) required by the respective solver (PSO, ES or DE) to solve an instance is used as the fitness value of the single-objective evolver. The parameters of the solvers are identical to those in [1, 13, 15], and we run our experiments on the Sphere function (bowl shaped) [3]. This gives three groups of easy and hard instances, generated for the DE, ES and PSO algorithms. We then compare the DE, ES and PSO algorithms by applying them to each other's easy and hard instances. The analysis compares the FEN an algorithm requires to solve the instances generated for the other algorithms; strengths and weaknesses can then be derived by observing how well one algorithm performs in conditions where the other algorithms fail (or find it difficult). Tables 1 and 2 show the performance of the different algorithms on the Sphere objective function with linear/quadratic constraints (1 to 5 constraints). Considering the FEN required to solve each instance, we observe that hard instances are hardest for the algorithm they were evolved for and remain hard for the others. This implies that the hard instances share some common features that make them difficult for all solvers. However, this knowledge is not sufficient to compare the capabilities of the algorithms on hard problem instances.
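Conceptually, the fitness evaluation of the single-objective evolver can be sketched as follows (the solver callable and the genotype decoding are assumptions for illustration; the actual procedure is described in [8]):

```python
import numpy as np

def build_constraints(coefficients, offsets):
    # decode the evolver's genotype: one row of linear coefficients per constraint
    return [lambda x, a=a, b=b: b + np.dot(a, x)
            for a, b in zip(coefficients, offsets)]

def instance_fitness(coefficients, offsets, solver, objective, fen_max=300_000):
    """Fitness of an evolved COP instance: the number of function evaluations
    (FEN) the chosen solver needs to solve it. `solver` is an assumed callable
    returning the evaluations used (capped at fen_max). The evolver maximises
    this value to obtain hard instances and minimises it to obtain easy ones."""
    constraints = build_constraints(coefficients, offsets)
    return solver(objective, constraints, budget=fen_max)
```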

Table 1. The comparison of algorithms performance on each other’s easy and hard instances based on required FEN for Sphere objective function. DE Easy (1 c) means instances that are easy for DE and with 1 linear constraint.
Table 2. The comparison of algorithms performance on each other’s easy and hard instances based on required FEN for Sphere objective function. DE Easy (1 c) means instances that are easy for DE and with 1 quadratic constraint.

4 Multi-objective Investigations

Based on the experimental results in the previous section, instances that are hard for one algorithm are still hard for the others. In order to extract more useful knowledge about the strengths and weaknesses of certain algorithms on constrained problems, we need problem instances that are hard for one algorithm and easy for the others. Analysing the features of these instances helps us extract knowledge about the strengths and weaknesses of the algorithms by examining why one algorithm performs better on some groups of features while the others fail. This will help us develop a more efficient prediction model for automated algorithm selection.

To do this, we use a multi-objective DE algorithm (DEMO), described in [10], to minimise the FEN for one algorithm and maximise it for the others. In other words, the generated problem instances require a higher FEN (harder) for a certain algorithm and a lower FEN (easier) for the others. To find such instances, we need solutions that are as diverse as possible and close to the Pareto front; satisfying these two aims motivates the use of a multi-objective evolutionary algorithm to generate the problem instances. Hence, we use differential evolution for multi-objective optimisation (DEMO) as proposed by Robic in [10], which achieves both goals efficiently. In DEMO, the candidate solution replaces the parent when it dominates it; if the parent dominates the candidate, the candidate is discarded; otherwise, if neither dominates the other, the candidate is added to the population. The major difference between DEMO and other multi-objective evolutionary algorithms is that newly generated good candidates are immediately used in the creation of subsequent candidates. This promotes fast convergence to the true Pareto front, while the use of non-dominated sorting and the crowding distance metric for truncating the extended population promotes a uniform spread of solutions. We refer the reader to [10] for further details and the implementation.
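The candidate-vs-parent replacement rule can be sketched as follows (a simplified illustration operating only on objective vectors):

```python
def dominates(u, v):
    # u dominates v if it is no worse in every objective and strictly better in at least one
    return all(a <= b for a, b in zip(u, v)) and any(a < b for a, b in zip(u, v))

def demo_step(population, parent_idx, candidate):
    """Replacement rule of DEMO [10] (simplified sketch): the candidate
    replaces the parent if it dominates it, is discarded if the parent
    dominates it, and is otherwise appended to the population. The extended
    population is later truncated to its original size by non-dominated
    sorting and the crowding distance metric."""
    parent = population[parent_idx]
    if dominates(candidate, parent):
        population[parent_idx] = candidate
    elif not dominates(parent, candidate):
        population.append(candidate)
    return population
```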

In the following, we discuss the results of the algorithm performance comparison. We carry out 30 independent runs for each number of constraints, generating instances that are hard for one algorithm but still easy for the others. The evolving algorithm (DEMO) uses 5000 generations, a population size of 40, CR = 0.5, a scaling factor of 0.9 and \(FEN_{max}\) = 300K. These parameter values were obtained by tuning the evolving algorithm so that it produces the easiest and hardest possible problem instances. For each of the three solvers, their best-known parameters are chosen [3, 13, 15]. For \(\varepsilon \)DEag we use: 1500 generations, population size 100, CR = 0.5 and scaling factor 0.5; the parameters of the \(\varepsilon \)-constrained method are as described in [8]. For the evolution strategy we run a (1,7)-ES for 1500 generations with \(P_{f} = 0.4\), which biases the search towards feasible solutions. In HMPSO, the swarm size N is set to 60, each sub-swarm size (\(N_{s}\)) is 8, and all PSO parameters follow Krohling and Coelho's PSO [14]; the HMPSO generation number is set to 1500. The solver parameters are identical to those given in [1, 10, 13, 15].

In all our experiments, we generate sets of problem instances that are hard for one algorithm and easy for the others. Tables 3, 4, 5, 6, 7 and 8 show the function evaluation number (FEN) required by each algorithm to solve the DE/ES/PSO hard instances for the Sphere, Ackley and Rosenbrock objective functions (with 1 to 5 linear/quadratic constraints). The difference in required FEN between the algorithms is larger for the instances generated by the multi-objective evolver than for those generated by the single-objective one. For instance, the FEN required to solve the DE hard instances is higher for the DE algorithm than for the ES and PSO algorithms, i.e. the DE hard instances are hard only for DE and easy for the others. In the following we analyse the constraint features of these instances that are hard for one algorithm and easy for the others.

Table 3. The FEN required for each algorithm to solve DE/ES/PSO hard instances (Sphere for 1 to 5 linear constraints)
Table 4. The FEN required for each algorithm to solve DE/ES/PSO hard instances (Sphere for 1 to 5 quadratic constraints)
Table 5. The FEN required for each algorithm to solve DE/ES/PSO hard instances (Ackley for 1 to 5 linear constraints)
Table 6. The FEN required for each algorithm to solve DE/ES/PSO hard instances (Ackley for 1 to 5 quadratic constraints)
Table 7. The FEN required for each algorithm to solve DE/ES/PSO hard instances (Rosenbrock for 1 to 5 linear constraints)
Table 8. The FEN required for each algorithm to solve DE/ES/PSO hard instances (Rosenbrock for 1 to 5 quadratic constraints)

4.1 Analysis for Linear Constraints

We run our experiments on the Sphere, Ackley and Rosenbrock objective functions. The linear constraints are of the form of Eq. 3 with all coefficients \(a_{i}\) in the range \([-5,5]\), and the problem dimension is set to 30. As mentioned before, to analyse features such as the shortest distance, we assume that the optimum is at zero (\(b\le 0\)). We use three types of problem instances: DE hard denotes instances that are hard for the DE algorithm but easy for the PSO and ES algorithms; ES hard instances are easy for DE and PSO; and PSO hard instances are hard for PSO but easy for the rest. Each set of constraints is generated by the multi-objective evolver so that the instance is hard for one algorithm but easy for the others. In the following we discuss the features of the linear constraints.

Figure 1 presents evidence of a relationship in the linear constraint coefficients (their standard deviation). The standard deviation of the coefficients of the (1 to 5) linear constraints is higher for DE hard instances than for ES and PSO hard ones, and this holds for the Sphere, Ackley and Rosenbrock objective functions alike. In other words, instances that are hard for the DE algorithm but easy for ES and PSO have a higher standard deviation of their constraint coefficients, so this constraint feature influences problem difficulty. This improves the predictive ability of an algorithm selection framework.

The box plots in Fig. 2 show the shortest-distance-to-optimum feature of the hard instances. In our experiments, instances that are hard for the ES algorithm have higher values of this feature than those of the other algorithms; note that a lower value in Fig. 2 means the constraint hyperplane is further from the optimum, so the constraint hyperplanes are closer to the optimum in ES hard instances. This pattern holds for all objective functions with linear constraints. We also study the feasibility ratio in the vicinity of the optimum. As shown in Table 10, DE hard instances have a lower feasibility ratio than PSO and ES hard instances, again for all objective functions considered, and increasing the number of constraints decreases the optimum-local feasibility for all algorithms' instances. Finally, the angle between constraint hyperplanes is analysed for the linear constraints. As shown in Table 9, ES hard instances have lower angle values for the Sphere, Ackley and Rosenbrock objective functions, i.e. instances that are hard for ES have smaller angles between their constraint hyperplanes. Interestingly, all objective functions used in this experiment follow the same relationship.
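The optimum-local feasibility ratio can be estimated by sampling around the optimum, for example as in the following sketch (the sampling radius and sample count are illustrative assumptions):

```python
import numpy as np

def local_feasibility_ratio(constraints, optimum, radius=1.0, samples=10_000, seed=1):
    """Monte-Carlo estimate of the feasibility ratio in the vicinity of the
    optimum: the fraction of points sampled in a box of the given radius
    around the optimum that satisfy all constraints g_i(x) <= 0."""
    rng = np.random.default_rng(seed)
    points = optimum + rng.uniform(-radius, radius, size=(samples, len(optimum)))
    feasible = sum(all(g(x) <= 0 for g in constraints) for x in points)
    return feasible / samples
```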

Comparing the instances, DE hard instances have a higher standard deviation of their linear constraint coefficients; this suggests that the DE algorithm is more sensitive to the coefficient standard deviation feature than the PSO and ES algorithms. Also, the optimum-local feasibility ratio is higher in ES and PSO hard instances than in DE hard ones, meaning that ES and PSO cope better with problems that have a higher feasibility ratio near the optimum. The shortest distance (i.e. the constraint hyperplanes are closer to the optimum) and the angle between hyperplanes are smaller for ES hard instances than for DE and PSO hard instances. Interestingly, these observations are consistent across all objective functions used. This feature-based analysis of linear constraints provides useful knowledge for implementing an algorithm selection framework.

4.2 Analysis for Quadratic Constraints

In this section, we carry out our experiments on the Sphere, Ackley and Rosenbrock objective functions with quadratic constraints (see Eq. 4), using the same setup as in the previous section. In the following we present a feature-based analysis of the constraints in DE, PSO and ES hard instances (which are easy for the other algorithms).

Figure 1 also shows evidence of a relationship in the quadratic constraint coefficients. Based on our experiments, within each constraint the quadratic coefficients contribute more to problem difficulty than the linear ones; in Eq. 4, for example, \(a_{1}\) contributes more than \(a_{2}\). As shown in the box plots, the standard deviation of the coefficients of the 1 to 5 quadratic constraints is higher for DE hard instances than for the hard instances of the other two algorithms. In contrast, our results show no systematic relationship between problem difficulty and the linear coefficients within the quadratic constraints; it is the quadratic coefficients that contribute to problem difficulty.
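For illustration, the quadratic and linear coefficients of a constraint in the form of Eq. 4 can be separated and analysed as follows (a minimal sketch):

```python
import numpy as np

def quadratic_linear_coefficient_std(a):
    """Separate the quadratic and linear coefficients of a constraint written
    in the form of Eq. 4, where the coefficients alternate
    (a_1 x_1^2 + a_2 x_1 + ... + a_{2n-1} x_n^2 + a_{2n} x_n), and return the
    standard deviation of each group."""
    a = np.asarray(a, dtype=float)
    quad, lin = a[0::2], a[1::2]
    return float(np.std(quad)), float(np.std(lin))
```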

Figure 2 also compares the shortest distance feature of DE, PSO and ES hard instances with quadratic constraints. In instances that are hard for ES and easy for the other algorithms, the quadratic constraint surfaces are closer to the optimum (zero); this applies to all objective functions considered. Calculating the angle feature for quadratic constraints does not show any systematic relationship to problem difficulty. The feasibility ratio near the optimum is also analysed for DE, ES and PSO hard instances: as shown in Table 11, the feasibility ratio of DE hard instances is lower than that of the other algorithms' hard instances, and all objective functions follow the same pattern. The number of constraints also has a systematic relationship with the feasibility ratio.

Based on these results, for COP instances with quadratic constraints, DE hard instances have a higher coefficient standard deviation than the hard instances of the other algorithms; that is, the DE algorithm has more difficulty than ES and PSO in solving instances whose quadratic constraints have a high coefficient standard deviation. Also, the quadratic constraints are closer to the optimum in ES hard instances than in those of the other algorithms, i.e. the ES algorithm is more affected by constraints that are close to the optimum. Moreover, the optimum feasibility ratio of DE hard instances is lower than that of PSO and ES.

Table 9. The angle feature for the Sphere objective function with linear constraints
Table 10. Optimum-local feasibility ratio of the search space near the optimum for 1, 2, 3, 4 and 5 linear constraints
Table 11. Optimum-local feasibility ratio of the search space near the optimum for 1, 2, 3, 4 and 5 quadratic constraints
Fig. 1. Box plots of the standard deviation of constraint coefficients for linear constraints with the Sphere (A), Ackley (C) and Rosenbrock (E) objective functions, and for quadratic constraints with Sphere (B), Ackley (D) and Rosenbrock (F). Each sub-figure shows hard instances (H) with 1 to 5 constraints; a label a/b/c denotes a: number of constraints, b: hard instance, c: the algorithm (DE/ES/PSO) the instance is hard for.

Fig. 2. Box plots of the shortest distance feature for linear constraints with the Sphere (A), Ackley (C) and Rosenbrock (E) objective functions, and for quadratic constraints with Sphere (B), Ackley (D) and Rosenbrock (F). Each sub-figure shows hard instances (H) with 1 to 5 constraints; a label a/b/c denotes a: number of constraints, b: hard instance, c: the algorithm (ES/PSO/DE) the instance is hard for.

5 Conclusion

In this paper, we carried out a performance comparison of algorithms on each other's constrained problem instances. We then analysed the features and characteristics of the constraints that make instances hard to solve for a certain algorithm but easy for the others. We observed that some constraint features contribute more to problem difficulty for certain algorithms. For linear constraints, features such as the coefficient relationship, the angle between hyperplanes, the optimum-local feasibility ratio and the shortest distance to the optimum play an important role in problem difficulty for the DE and ES algorithms. For quadratic constraints, the angle feature does not show any relationship to problem difficulty.

By analysing how well one algorithm performs in conditions where the other algorithms fail, we can derive its strengths and weaknesses on constrained problems. These results can help us improve the efficiency of an algorithm prediction model.