1 Introduction

Optimization problems can be classified according to the type of constraints (linear or nonlinear), optimization variables (continuous, discrete or binary) and solution region convexity, which is an important factor when determining the methodology implemented in its resolution (Calafiore and El Ghaoui 2014).

Classic optimization methods, when applied to solve nonconvex problems, tend to converge to local optima, since they are highly susceptible to initial conditions (Bazaraa et al. 2013). In order to solve this issue, the union of concepts from optimization and artificial intelligence made it possible to develop more efficient and intelligent heuristic strategies, named as meta-heuristics or computational intelligence (Neri and Cotta 2012). Computational intelligence is defined as the science that develops methodologies that are able to mimic similar behaviors of intelligent systems (human or bioinspired), aiming at solving complex optimization problems.

Given this context, in the last years, several researchers have been developing new approaches to solve optimization problems through computational intelligence techniques, being inspired by natural existing biological adaptation mechanisms. Algorithms inspired in the evolution of species theory, such as the genetic algorithm (GA) (Holland 1992), swarm intelligence, such as the particle swarm optimization (PSO) (Eberhart and Kennedy 1995), bats echolocation, such as the bat algorithm (BA) (Yang 2010), among others have been developed as a result of such an effort. These algorithms have presented great efficiency in solving not only engineering-related problems, but also in other fields of knowledge. Currently, these bioinspired algorithms are a part of a set of tools associated with a great number of high-technology projects.

One of the greatest challenges faced by computational intelligence techniques is the balance between conducting local or global search throughout the iterative process, generally performed through exploration coefficients that are updated over the course of iterations. This inherent limitation to the computational intelligence techniques may negatively affect the efficiency of these algorithms in solving large-sized optimization problems, making them stagnate at local optimum values.

In this sense, this paper presents an adaptation never seen in the literature, to perform the update of the exploration coefficient of the bioinspired computational intelligence technique known as grey wolf optimizer (GWO) (Seyedali et al. 2014), being this the main contribution of this work. The methodology validation will be performed through the comparison of the proposed approach with the GA, BA and PSO techniques in 10 benchmark functions. Besides that, to prove the effectiveness of the proposed methodology on solving real large-sized engineering optimization problems, a study case applied to the optimization of the layout of an offshore wind farm will be presented.

The content of this paper is divided as follows: Sect. 2 presents the computational intelligence technique known as GWO, as well its chaotic (Kohli and Arora 2018) and the proposed version by the authors. In Sect. 3, the proposed methodology is presented and validated on several benchmark functions and compared to the original and chaotic version of GWO, and also to the GA, BA and PSO techniques. Section 4.1 presents the wind farm layout optimization (WFLO) problem formulation, and the results obtained by the proposed methodology are discussed and compared to the aforementioned techniques. A conclusion is presented in Sect. 5.

2 Grey wolf optimizer: novel exploration coefficient

The GWO is a bioinspired meta-heuristic presented by Seyedali et al. (2014) in 2014, and it is an algorithm that is being applied in many researches (El-Fergany and Hasanien 2015; Routray et al. 2019; Faris et al. 2018; Yan et al. 2019; Mostafa et al. 2018; Verma et al. 2017; Hernndez et al. 2019; Panoeiro et al. 2020). The algorithm mathematically models the hunting and social behavior of grey wolves to solve optimization problems. Grey wolves generally live on a pack, that is basically composed of four types of wolves: alpha (\(\alpha \)), beta (\(\beta \)), delta (\(\delta \)) and omega (\(\omega \)).

The hunting strategy of grey wolves is divided into three steps, as follows: (1) tracking, chasing and approaching the prey; (2) chasing and encircling until stationary situation and (3) attacking the prey.

In the optimization context, the three best candidate solutions obtained by the algorithm are depicted by the \(\alpha \), \(\beta \) and \(\delta \) wolves. These solutions are used in the algorithm to govern the searching process. The remaining solutions are called \(\omega \), and over the course of iterations they encircle the \(\alpha \), \(\beta \) and \(\delta \) wolves in order to attempt to reach better solutions.

Algorithm 1 depicts a pseudocode of the GWO method. At first, a random population composed of \(N_{\text {wolves}}\) is created and evaluated through an objective function (OBF). Then, the hierarchy is established and the iterative process starts, until a stopping criteria are met. Then, from lines 7–11 the algorithm emulates the hunting behavior of grey wolves, and more details of the presented pseudocode can be found in reference (Seyedali et al. 2014).

figure a

As stated in Seyedali et al. (2014) and depicted in Fig. 1, the algorithm performs local search, i.e., attacks the prey, if \(|\overrightarrow{A}| < 1\), and global search, i.e., searches for other preys, if \(|\overrightarrow{A}| > 1\). As for the coefficient \(\overrightarrow{C}\) its value attenuates or increases the magnitude of the best solution during the searching mechanism. Vectors \(\overrightarrow{r_1}\) and \(\overrightarrow{r_2}\) are randomly distributed values \(\in \) [0,1] and help the algorithm to avoid stagnation at local minima. Parameters \(\overrightarrow{D_{\alpha }}\), \(\overrightarrow{D_{\beta }}\) and \(\overrightarrow{D_{\delta }}\) represent the distance between an omega wolf and the dominant wolves. Parameters \(\overrightarrow{X_{1}}\), \(\overrightarrow{X_{2}}\) and \(\overrightarrow{X_{3}}\) weigh this distance input by the parameter \(\overrightarrow{A}\) and defines if a local/global search will be performed. At last, line 11 depicts the position update taking into account the information from the dominant wolves \(\alpha \), \(\beta \) and \(\delta \). At the end of the iterative process, the best solution \(X_{\alpha }\) is returned.

Fig. 1
figure 1

Global and local searching components of GWO

Line 5 highlights the importance of the exploration coefficient \(\overrightarrow{a}_t\), since it governs whether the algorithm will perform a local or global search. In Seyedali et al. (2014), the authors proposed a linear decaying function in order to update the exploration coefficient \(\overrightarrow{a}_t\). In this paper, however, the authors propose a triangular-shaped function to update the exploration coefficient, giving rise to a version called proposed grey wolf optimizer (PGWO). Also, the concept of chaotic optimization is taken into account by the authors and two chaotic maps are implemented to replace the linear-decaying exploration coefficient \(\overrightarrow{a}_t\).

Swarm intelligence algorithms have proven themselves as an excellent tool to solve optimization problems. However, one of the greatest challenges faced by these algorithms is related to the stagnation in local optimum values after a certain number of iterations. To avoid such thing and improve algorithm’s performance, several works in the literature are proposing modifications in the original versions of the meta-heuristics. Within the scope of GWO, this is no different: in Kaur and Narang (2019), a hybrid version of the GWO, introducing three mutation strategies, is proposed. In Wen (2016) and Mittal et al. (2016), the authors propose a nonlinear adjustment of the exploration coefficient \(\overrightarrow{a}_t\). In Zhang et al. (2017), the opposite-learning technique is used to obtain a better set of initial solutions for the studied problem.

In Kohli and Arora (2018) the authors propose the use of chaotic maps to update parameters \(\overrightarrow{A}\), \(\overrightarrow{C}\) and \(\overrightarrow{a}_t\) to explore the search space more dynamically. The use of chaotic maps in the exploration parameters update of computational intelligence techniques has been widely studied by researchers in the past few years. In Gandomi and Yang (2014), chaotic maps are used to update parameters of the BA. In Gandomi et al. (2013), the authors evaluate the performance of 12 chaotic maps to update parameters of the firefly optimization (FO) algorithm. In Amir Hossein et al. (2013), the update is performed in the PSO method. At last, in Emary and Zawbaa (2016) the authors analyze the impact of chaotically updating the exploration coefficient in different meta-heuristics, among them the GWO, the ant lion optimizer (ALO) and the moth-flame optimizer (MFO). In general, by analyzing the works that introduce chaos in meta-heuristics, it can be observed that the updates following chaotic maps happen in a pseudorandom fashion, indicating a more dynamic searching process, contributing to avoid stagnation.

Following this line of research, this work implements 2 different chaotic maps, taken from Gandomi and Yang (2014), to update the exploration coefficient \(\overrightarrow{a}_t\). The algorithms following these chaotic updates are named chaotic grey wolf optimizer (CGWO) in this paper and are represented by Eqs. (1) (CGWO1) and (2) (CGWO2). Figure 2 graphically depicts the differences between the chaotic and linear updates.

$$\begin{aligned} x_{t+1}= & {} \mathrm{cos}(t \mathrm{cos}^{-1}(x_t)) \end{aligned}$$
(1)
$$\begin{aligned} x_{t+1}= & {} 1,07\left( 7,86 x_t - 23,31 x_{t}^2 + 28,75 x_{t}^3 - 13,3 x_{t}^4\right) .\nonumber \\ \end{aligned}$$
(2)
Fig. 2
figure 2

Exploration coefficient update

Besides the chaotic version of GWO, this work proposes the update of the exploration coefficient \(\overrightarrow{a}_t\) in a novel manner, not yet seen in the specialized literature. This novel approach aims at introducing to the GWO a more dynamic transition between local and global search, enabling it to perform local and global searches throughout the entire execution of the algorithm, improving its capability of avoiding local optimal values even in the last few iterations. This modification improves the algorithm’s performance, and it is the major contribution of this paper. The proposed modification gives rise to a version named PGWO by the authors and represents the main novelty and contribution of this paper. The proposed function consists on a triangle-pulsed wave, shown in Eq. (3). At this point, it is important to highlight that the authors have tested different similar functions, but chose to present this one at this work, since it presented the best results obtained so far. Figure 2 also depicts the proposed approach in comparison with the traditional linear update of the exploration coefficient originally proposed in Seyedali et al. (2014).

$$\begin{aligned} x_{t+1} = \frac{2}{\pi } |\mathrm{sin}^{-1}[\mathrm{sin}(\pi t)]|. \end{aligned}$$
(3)

3 Method validation

In order to validate the PGWO method, the authors compared its performance with the traditional version of the GWO algorithm, as well as its chaotic version, both presented in Sect. 2. Besides that, a comparison with respect to the well-known GA, BA and PSO techniques is also presented, in the optimization of 10 nonconvex benchmark functions.

Table 1 depicts the parameters adopted for each optimization method. All techniques populations were composed of 40 individuals, and the stopping criterion adopted for all algorithms was the maximum allowed number of 40 iterations. The remaining parameters shown in Table 1 are values recommended by the authors of the methods. It should be noticed that the GWO is the method with the least number of parameters to adjust.

Table 1 Parameters adopted for the compared meta-heuristics

In order to evaluate the PGWO robustness to solve a wide range of optimization problems, the authors performed 1000 simulations of each methodology to solve each one of the 10 benchmark functions. It should be highlighted that all methodologies started from the same initial solutions, in order to make the comparison with other techniques fairer. The mathematical functions as well as the coordinates of the global optimal results and the variables boundaries can be obtained from Ali et al. (2005).

The methodologies assertiveness rates, i.e., the percentage of simulations that achieved the global optimum result for each function, are depicted in Table 2, and it can be seen that the PGWO method obtained higher assertiveness rates for almost every benchmark function tested. In five opportunities, the proposed methodology reached the global optimum value for all 1000 simulations performed. The mean value of assertiveness shows that the PGWO has shown itself as a very competitive tool with respect to other methodologies to solve optimization problems, reaching the global optimum value in 97.4% of times, outperforming the original GWO version. It can also be highlighted that, for the functions under consideration, the chaotic versions of GWO did not have a positive impact in the algorithm’s overall performance.

Table 2 Methodologies assertiveness rates

Figure 3 depicts, in a form of a boxplot, the results shown in Table 2. From Fig. 3 it can be observed that the results obtained from the proposed method were less dispersed with respect to all other investigated methodologies, reinforcing the idea that this is a fairly robust method to solve nonconvex optimization problems. It can also be highlighted that the PGWO method presented the highest median value. This improvement is explained by the greater dynamics introduced by the novel exploration coefficient update, enabling the algorithm to escape from stagnating at local optimum values over the course of iterations.

Fig. 3
figure 3

Methodologies assertiveness over 10 benchmark functions

It can also be emphasized that the GWO, CGWO1 and CGWO2 techniques have also outperformed the GA, BA and PSO algorithms, indicating that the use of the three best solutions to govern the searching process is indeed effective, as stated in Seyedali et al. (2014).

3.1 Proposed grey wolf optimizer robustness regarding initial solution

Besides the capacity of finding good solutions, it is desirable that a computational intelligence technique performs well regardless of its initial set of solutions (Vasant et al. 2016). This characteristic may be understood as a way to measure a method’s robustness with respect to the initial set of candidate solutions.

To evaluate this important aspect, 1000 simulations of the 4 GWO-based methodologies were performed to minimize the Aluffi-Pentini function in two situations. For the first situation, all techniques (GWO, PGWO, CGWO1 and CGWO2) start from a fixed initial population, the same way as for the study regarding all benchmark functions previously presented. For the second situation, the algorithm’s repetition was made starting from different random initial solutions, coming from an uniform distribution.

Figure 4 depicts the boxplot of all methodologies in obtaining the optimal result of − 0.35 for the Aluffi-Pentini function. In this context, FIP stands for fixed initial population and VIP for variable initial population.

Fig. 4
figure 4

Methods robustness regarding initial solutions. (Aluffi-Pentini function)

In order to verify whether the samples depicted in Fig. 4 are statistically equal, a test hypothesis was performed. The t test for two samples, described in Montgomery and Runger (2010), allows one to verify the probability that the differences between two samples are not caused by chance, and it is called p value. It is usual to adopt a significance level of 5%, meaning that if the p value is less than 5%, the null hypothesis that establishes equality between samples is rejected.

Considering this significance level, the authors applied this hypothesis test for the samples obtained for each one of the 4 methodologies. The p values obtained were: (1) GWO: 0.09%; (2) PGWO: 90.37%; (3) CGWO1: 32.64%; and (4) CGWO2: 40.46%. For the PGWO, CGWO1 and CGWO2, the p values obtained were greater than 5%, meaning that the null hypothesis cannot be rejected.

The results from the t test for two samples indicate that, for the proposed method, the samples do not have significant differences, indicating the method’s robustness. Furthermore, the p value for the proposed method was the largest among all techniques studied, indicating its greatest robustness regarding initial solution conditions.

In addition, the authors make available the codes to evaluate all tested benchmark functions through the tested versions of the GWO via the following link: https://github.com/ufcod/pgwocodes.git.

4 Real problem: Wind Farm Layout Optimization

4.1 Problem Data

The development of renewable energy sources increases year after year worldwide. In this context, the use of wind energy has gained a lot of importance in the past decade. According to the last report produced by the Global Wind Energy Council (GWEC) (2018), in 2018 51.3 GW of power was added, totaling an installed capacity of 591 GW worldwide.

One of the factors that most affects the energy extraction of a wind farm is the turbines layout (Elkinton et al. 2005). The layout optimization of wind farms is normally given through the division of the terrain in cells, where each cell represents a possible position to allocate a wind turbine. Given the combinatorial nature of determining the optimal location of the wind turbines, the WFLO problem has been resolved through the application of computational intelligence techniques, able to solve large-sized combinatorial optimization problems in a reasonable computational time (Le and Vo 2017; Wu et al. 2020; Yang et al. 2019; Park et al. 2019; Pillai et al. 2018; Wilson et al. 2018; Feng and Shen 2015).

In this work, the authors implemented the 4 GWO versions explained in Sect. 2, as well as the GA, BA and PSO methods to determine the optimal layout of a wind farm of 500 MW of installed capacity. The wind farm under study has its area represented by a (\(10 \times 10\)) matrix, at which each cell has dimensions of 900 m \(\times \) 900 m, equivalent to 5 times the rotor diameter. This problem modeling is of binary nature, at which the total number of possible combinations is of \(2^{100}\), since each one of the 100 cells in the matrix can contain (‘1’) or not (‘0’) a wind turbine (Sittichoke et al. 2013). This huge number of combinations possibility demonstrates this problem complexity, justifying the use of computational intelligence techniques to solve it.

The wind turbine model used in this work is based on the DTU 10 MW (Bak et al. 2013), of 10-MW rated power. Since this work will optimize a wind farm of 500 MW of installed capacity, 50 units of the DTU10MW turbine will be considered in this study. Equation (4) describes the power curve, i.e., the electrical power extracted by this turbine in function of the speed of incident wind (u).

$$\begin{aligned} P_{\text {kW}}(u) = {\left\{ \begin{array}{ll} 0,&{}\quad \text {if}\,u< 4 \\ 7.0564u^3,&{}\quad \text {if}\,4 \le u< 11.4 \\ 10{,}000,&{}\quad \text {if}\,11.4 \le u < 25 \\ 0,&{}\quad \text {if}\,u \ge 25 \end{array}\right. } \end{aligned}$$
(4)

Regarding wind speed and direction, the authors consider a mean speed of 11.23 m/s, which corresponds to the rated wind speed of the DTU10MW turbine, and an incidence direction from north to south. Additional data necessary to model the problem, such as the rotor diameter, height of the hub, impulse coefficient (\(C_T\)) and roughness of the ground, are depicted in Table 3.

Table 3 Additional data

4.2 Wake effect

The conversion of the kinetic energy from the winds into electrical energy through the wind turbine is not ideal, i.e., there is a reduction of the wind’s speed when passing through the turbine. In a wind farm, composed of many turbines, the ones located upstream reduce the wind velocity that will reach downstream turbines. This phenomenon is known as wake effect. According to Barthelmie and Jensen (2010), in offshore wind farms this effect can reduce the energetic potential of the venture between 10 and 15%.

Given this context, Jensen (1983) presented a simplified model of the wake effect, modeling the wind behavior after interacting with a wind turbine. According to Jensen, the interaction between wind and turbine is based upon the principle of movement conservation, as depicted in Eq. (5), and it creates a conical region that has the same radius of the turbine and linearly expands with distance.

$$\begin{aligned} \pi r_\text {turb}^2 u + \pi (r_\text {est}^2 - r_\text {turb}^2)u_0 = \pi r_\text {est}^2 u_j \end{aligned}$$
(5)

where \(r_\text {turb}\) depicts the turbine radius and \(r_\text {est}\) the radius at a distance x from the wind turbine. The mean wind speed at the region is represented by \(u_0\), while the wind speed right after interacting with a turbine is represented by u. The wind speed at a distance x from the wind turbine is represented by the variable \(u_j\).

Figure 5 depicts the wake effect and its impact at the wind speed after interacting with the wind turbine i. The wind, with initial speed \(u_{0}\), has its velocity reduced at the moment of energy extraction, being the wind velocity that reaches a downstream turbine j given by Eq. (6).

$$\begin{aligned} u_{j} = u_0 \left[ 1 - \left( \dfrac{2a}{ 1+ \alpha \left( \frac{ x_{ij}}{r_{i}} \right) ^{2} } \right) \right] , \end{aligned}$$
(6)

where \(x_{ij}\) is the position of the wind turbine, and \(r_{i}\) is the effective radius of the upstream rotor that is related to the downstream rotor radius \(r_j\) according to (7). Besides that, a is related to the impulse coefficient \(C_T\), as shown in (8) and \(\alpha \) is a drag constant obtained from (9), where z is the hub height and \(z_0\) depicts the soil rugosity. This work considers \(z_0 = 0.0002\), corresponding to an offshore region.

$$\begin{aligned} r_i= & {} r_j\sqrt{\dfrac{1-a}{1-2a}} \end{aligned}$$
(7)
$$\begin{aligned} C_T= & {} 4a(1-a) \end{aligned}$$
(8)
$$\begin{aligned} \alpha= & {} \dfrac{0,5}{\ln \left( \dfrac{z}{z_0} \right) }. \end{aligned}$$
(9)
Fig. 5
figure 5

Wake effect

In case there is more than one turbine impacting the wind velocity, the energy decrease at each upstream turbine must be taken into account, as shown in (10).

$$\begin{aligned} u_i = u_0 \left[ 1 - \sqrt{\sum _{i=1}^{N} \left( 1 - \dfrac{u_{ij}}{u_0} \right) ^2 } \right] . \end{aligned}$$
(10)

4.3 Objective function

At the present work, the goal is to optimize the location of 50 DTU10MW wind turbines in order to maximize the extracted power of the wind farm, considering the wake effect previously explained. Thus, the OBF adopted is shown in Eq. (11), where \(P_{\text {kW}}\) represents the turbine extracted power and \(u_{n_{T}}\) depicts the wind velocity that reaches the turbine \(n_{T}\) (m/s) considering the wake effect.

$$\begin{aligned} \text {Max} \left( \sum \limits \limits _{n_{T}=1}^{50} P_{\text {kW}}(u_{n_{T}}) \right) . \end{aligned}$$
(11)

4.4 Simulation results

To obtain the statistical results presented in this work, 50 simulations of each technique were performed. In order to perform an unbiased comparison, all methodologies started the iterative process from the same initial solutions. In this study, the algorithm’s stopping criteria adopted were the number of iterations, set to 300, for a population of 1000 individuals.

The optimal layout obtained for this study case was obtained by the PGWO method and is depicted in Fig. 6, where the wind turbines are represented by the red lines. In this graph it can also be observed the impacts in the wind velocity caused by the wake effect. For this layout, the extracted power is of 246.953864 MW, corresponding to a capacity factor of 49.39%.

Fig. 6
figure 6

Optimal layout obtained by the PGWO method

Analyzing Table 4, it can be seen that the author’s proposed grey wolf optimizer was the only method that reached the best OBF for this study (246.953864 MW). The CGWO1 method reached the best STD value. All GWO-based methods reached the same median value of 246.658266 MW. By comparing the PGWO and GWO methods, it can be observed that the OBF value of the simulations was also higher for the proposed methodology, with a difference of 1.72% in terms of STD. The PGWO overreaches the traditional GWO in almost every parameter analyzed, fact explained by the greater dynamics introduced by the nonlinear behavior in the exploration coefficient’s \(\overrightarrow{a}_t\) update, improving the algorithm’s transition between local and global searching. Another factor to be highlighted is the fact that the only methodology that was able to obtain the optimal OBF for this real problem was the novel version proposed by the authors.

Table 4 Statistical results

Given the high complexity of this problem, mainly due to its extremely high number of possible combinations, the assertiveness obtained by the PGWO was of 4%. By comparing the results obtained by the proposed methodology and the well-known GA, BA and PSO algorithms, it can be seen that the PGWO produces better results in all aspects analyzed, such as OBF, median and STD. From the results, however, it can be seen that the BA produced good results, obtaining the third best OBF value in this study.

5 Conclusion

At this work, the authors presented a novel way of updating the exploration coefficient of the GWO meta-heuristic. The proposed methodology was tested and validated at several benchmark functions and was compared to the chaotic and traditional versions of the GWO, as well as well-known methodologies such as the GA, BA and PSO. In general, for all 10 benchmark functions, the PGWO version obtained better results than all other techniques tested in this work, with higher assertiveness rates and less dispersed results. This improvement highlights the importance of determining an well-adjusted exploration coefficient update to perform a more efficient global and local searching procedure over the course of iterations.

Given the high assertiveness and efficiency of the proposed methodology, the novel exploration coefficient was applied to solve a realistic engineering problem, consisting at determining the optimal layout of the wind turbines of an offshore wind farm, which is a highly complex multi-modal optimization problem, of combinatorial nature and nonconvex solution region. The best result obtained in this paper for the WFLO problem was reached by the PGWO methodology. The PGWO overreached all other methodologies investigated (GWO, CGWO1, CGWO2, GA, BA and PSO) in most aspects analyzed (OBF, median and mean values).

For the WFLO problem, each OBF evaluation takes about 0.02 s. Therefore, it is important to highlight the impossibility of using exhaustive search mechanisms to obtain the global optimal value for this problem, since its immense number of possible combinations (\(2^{100}\)) would require a computational time of approximately \(2.5 \times 10^{28}\) days to be completed, reinforcing the benefits of using computational intelligence algorithms such as the proposed grey wolf optimizer to solve problems of such magnitude in an acceptable computational time.