1 Introduction

Most scientific, engineering, and industrial problems can be formulated as a continuous or discrete objective function with some global optima surrounded by a large number of local minima/maxima. Such problems are typically NP-hard from the time-complexity perspective, so no exact algorithm is known to solve them within a polynomial time budget. Consequently, exact methods are impractical, especially when the dimensionality of the problem at hand is high; hence, metaheuristic algorithms, which have significant potential as general problem solvers, have been receiving increasing attention in recent years. Such methods, most of which are inspired by natural phenomena, are capable of finding at least suboptimal solutions within a significantly reduced time budget. Considering different factors, these methods can be divided into different categories. Two important categories introduced in the literature are evolutionary algorithms (EA) and swarm intelligence (SI) algorithms.

The most recognized evolutionary algorithm is the genetic algorithm (GA), introduced by Holland [1]. It is a stochastic search method based on the Darwinian theory of natural evolution of living beings. Natural evolution is the adaptation of all species in a dynamic nature based on environmental feedback. During species evolution by survival of the fittest, each generation transfers better chromosomes to the next by exchanging chromosomal material during breeding. By means of this mechanism, called crossover, and another named mutation, defined as randomly changing some attributes in the chromosomes of offspring, each species converges toward a generation fittest for living in the current environment. Other evolutionary algorithms include differential evolution (DE), introduced by Storn and Price [2], which is similar to GA but with specialized crossover and selection methods; evolution strategy (ES), introduced by Rechenberg [3]; evolutionary programming (EP), introduced by Fogel et al. [4]; the artificial immune algorithm (AIA), introduced by Farmer et al. [5], which works on the basis of the human immune system; and bacterial foraging optimization (BFO), introduced by Passino [6], which models the foraging behavior of bacteria, to name a few.

On the other hand, some well-known swarm intelligence-based algorithms are particle swarm optimization (PSO), introduced by Kennedy and Eberhart [7], which models the foraging behavior of flocks of birds; ant colony optimization (ACO), introduced by Dorigo et al. [8], which models the foraging behavior of real ants; shuffled frog leaping (SFL), introduced by Eusuff and Lansey [9], which works on the principle of communication among frogs; the artificial bee colony (ABC) algorithm, introduced by Karaboga [10], which models the foraging behavior of honey bees toward food sources; the gray wolf optimizer (GWO), introduced by Mirjalili et al. [11], which mimics the leadership hierarchy and hunting mechanism of gray wolves in nature; and the whale optimization algorithm (WOA), introduced by Mirjalili et al. [12], which mimics the hunting mechanism of humpback whales in nature, to mention a few.

Apart from the aforementioned evolutionary and swarm intelligence-based algorithms, other algorithms have been introduced on the basis of different natural phenomena. These include the harmony search (HS) algorithm, introduced by Geem et al. [13], which works on the principle of music improvisation by a musician; the gravitational search algorithm (GSA), introduced by Rashedi et al. [14], which works on the principle of the gravitational force acting between bodies; biogeography-based optimization (BBO), introduced by Simon [15], which works on the principle of immigration and emigration of species from one place to another; the grenade explosion method (GEM), introduced by Ahrari and Atai [16], based on the principle of a grenade explosion; the league championship algorithm (LCA), introduced by Kashan [17]; the charged system search (CSS), introduced by Kaveh and Talatahari [18]; and the teaching–learning-based optimization (TLBO) algorithm, introduced by Rao et al. [19], based on the interacting behavior of a teacher and learners in a classroom.

The cuckoo optimization algorithm (COA) is a novel swarm intelligence-based optimization algorithm first introduced by Rajabioun [20], inspired by the exotic lifestyle of a bird family called cuckoos. The unique egg-laying and breeding characteristics of cuckoos, known as brood parasitism, are the basis of this metaheuristic algorithm. Each solution vector in COA is represented by a “habitat,” which is the current location of either a mature cuckoo in the society or an individual egg. Mature cuckoos lay their eggs in other birds’ nests, mimicking the color, pattern, and size of the hosts’ eggs. If the host birds fail to discriminate and kill the cuckoos’ eggs, those eggs have a good chance to grow into mature cuckoos. By means of this brood-parasitic behavior, together with the immigration of societies (groups) of cuckoos, the population converges to the best environments for breeding and reproduction. These most profitable environments correspond to the global optima of the objective function of the optimization problem at hand. COA has gained increasing popularity in the past few years and has been applied to a variety of applications [21–24].

In this paper, we propose an adaptive version of the cuckoo optimization algorithm, named A-COA, in which three novelties are applied in the egg-laying and migration phases: (1) The egg-laying radius (ELR) is nonlinearly reduced over the iterations for faster convergence, making a better balance between exploration and exploitation, especially in the last iterations. (2) After the egg-laying phase, eggs laid in the same locations (within an ε tolerance) are recognized and all but one of them killed. While in the basic COA the epsilon coefficient is constant over the execution, in A-COA this coefficient is decreased linearly with the iterations. This idea lets the more compact populations of the last iterations aid exploitation and local search, yielding more exact final solutions. (3) In contrast to the basic COA, in which the motion coefficient is constant during the algorithm’s execution, A-COA applies an adaptive motion coefficient to each cuckoo based on its distance to the globally best cuckoo; i.e., the farther a cuckoo is, the higher the coefficient it gets to move forward. On the one hand, this helps farther cuckoos trapped in local minima make a big jump toward the global best and escape stagnation, which also helps the convergence speed; on the other hand, it slows down nearer cuckoos for a better investigation around the globally best cuckoo, improving accuracy as well as avoiding premature convergence.

These modifications make the proposed A-COA more efficient, with faster convergence than the basic COA. To demonstrate this, a comprehensive comparison of A-COA against not only the basic COA but also other popular conventional approaches in the literature, such as GA, PSO, ABC, and TLBO, has been conducted on a variety of numerical unimodal and multimodal optimization benchmark functions with different characteristics, and several conclusions are drawn. In addition, a discretized version of A-COA and its application to the multiprocessor task scheduling problem (MTSP), a complex combinatorial optimization problem, are investigated, where the proposed A-COA is very competitive with not only the strongest conventional heuristics, e.g., MCP, ETF, and DLS, but also the basic COA and a newly proposed ACO-based approach.

The rest of the paper is organized as follows. The basic COA is described in the following section. Section 3 introduces the proposed A-COA approach and the philosophies behind the modifications and improvements made. Section 4 explains the implementation details and configurations. Section 5 is devoted to the achieved experimental results and the comparison study. The discretized version of A-COA and its application to the multiprocessor task scheduling problem (MTSP) as a complex combinatorial optimization problem are investigated in Sect. 6, and finally, the paper is concluded in Sect. 7.

2 Cuckoo optimization algorithm (COA)

Figure 1 shows the flowchart of the basic cuckoo optimization algorithm (COA). First, COA produces a randomly generated initial population of Npop solutions, where Npop is the size of the population. In COA, each solution of the given problem is represented by a “habitat vector,” for example Hi = [xi,1, xi,2, …, xi,Nvar]T, which indicates the location of either a mature cuckoo or an individual egg, where each xi,j is an optimization parameter (decision variable) of the solution Hi, Nvar is the total number of optimization parameters (the dimension of each solution vector), and T denotes vector transposition. Thus, the algorithm starts with a candidate habitat matrix of size Npop × Nvar. The desirability of each randomly generated habitat is then calculated using a corresponding fitness function f, so that fi = f(Hi) = f(xi,1, xi,2, …, xi,Nvar) is the desirability (fitness value) of the i-th habitat. For minimization problems, a lower fitness value indicates a better solution.
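As an illustration, the initialization step above can be sketched as follows (a minimal Python sketch; the function and variable names are ours and do not come from the original implementation):

```python
import numpy as np

def init_population(n_pop, n_var, x_min, x_max, f):
    """Randomly generate n_pop habitats in [x_min, x_max]^n_var and evaluate them."""
    habitats = x_min + np.random.rand(n_pop, n_var) * (x_max - x_min)
    # fitness of each habitat row under the given objective f (lower is better)
    fitness = np.apply_along_axis(f, 1, habitats)
    return habitats, fitness
```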

Fig. 1
figure 1

Flowchart of the basic COA [20]

After initialization, the population of habitats (solutions) is subjected to repeated cycles, iter = 1, 2, …, itermax, of the search process. In each iteration, a randomly generated number of eggs (Neggi, i = 1, 2, …, Npop) is first assigned to each cuckoo. In nature, each cuckoo lays between 5 and 20 eggs, which serve as suitable lower and upper bounds on the egg quota per cuckoo in most problems. Another point to consider is that cuckoos generally lay eggs within a limited distance from their habitats, called the egg-laying radius (ELR). In an optimization problem with upper and lower bounds xmax and xmin on the decision variables, each cuckoo has an ELR proportional to the total number of eggs laid by all the cuckoos, the number of the current cuckoo’s eggs, and the difference between xmax and xmin. On this basis, the ELR for each cuckoo to lay its eggs is defined as

$${\text{ELR}}_{i} = \alpha \times \frac{{{\text{Negg}}_{i} }}{{{\text{Total}}\;{\text{number}}\;{\text{of}}\;{\text{eggs}}}} \times \left( {x_{\hbox{max} } - x_{\hbox{min} } } \right)$$
(1)

where Neggi is the number of eggs laid by the i-th cuckoo and α is the egg-laying coefficient, a constant real value intended to control the maximum value of the ELR.
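Equation (1) maps directly to code; the following is an illustrative sketch (function and argument names are ours):

```python
def egg_laying_radius(alpha, n_eggs_i, total_eggs, x_min, x_max):
    """ELR_i per Eq. (1): share of eggs times the search-range width, scaled by alpha."""
    return alpha * (n_eggs_i / total_eggs) * (x_max - x_min)
```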

In this way, each cuckoo lays eggs in some other host birds’ nests within its ELR in a random fashion. Figure 2 shows this kind of random egg laying within the ELR. An interesting point about cuckoo egg laying is that only one egg in a nest has the chance to grow; hence, in the next step, eggs laid in the same locations (within an ε tolerance) are detected and all but one of them are killed. Then, p% of all the eggs (usually 10%) with the worst profit values are thrown away, as these eggs have no chance to grow and contribute to the society. The rest of the eggs grow and are considered mature cuckoos; i.e., their habitats are included in the habitat set to be selected from in the next iterations.

Fig. 2
figure 2

Random egg laying in ELR, central red star is the initial habitat of the cuckoo with five eggs; pink stars are the eggs in new nests [20]

After the young cuckoos mature, they may stay in their own society or immigrate to new and supposedly better habitats for egg laying and reproduction. Hence, we first group the cuckoos into disjoint clusters and select the society with the best profit value as the goal point for the other cuckoos to migrate toward. A k-means clustering method is used, where a k ranging from 1 to 3 seems to be sufficient in the simulations [20]. Then, the mean profit value of each group is calculated. The maximum mean profit determines the goal group, and consequently that group’s best habitat (Hbest) becomes the destination toward which all the cuckoos migrate using (2).

$$H_{i,j}^{\text{new}} = H_{i,j}^{\text{old}} + F \times {\text{rand}}\left( {0,1} \right) \times \left( {H_{{\text{best}},j} - H_{i,j}^{\text{old}} } \right)\quad | \; i = 1, 2, \ldots ,N_{\text{pop}} \;{\text{and}}\;j = 1, 2, \ldots ,N_{\text{var}}$$
(2)

where F is the migration coefficient and rand(0, 1) generates a random number uniformly in [0, 1]. As shown in Fig. 3, it is worth mentioning that the cuckoos do not fly all the way to the destination habitat when they move toward the goal point; they fly only a fraction of the way (λ) and with some deviation (φ). A λ randomly selected in (0, 1] and a φ generated randomly in (−π/6, +π/6) are reported to give good convergence in the basic COA [20]; therefore, the migration coefficient (F) should be tuned accordingly.
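The migration update of (2) can be sketched as a vectorized step over the whole population (an illustrative Python sketch; names are ours):

```python
import numpy as np

def migrate(habitats, h_best, F):
    """Move every habitat a random fraction of the way toward h_best, per Eq. (2)."""
    r = np.random.rand(*habitats.shape)   # independent rand(0,1) per dimension
    return habitats + F * r * (h_best - habitats)
```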

Fig. 3
figure 3

Immigration of cuckoos in groups toward the globally best habitat [20]

3 The adaptive cuckoo optimization algorithm (A-COA)

In this section, the modifications made to improve the COA are described, and an adaptive version named A-COA is proposed which is capable of solving both continuous and combinatorial optimization problems efficiently, with faster convergence. To the best of our knowledge, most of the parameters in the basic COA, for example the ELR coefficient (α), the epsilon coefficient (ε), and the migration coefficient (F), are set heuristically and have the potential to be enhanced. On this basis, our adaptive version applies three novelties in the egg-laying and migration phases, as follows.

3.1 Decreasing the egg-laying radius coefficient nonlinearly

The egg-laying radius coefficient (α) in (1) controls the length of the ELR, which is the maximum distance within which a cuckoo can lay its eggs. A higher α lets cuckoos lay eggs at a farther distance, which suits the exploration of the search space (global search), while a lower α bounds this distance, which suits exploitation and local search. To balance exploration and exploitation, a logical idea, as shown in Fig. 4, is that this parameter should be high in the first iterations to aid exploration (global search) and decrease toward the last iterations to aid exploitation (local search). We argue that this reduction should not be linear; i.e., α should stay near its maximum value in the early iterations and be near its minimum possible value in the last ones. On this basis, αt, the egg-laying radius coefficient for iteration t, is computed in each iteration using (3):

$$\alpha_{t} = \alpha_{\hbox{max} } - \frac{{\alpha_{\hbox{max} } - \alpha_{\hbox{min} } }}{{{\text{iter}}_{\hbox{max} } - {\text{iter}} + 1}}$$
(3)

where αmin and αmax are the minimum and maximum values considered for the egg-laying radius coefficient, respectively. αmin can simply be assumed to be zero, so only αmax needs to be determined experimentally.
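The nonlinear schedule in (3) can be sketched as follows (illustrative Python; names are ours). Note how the value stays near αmax for most of the run and drops sharply only in the last iterations:

```python
def alpha_t(it, iter_max, a_min, a_max):
    """Egg-laying radius coefficient for iteration `it` (1-based), per Eq. (3)."""
    return a_max - (a_max - a_min) / (iter_max - it + 1)
```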

Fig. 4
figure 4

Nonlinearly decreasing egg-laying radius coefficient in A-COA and its effect on searching the problem space

3.2 Managing epsilon coefficient

As mentioned before, after the egg-laying phase, eggs laid in the same locations (within an ε tolerance) should be recognized and all but one of them killed; these eggs would have no chance to grow and contribute to the society. While in the basic COA the epsilon coefficient is constant throughout the execution, in A-COA this coefficient is decreased linearly with the iterations using (4):

$$\varepsilon_{t} = \varepsilon_{\hbox{max} } - {\text{iter}} \times \frac{{\varepsilon_{\hbox{max} } - \varepsilon_{\hbox{min} } }}{{{\text{iter}}_{\hbox{max} } }}$$
(4)

where εmin and εmax are the minimum and maximum values considered for the epsilon coefficient, respectively, and εt is this coefficient for iteration t. εmin and εmax can simply be assumed to be zero and the constant value offered in the basic COA, respectively. As shown in Fig. 5, this idea lets the more compact populations of the last iterations aid exploitation and local search, yielding more exact final solutions.
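The schedule in (4), together with the duplicate-egg pruning it governs, can be sketched as follows (illustrative Python; the names and the use of the Euclidean norm as the "same location" criterion are our assumptions):

```python
import numpy as np

def epsilon_t(it, iter_max, e_min, e_max):
    """Linearly decreasing tolerance for iteration `it`, per Eq. (4)."""
    return e_max - it * (e_max - e_min) / iter_max

def prune_duplicates(eggs, eps):
    """Among eggs laid within eps of one another, keep only the first one."""
    kept = []
    for egg in eggs:
        if all(np.linalg.norm(egg - k) > eps for k in kept):
            kept.append(egg)
    return np.array(kept)
```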

Fig. 5
figure 5

Managing the epsilon coefficient in A-COA: Red lines are used to demonstrate the eviction of eggs in the same locations with an ε tolerance

3.3 Adaptive migration coefficient

The definition of the migration coefficient (F) and its correspondence with the motion ratio (λ) and deviation (φ) is quite obscure in the basic COA. Actually, it is not clear how the author calculates the F used in (2) from the given λ and φ. Also, whether F is constant over all the iterations or should change for every cuckoo, or even for every dimension of the given problem, is a question to be answered. In A-COA, we suggest applying an adaptive motion coefficient to each cuckoo based on its distance to the global best using (5); i.e., the farther a cuckoo is, the higher the coefficient it gets to move forward, and vice versa (Fig. 6):

$$F_{i} = F_{\hbox{min} } + \frac{{f_{i} - f_{\hbox{min} } }}{{f_{\hbox{max} } - f_{\hbox{min} } }} \times \left( {F_{\hbox{max} } - F_{\hbox{min} } } \right)$$
(5)

where fmin and fmax are the minimum and maximum fitness values in the current iteration, respectively, and fi is the fitness value of the cuckoo under consideration. On the one hand, this idea helps farther cuckoos trapped in local minima make a big jump toward the global best and escape stagnation, which also helps the convergence speed; on the other hand, it slows down nearer cuckoos for a better investigation around the global best, improving accuracy as well as avoiding premature convergence.
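Equation (5) can be sketched as follows (illustrative Python; the guard against fmax = fmin, which would otherwise divide by zero, is our addition):

```python
def adaptive_F(f_i, f_min, f_max, F_min, F_max):
    """Per-cuckoo migration coefficient per Eq. (5): worse (farther) fitness -> larger F."""
    if f_max == f_min:          # degenerate case: all cuckoos equally fit
        return F_min
    return F_min + (f_i - f_min) / (f_max - f_min) * (F_max - F_min)
```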

Fig. 6
figure 6

Effect of adaptive migration coefficient in A-COA where the farer cuckoos make bigger jumps toward the global best cuckoo (the red star is the global best cuckoo)

4 Implementation details and configurations

The proposed A-COA was implemented on a desktop computer with an Intel Core i7-3770K processor (3.9 GHz) running Microsoft Windows 7 (x64), using the Microsoft Visual Basic 6.0 programming language. The A-COA’s flowchart and an implementation in pseudocode are shown in Figs. 7 and 8, respectively. A set of 20 numerical unimodal and multimodal benchmark functions with various search space structures was considered to fully evaluate the proposed approach. All of these benchmark functions are minimization problems, described in the following subsection. In addition, a set of well-known stochastic optimization algorithms, covering both evolutionary algorithms such as GA and swarm intelligence-based methods such as PSO, ABC, and TLBO, is considered for a fair judgment of the performance of the proposed A-COA. The full list of configurations used to tune these algorithms is shown in Table 1. Since the overall performance of each algorithm depends strongly on its configuration, we paid close attention to this issue and propose the most adequate configuration for each individual algorithm based on a large number of experiments; this is considered another contribution of this paper (all the configurations are tuned so that the results achieved for each numerical benchmark function are better than, or at least equal to, those obtained with the original or state-of-the-art configurations reported by other authors). Some parameters are identical for all the algorithms: the population size is set to 40 (except for COA and A-COA, where it is set to 20), the number of iterations is set to 1000, and each algorithm is terminated after 1333 × D fitness function evaluations (FFE), where D is the number of dimensions of the benchmark function under experiment. Since D is set to 30 in all the experiments in this paper, the termination criterion is 40,000 FFE for all the algorithms (40,000 FFE is sufficient for all the algorithms to reach their full potential on both the unimodal and multimodal benchmark functions with a dimensionality of 30).

Fig. 7
figure 7

Flowchart of the proposed A-COA

Fig. 8
figure 8

Proposed A-COA in pseudocode

Table 1 Configurations of the algorithms used in comparison study

4.1 Comparison benchmark functions

Tables 2 and 3 list two sets of ten multidimensional unimodal and multimodal numerical benchmark functions, respectively, which are widely used in the literature. By definition, unimodal functions have only a single optimum, while multimodal functions have a number of global and local minima/maxima distributed over the search space and are considerably harder to solve. Moreover, the multidimensionality of these functions enables experiments with different numbers of decision variables, ranging from low dimensions (easy to solve) to very high ones (hard to solve).

Table 2 Multidimensional unimodal numerical benchmark functions
Table 3 Multidimensional multimodal numerical benchmark functions

4.2 Competitor optimization algorithms for comparisons

This subsection introduces the conventional metaheuristic optimization algorithms against which A-COA is compared. For a fair judgment, one evolutionary algorithm, namely the genetic algorithm (GA), and three swarm intelligence-based algorithms, namely particle swarm optimization (PSO), artificial bee colony (ABC), and teaching–learning-based optimization (TLBO), are considered and described as follows:

  • Genetic algorithm (GA) GA was first introduced by Holland [1]. It is a stochastic search method based on the Darwinian theory of natural evolution. The algorithm starts with a set of randomly generated solutions called the initial population. Each member of the population is called a chromosome, which is a solution of the given problem and consists of a string of genes. The number of genes in each chromosome and their acceptable value ranges depend on the problem specification; for example, in function optimization, the number of genes in each chromosome corresponds to the number of optimization variables, and the gene values are bounded by the upper and lower bounds of these variables. The set of chromosomes (population) in each iteration is called a generation. The generation is evaluated by a fitness function to determine the desirability of each individual. Afterward, offspring (the new generation) are created by applying some operators to the current generation: crossover, which selects two chromosomes as parents, combines them, and generates two new offspring, and mutation, which randomly changes the values of some genes in a selected chromosome and creates a new offspring. Then, the best children, and possibly their parents, are chosen by an evolutionary selection operator according to their fitness values, using methods such as ranking, roulette wheel, or tournament selection. These three phases, i.e., manipulation, evaluation, and selection, are repeated until some stopping condition is satisfied; finally, the best chromosome in the last generation is returned as the global best solution.
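The crossover and mutation operators described above can be sketched as follows (illustrative Python; single-point crossover and uniform gene resetting are just one common choice among many, and the names are ours):

```python
import random

def crossover(p1, p2):
    """Single-point crossover: swap tails of two parent chromosomes."""
    cut = random.randint(1, len(p1) - 1)
    return p1[:cut] + p2[cut:], p2[:cut] + p1[cut:]

def mutate(chrom, rate, lo, hi):
    """Reset each gene to a random value in [lo, hi] with probability `rate`."""
    return [random.uniform(lo, hi) if random.random() < rate else g
            for g in chrom]
```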

  • Particle swarm optimization (PSO) PSO was first introduced by Kennedy and Eberhart [7], inspired by the migration and foraging behavior of flocks of birds (called particles) searching for food [7]. In this technique, all the particles search the multidimensional space based on two important characteristics, namely the current position, referred to as the suggested solution \(\left( {x_{i,j} \left( t \right)} \right)\), and the velocity, or rate of change of the particle position \((v_{i,j} \left( t \right))\), using (6) and (7).

    $$x_{i,j} \left( t \right) = x_{i,j} \left( {t - 1} \right) + v_{i,j} \left( t \right)\quad | \; i = 1, 2, \ldots ,N_{\text{pop}} \;{\text{and}}\;j = 1, 2, \ldots ,N_{\text{var}}$$
    (6)
    $$v_{i,j} \left( t \right) = w \times v_{i,j} \left( {t - 1} \right) + \varphi_{1} \left( {x_{i,j}^{p} - x_{i,j} \left( {t - 1} \right)} \right) + \varphi_{2} \left( {x_{j}^{g} - x_{i,j} \left( {t - 1} \right)} \right)$$
    (7)

    where w is an inertia factor to tune the velocity in each iteration, \(x_{i}^{p}\) is the best position visited so far by particle \(x_{i}\), \(x^{g}\) is the global best particle in the population, and \(\varphi_{1} = c_{1} \times {\text{rand}}\left( {0,1} \right)\) and \(\varphi_{2} = c_{2} \times {\text{rand}}\left( {0,1} \right)\) are randomly generated personal and global coefficients, respectively, used for knowledge exploitation in the algorithm (c1 and c2 are set to 2 in most cases). If any particle finds a better path to the food location, it becomes the global best and attracts the other particles to follow its path (global search). On the other hand, each particle exploits its own personal best location as a local search around itself. All particles move gradually toward the obtained solutions, updating their personal bests and the global best. In the end, all particles converge to the same position, which is supposed to be the global best solution of the given problem.
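Equations (6) and (7) can be sketched as a single swarm-wide update (illustrative Python; names are ours):

```python
import numpy as np

def pso_step(x, v, p_best, g_best, w=0.7, c1=2.0, c2=2.0):
    """One PSO velocity/position update for the whole swarm, per Eqs. (6)-(7)."""
    r1 = np.random.rand(*x.shape)   # personal rand(0,1) per dimension
    r2 = np.random.rand(*x.shape)   # global rand(0,1) per dimension
    v_new = w * v + c1 * r1 * (p_best - x) + c2 * r2 * (g_best - x)
    return x + v_new, v_new
```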

  • Artificial bee colony (ABC) The ABC algorithm was first introduced by Karaboga [10] and works based on the foraging behavior of a colony of honeybees [10]. In the ABC algorithm, the colony of artificial bees is divided into three groups, just like their real-world counterparts: employed bees, onlookers, and scouts. A bee waiting in the dance area to choose a food source is called an onlooker, and a bee going to a food source it has previously visited is called an employed bee. A bee carrying out a random search to find as-yet-undiscovered food sources is called a scout. In the ABC algorithm, the first half of the colony consists of employed artificial bees, and the second half constitutes the onlookers. For every food source there is exactly one employed bee; i.e., each employed bee is associated with a certain food source. An employed bee whose food source is exhausted becomes a scout, starting new random flights around the hive. At the initialization stage, a set of food source positions is randomly selected by the employed bees, and their nectar amounts are determined using the given fitness function. Then, the iterative search begins, each cycle of which consists of three steps: (1) Each employed bee, say the i-th, randomly selects another food source, say the k-th, and moves toward it from its associated food source using (8).

    $$v_{i,j} \left( t \right) = x_{i,j} \left( t \right) + w_{1} \;{\text{rand}}\left( { - 1,1} \right)\left( {x_{i,j} \left( t \right) - x_{k,j} \left( t \right)} \right)\quad | \; i = 1, 2, \ldots ,N_{\text{pop}}$$
    (8)

    where j ∈ {1, 2, …, Nvar} is a randomly selected index (an individual dimension) and w1 is the migration coefficient used to tune and control global search in ABC. Then, the nectar amount of the new location (\(v_{i,j} \left( t \right)\)) is measured; if its fitness value is better than that of the previous food source location (\(x_{i,j} \left( t \right)\)), the food source is replaced with the newly discovered location; otherwise, it remains unchanged. (2) Each onlooker bee selects a food source, say the i-th, using roulette wheel selection based on the nectar amounts of the foods. It then randomly selects another food source, say the k-th, and moves from the i-th food source toward it using (8) again, with w2 used instead; w2 is a coefficient used to tune and control local search in ABC. The onlooker bee then measures the nectar amount of the neighborhood (\(v_{i,j} \left( t \right)\)); if the fitness value of the neighborhood is better than that of the current food source location (\(x_{i,j} \left( t \right)\)), the food source is replaced with the neighborhood location; otherwise, it remains unchanged. (3) Each scout bee, say the i-th one (associated with a food source that has not improved for a limited number of iterations, simply known as the limit), is determined and sent to search for potential undiscovered food sources using (9).

    $$x_{i,j} \left( t \right) = x_{\min} + {\text{rand}}\left( {0,1} \right)\left( {x_{\max} - x_{\min} } \right)\quad {\text{for every}}\;j = 1, 2, \ldots ,N_{\text{var}}$$
    (9)

    By means of these simple steps, the bees will converge to the most profitable locations in terms of the given fitness function.
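The employed-bee move of (8) with its greedy selection can be sketched as follows (illustrative Python for a minimization problem; the function and variable names are ours):

```python
import numpy as np

def employed_bee_step(x, i, w1, f):
    """Candidate move for the i-th employed bee along one random dimension (Eq. 8),
    followed by greedy selection between the candidate and the current source."""
    n_pop, n_var = x.shape
    k = np.random.choice([m for m in range(n_pop) if m != i])  # a different source
    j = np.random.randint(n_var)                               # one random dimension
    v = x[i].copy()
    v[j] = x[i, j] + w1 * np.random.uniform(-1, 1) * (x[i, j] - x[k, j])
    # keep the better of the candidate and the current food source
    return v if f(v) < f(x[i]) else x[i].copy()
```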

  • Teaching–learning-based optimization (TLBO) The TLBO algorithm was first introduced by Rao et al. [19] and works based on the interacting behavior of a teacher and learners in a classroom [19]. Teaching–learning is an important motivating process in which an individual tries to learn something from others. The traditional classroom environment is one such process, where students try to learn from a teacher as well as to share what they have learned with each other to improve their knowledge. Based on this interacting process, TLBO simulates the traditional teaching–learning phenomenon in a classroom. TLBO is a population-based algorithm in which a group of students (i.e., solution vectors) is considered, and the different subjects offered to the learners are analogous to the different decision variables of the given optimization problem. The algorithm simulates two fundamental modes of learning: (1) learning from the teacher, known as the teaching phase (global search), and (2) interacting with the other learners, known as the learning phase (local search). In each iteration, the best solution in the entire population is considered the teacher for the teaching phase, after which the learners share their knowledge with each other in the learning phase. In this way, the whole population converges to the same position, supposed to be the global best solution of the problem under consideration.
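The teaching phase described above can be sketched as follows (illustrative Python for minimization; the teaching-factor-and-mean update and the per-learner greedy selection follow the standard TLBO description, but the names are ours):

```python
import numpy as np

def teaching_phase(pop, fitness, f):
    """TLBO teaching phase: pull every learner toward the teacher (best solution),
    keeping a moved learner only if its fitness improves."""
    teacher = pop[np.argmin(fitness)]
    mean = pop.mean(axis=0)
    t_f = np.random.randint(1, 3)           # teaching factor, 1 or 2
    r = np.random.rand(*pop.shape)
    new_pop = pop + r * (teacher - t_f * mean)
    # greedy selection per learner
    new_fit = np.apply_along_axis(f, 1, new_pop)
    improved = new_fit < fitness
    pop[improved] = new_pop[improved]
    fitness[improved] = new_fit[improved]
    return pop, fitness
```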

5 The achieved results and comparison study

Table 4 shows the results obtained by each optimization algorithm on the unimodal benchmark functions listed in Table 2 with a dimension of 30 decision variables. It is worth mentioning that each result reported in this paper is the mean and standard deviation over 30 independent runs of the algorithm in consideration. For each function, the bolded number is the best result achieved among the algorithms. As can be seen, TLBO outperforms the others by far in this set of experiments. The rank-sum-based ranking of the algorithms drawn from this set of experiments is {TLBO, PSO, ABC, GA, A-COA, COA}, indicating that TLBO is the best and COA the worst from the performance point of view.

Table 4 Results achieved by the algorithms on unimodal functions (dimension = 30)

Nevertheless, as stated in [25], we should not rely exclusively on these results, because most of the benchmark functions have a global minimum at [0, 0, …, 0]T, which can be exploited as background knowledge by some algorithms to converge promptly to this point. To address this issue, Liang et al. [26] suggested utilizing randomly shift-rotated versions of these benchmark functions. On this basis, another set of experiments was conducted. Table 5 shows the results obtained by each optimization algorithm on the shift-rotated versions of the previous unimodal functions. Surprisingly, TLBO not only loses its efficiency relative to the other methods but is also unable to find the best solution for any of the functions! The resulting rank-sum-based ranking for this set of experiments is {ABC, PSO = TLBO, GA, A-COA, COA}, suggesting the superiority of ABC and the weakness of COA in terms of performance. Again, the proposed A-COA ranks better than the basic COA.

Table 5 Results achieved by the algorithms on the randomly shift-rotated unimodal functions (dimension = 30)

On the other hand, most real engineering optimization problems have a multimodal nature; i.e., the objective function in question may have a large number of suboptimal local minima as well as a few identical global ones distributed over the search space. Therefore, it is very important for an algorithm to demonstrate high potential for solving such problems efficiently. Table 6 shows the results achieved by each optimization algorithm on the multimodal benchmark functions listed in Table 3 with a dimension of 30 decision variables. As can be seen, ABC outperforms the others in this set of experiments. The rank-sum-based ranking drawn from this set of experiments is {ABC, TLBO, GA, PSO, A-COA, COA}, indicating the superiority of ABC from the performance perspective.

Table 6 Results achieved by the algorithms on multimodal functions (dimension = 30)

Again, another set of experiments was conducted using the aforementioned shift-rotated multimodal benchmark functions. Table 7 shows the results obtained by each optimization algorithm in these experiments. Once again, we observe a reordering among the algorithms: the resulting rank-sum-based ranking is {ABC, GA, PSO, TLBO, A-COA, COA}; i.e., ABC has the best performance and COA the worst. Still, the proposed A-COA ranks better than the basic COA, which confirms the superiority of the proposed A-COA over the basic COA from the performance point of view.

Table 7 Results achieved by the algorithms on randomly shift-rotated multimodal functions (dimension = 30)

5.1 A-COA versus the basic COA

Figures 9 and 10 show the improvement diagrams (in percent) of the proposed A-COA compared to the basic COA for the original and the shift-rotated unimodal and multimodal benchmark functions, respectively. For the original functions, the largest improvements are on the Schwefel N1.2 and Schwefel functions (one unimodal, the other multimodal) with 80% and 100% improvement, respectively, while A-COA performs worse on the Schwefel N2.21, Rastrigin, Ackley, NCRastrigin, and Penalized functions by − 31%, − 2%, − 7%, − 24%, and − 39%, respectively. On average, a 37% improvement is observed for the original unimodal functions and a 13% improvement for the original multimodal ones.

Fig. 9

Improvement diagram of the proposed A-COA to the basic COA for all the original unimodal and multimodal benchmark functions

Fig. 10

Improvement diagram of the proposed A-COA to the basic COA for all the shift-rotated unimodal and multimodal benchmark functions

On the other hand, the largest improvements for the shift-rotated functions are on the Schwefel N1.2, Schwefel, and Penalized2 functions (the first unimodal, the latter two multimodal) with 70%, 89%, and 60% improvement, respectively, while A-COA performs worse on the Schwefel N2.21, Elliptic, and NCRastrigin functions by − 19%, − 6%, and − 8%, respectively. On average, there is a 29% improvement for the shift-rotated unimodal functions and a 23% improvement for the shift-rotated multimodal ones. Overall, the proposed A-COA achieves a 25.85% improvement over the basic COA across all 20 original and shift-rotated unimodal and multimodal benchmark functions used in the comparison study, which is a significant improvement.

5.2 Statistical tests

There are many metrics for comparing the performance of different algorithms, such as best solution, worst solution, sample mean over a predefined number of independent runs, sample deviation, and speedup, but none of them can determine the superiority of an individual algorithm definitively; statistical tests are needed to reach conclusions with a given degree of confidence. The Wilcoxon rank-sum test [27] is a nonparametric statistical test used to compare two independent samples and assess whether the true means of their ranks differ. In effect, this test enables us to determine whether two independent samples are drawn from the same population with an identical distribution; if so, neither has genuinely better performance; otherwise, one of them may outperform the other based on the achieved results. Here, the test is conducted at the significance level α = 0.05 (95% confidence interval). The h values in Tables 8 and 9 show the results of the significance comparisons over all 20 original and shift-rotated benchmark functions, respectively, where h = 1, h = 0, and h = − 1 indicate that A-COA has statistically better, equal, or worse performance than the basic COA with 95% confidence, respectively.

Table 8 Results of the Wilcoxon rank-sum test to compare the A-COA versus the basic COA for all the 20 original benchmark functions with 95% of confidence
Table 9 Results of the Wilcoxon rank-sum test to compare the A-COA versus the basic COA for all the 20 shift-rotated benchmark functions with 95% of confidence
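As an illustration of how such h values can be produced, the sketch below uses SciPy's `ranksums`. The function name `compare_runs` and the mean-based direction rule are our assumptions about a reasonable implementation; the paper does not show its test code.

```python
import numpy as np
from scipy.stats import ranksums

def compare_runs(a_coa_results, coa_results, alpha=0.05):
    """Return h = 1 if A-COA is statistically better (lower fitness,
    since all benchmarks are minimization), -1 if worse, 0 if the
    rank-sum test finds no significant difference at level alpha."""
    stat, p = ranksums(a_coa_results, coa_results)
    if p >= alpha:
        return 0
    return 1 if np.mean(a_coa_results) < np.mean(coa_results) else -1
```

Each call compares the 30 final fitness values of A-COA against those of COA on one benchmark function, mirroring the h entries in Tables 8 and 9.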

5.3 Convergence speed

To study the convergence speed of the proposed A-COA and the basic COA, two unimodal (Sphere and Rosenbrock) and two multimodal (Ackley and Schwefel) benchmark functions from Tables 2 and 3 were considered. The settings and configurations are exactly the same as in the previous experiments; for example, the dimension is 30 and the maximum number of FFEs is 40,000. The only case in which an algorithm terminates before reaching the maximum number of FFEs is when the whole population is trapped at the same point, so that no further investigation is possible. Figures 11 and 12 show the convergence diagrams, i.e., the fitness value of the best cuckoo in the population (in logarithmic scale) against the algorithm iterations. The plotted convergence graphs for both the unimodal and multimodal functions demonstrate the superiority of the proposed A-COA over the basic COA in terms of convergence rate, and in our tests this held for most of the other benchmark functions as well.

Fig. 11

Convergence diagram of COA and A-COA algorithms for two unimodal functions with the dimension of 30 (left: sphere function and right: Rosenbrock function)

Fig. 12

Convergence diagram of COA and A-COA algorithms for two multimodal functions with the dimension of 30 (left: Ackley function and right: Schwefel function)

6 Multiprocessor task graph scheduling using the proposed approach

In this section, we investigate the application and performance of the proposed A-COA on the multiprocessor task scheduling problem (MTSP). Task graph scheduling plays an essential role in different computational environments such as multiprocessor systems, in which a number of processing elements are coupled and operate as a single high-performance supercomputer [28, 29]. In such systems, each application program is decomposed into smaller and possibly dependent subprograms named tasks. Some tasks need the data generated by other tasks, so there are precedence constraints among them. On this basis, each application can be modeled as a directed acyclic graph (DAG), the so-called task graph, in which nodes represent tasks and edges represent the precedence constraints among them. In static scheduling, all the requisite parameters, such as the tasks' execution times, the communication costs among them, and the precedence constraints, are determined at compile time. The main objective is to derive an appropriate topological order of tasks from the given task graph and assign them to a number of computational elements, respecting the tasks' precedence constraints, such that criteria like the overall finish time of the program or the total energy consumption of the processors are minimized [30,31,32]. This is an NP-hard problem from the time complexity perspective, so exact methods may not be able to respond within a predefined and restricted time budget, especially for large-scale instances [33].

Most conventional as well as state-of-the-art approaches reported in the literature for the task scheduling problem in such environments are based on the list scheduling method. These methods can be divided into two important categories: (1) heuristic approaches, for example HLFET [34], ISH [35], CLANS [36], LAST [37], ETF [38], DLS [39], and MCP [40], which exploit different task priority measures to guide the search process, and (2) metaheuristics, for example ACO-based [41,42,43,44,45] and CLA-based [46, 47] ones, which rely on the foraging capability of these nature-inspired approaches. The philosophy behind the list scheduling technique is that a complete set of ready tasks is gathered into a ready list in each iteration; the ready tasks are those without any parents or whose parents have all been scheduled. Then, in each iteration, the highest-priority task in the ready list is chosen and allocated to the computational unit (processor) allowing the earliest start time (EST), until all the tasks in the task graph are scheduled. Kwok and Ahmad [33], Boveiri [48], Buyya [28], and Cao et al. [30] provide some of the most complete reviews of these systems and the corresponding task scheduling problem. Real-world applications requiring such supercomputer infrastructure include biomedicine and bioinformatics [49, 50], urban surveillance and smart cities [51], quantum computing [52], and big-data processing [53], to mention a few.

6.1 Problem formulation

A directed acyclic graph G = {N, E, W, C} called a task graph is considered as a model of a parallel application program executed on a multiprocessor computing environment, where N = {n1, n2, …, nn} is the set of nodes, E = {(ni, nj) | ni, nj ∈ N} is the set of edges, W = {w1, w2, …, wn} is the set of node weights, C = {c(ni, nj) | (ni, nj) ∈ E} is the set of edge weights, and n is the number of nodes in the task graph.

Figure 13 shows the task graph of a real parallel program comprising nine tasks. In such a graph, nodes represent tasks (so we use the two terms interchangeably throughout the paper), each of which is executed exactly once and on exactly one processor, and edges represent the precedence constraints among them. An edge (ni, nj) ∈ E indicates that task ni must be completed before task nj starts; in this case, ni is called a parent of nj, and nj a child of ni. Nodes without parents are called "entry nodes" and nodes without children "exit nodes." Each node weight wi is the execution time required by task ni, and each edge weight c(ni, nj) is the time required for data transmission from task ni to task nj, known as the communication cost/delay. If tasks ni and nj are executed on the same processor, the communication cost between them is zero. In static scheduling, all the decision parameters, i.e., the execution times of the tasks, the precedence constraints among them, and the communication costs, are available beforehand, generated during the program's compilation stage, so the scheduling can be deterministic. Tasks should be mapped onto the given set of m processors, P = {p1, p2, …, pm}, according to their precedence, such that the overall finish time (makespan) of the given program is minimized.

Fig. 13

Task graph of a program with nine tasks inside [54]

Most scheduling algorithms across these categories are based on the same strategy, called the list scheduling technique. The underlying philosophy of list scheduling is to make a sequence of nodes as a list by assigning them priorities [55], and then repeatedly remove the highest-priority node from the list and map it to the computational unit (processor) that allows the EST, until all the nodes in the given task graph are scheduled.

If all the parents of a task ni have been executed on processor ps, EST(ni, ps) is simply Avail(ps), that is, the earliest time at which ps becomes available to execute the next task. Otherwise, the earliest start time of task ni on processor ps is computed using Eq. (10).

$$\text{EST}(n_{i}, p_{s}) = \begin{cases} 0, & \text{if } n_{i} \text{ is an entry node} \\ \displaystyle\max_{n_{k} \in \text{Parents}(n_{i})} \begin{cases} \text{AFT}(n_{k}), & \text{if Processor}(n_{k}) = p_{s} \\ \text{AFT}(n_{k}) + c(n_{k}, n_{i}), & \text{otherwise,} \end{cases} \end{cases}$$
(10)

where AFT(nk) = AST(nk) + wk is the actual finish time of the task nk, Parents(ni) is the set of all the parents of ni, Processor(nk) is a function that returns the index of processor in which nk was run, and AST(nk) is the actual start time of task nk, which can be computed using Eq. (11).

$$\text{AST}(n_{k}) = \min_{x = 1}^{m} \left( \max\left( \text{Avail}(p_{x}), \text{EST}(n_{k}, p_{x}) \right) \right)$$
(11)

Finally, the total finish time of the given parallel program is calculated using Eq. (12):

$$\text{makespan} = \max_{i = 1}^{n} \left( \text{AFT}(n_{i}) \right)$$
(12)

For a given task graph with n tasks inside using its adjacency matrix, an efficient implementation of the EST method for mapping all the tasks over a given m identical processors’ machine has a time complexity belonging to O(mn2) [56].
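The EST mapping described by Eqs. (10)–(12) can be sketched as follows. The data layout (dicts for node weights and communication costs, tasks given in an already valid topological order) is an illustrative choice, not the paper's implementation:

```python
def est_schedule(order, weights, comm, num_procs):
    """Map tasks (in a valid topological 'order') onto processors using
    the earliest-start-time (EST) rule; return the resulting makespan.

    weights: dict task -> execution time
    comm:    dict (parent, child) -> communication cost (defines the edges)
    """
    avail = [0.0] * num_procs           # earliest free time of each processor
    aft, proc_of = {}, {}               # actual finish time / processor per task
    parents = {t: [] for t in order}
    for (u, v), c in comm.items():
        parents[v].append((u, c))
    for t in order:
        best_p, best_start = 0, float('inf')
        for p in range(num_procs):
            # data from a parent on the same processor arrives at no cost (Eq. 10)
            ready = max((aft[u] + (0.0 if proc_of[u] == p else c)
                         for u, c in parents[t]), default=0.0)
            start = max(avail[p], ready)        # Eq. (11): EST on processor p
            if start < best_start:
                best_p, best_start = p, start
        proc_of[t] = best_p
        aft[t] = best_start + weights[t]        # AFT = AST + w
        avail[best_p] = aft[t]
    return max(aft.values())                    # Eq. (12): makespan
```

Scanning all m processors for each of the n tasks, with up to n parents each, gives the O(mn²) behavior cited above.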

6.2 Discretization and application of A-COA to MTS

The A-COA is naturally designed for continuous optimization, while the MTSP is intrinsically a discrete combinatorial problem. Therefore, to resolve the issue, we propose the following discretized version of the A-COA. In the proposed approach, each solution, called a "scheduling," is equivalent to a "habitat" in the COA. A habitat or scheduling is simply a list of length n, where n is the number of tasks in the given task graph. Each element/cell of this list is associated with one task of the input task graph; for example, H[1] is associated with task n1, H[2] with task n2, and so on. The value of each cell is the selection priority of that task, a real number in the range [0, 1]; the higher the priority, the sooner the task is selected for scheduling. Figure 14 shows a typical habitat and its corresponding task order, as well as the final task mapping on two processors shown as a Gantt chart, where the input task graph is the one in Fig. 13. On this basis, the COA's job is to adjust the cell values (priorities) so that the optimum scheduling results. To this end, first, all the input parameters are set (Algorithm 1), and a set of habitats is created as the cuckoos' population (each task priority, i.e., cell value, is initialized randomly in the range [0, 1]). Then, each habitat, say habitat[i], is evaluated, and the resulting makespan is computed using Algorithm 2 as the habitat's fitness value. To do this, first the corresponding task order is extracted from the tasks' priorities in the habitat. Second, this task order is distributed over the processors using the EST method, and the overall finish time (makespan) is calculated using Eqs. (10), (11), and (12); this value serves as the fitness/objective value, or desirability, of the habitat.
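The priority-to-order extraction step can be sketched as a naive O(n²) decoding; the function and parameter names here are illustrative, not taken from the paper:

```python
def habitat_to_order(priorities, parents):
    """Decode a habitat (priority vector) into a precedence-respecting
    task order: repeatedly pick the ready task with the highest priority.

    priorities: list of floats in [0, 1], one per task (0-indexed)
    parents:    dict task -> set of parent tasks
    """
    n = len(priorities)
    done, order = set(), []
    while len(order) < n:
        # a task is ready when all of its parents are already scheduled
        ready = [t for t in range(n)
                 if t not in done and parents.get(t, set()) <= done]
        best = max(ready, key=lambda t: priorities[t])
        order.append(best)
        done.add(best)
    return order
```

Because only ready tasks compete on priority, every decoded order is a valid topological order regardless of the priority values the cuckoos evolve.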

Fig. 14

A typical habitat and its corresponding task order as well as the final EST task mapping on two processors demonstrated by a Gantt chart


Then, the main loop starts (Algorithm 3); in each iteration, all of the habitats are selected one by one and subjected to the following operations:

  1. The number of eggs laid by this cuckoo (associated with this habitat) is selected randomly in the range [Neggmin, Neggmax]; this range should be chosen experimentally, as a trade-off between time budget and performance. Indeed, more eggs mean more fitness function evaluations (FFEs), i.e., more time is consumed, but better overall performance can be achieved.

  2. The egg-laying radius (ELR) coefficient (α) for this iteration is computed from the selected αmin and αmax using Eq. (3).

  3. The ELR is calculated for each cuckoo based on her number of eggs using Eq. (1).

  4. Each cuckoo lays her eggs inside her calculated ELR.


Continuing the current iteration, all the eggs at the same location (within an ε tolerance) are recognized and killed, except one, without any evaluation (just like in the basic COA described in Sect. 2). The value of the ε tolerance should be determined experimentally, or the value used in the basic COA paper can be adopted [20]. Next, the surviving eggs and the mutated cuckoos are evaluated like the randomly generated habitats, using Algorithm 2. Then, all the cuckoos and surviving eggs are sorted in ascending order of their makespans (fitness values), and the Npop best habitats are selected as the next population. This population is then decomposed into several clusters, and the best cluster, and then the best habitat within it, are selected for the migration phase. In the migration phase, all the cuckoos (habitats) move from their locations toward the globally best cuckoo using Eq. (2); this changes the task priorities of each habitat, which leads to different scheduling orders and makespans.
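The ε-tolerance elimination of coincident eggs might be sketched as follows. We assume a max-coordinate (Chebyshev) distance for "same location"; the actual distance measure used in the paper is not specified here:

```python
def kill_duplicate_eggs(eggs, eps):
    """Keep at most one egg per ε-neighborhood so that duplicates are
    discarded before any fitness evaluation is spent on them.

    eggs: list of tuples (priority vectors); eps: coincidence tolerance
    """
    survivors = []
    for egg in eggs:
        # keep the egg only if it is farther than eps from every survivor
        if all(max(abs(a - b) for a, b in zip(egg, s)) > eps
               for s in survivors):
            survivors.append(egg)
    return survivors
```

Skipping the evaluation of duplicates saves FFEs, which matters here because the run is capped by a fixed FFE budget.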

By means of these operations, the whole population will move toward the best solution and finally converge to an optimal/suboptimal solution. This solution, which is the best one found so far, is the final selected solution of the proposed A-COA approach in the current run.

6.3 The comparison dataset, metrics, and configurations

The proposed approach was implemented on the same desktop computer and with the same configuration as the previous experiments, with one exception: although the total number of iterations was set to 1000, the algorithm was terminated after n × 100 fitness function evaluations (FFEs), where n is the number of tasks in the given task graph. In addition, if the whole population reaches the same point, i.e., the standard deviation (SD) of all the individuals in the population equals zero, the algorithm is considered converged and hence is terminated, even if there are unused iterations or FFEs. The other details and configurations of the proposed approach and its counterparts are summarized in Table 10. The proposed approach is evaluated not only against the strongest heuristic approaches, i.e., MCP [40], ETF [38], and DLS [39], but also against metaheuristics, namely ant colony optimization (ACO) [42] and the basic COA.

Table 10 Configurations of the proposed approach and other competitors

6.3.1 The utilized dataset

For a fair judgment, a set of 125 random task graphs was used for the comparison study and evaluations. These random task graphs have different shapes determined by the following three parameters:

  • Size (n) The number of nodes in the given task graph. Five different sizes were considered: {32, 64, 128, 256, and 512}.

  • Communication-to-computation ratio (CCR) Indicates whether a graph is computation- or communication-intensive. The execution time of each node was drawn from a uniform distribution whose mean equals the specified average computation cost of 50 time units. The weight of each edge was likewise drawn from a uniform distribution with mean equal to the average computation cost × CCR. Five different CCR values were considered, {0.1, 0.5, 1.0, 5.0, and 10.0}: a CCR of 0.1 makes the generated task graphs computation-intensive, while 10.0 makes them communication-intensive.

  • Parallelism The average number of children per node in the task graph. Increasing this parameter makes the generated graphs more densely connected, while lower parallelism makes them more dispersed. Five different values of parallelism were selected: {3, 5, 10, 15, and 20}.

6.3.2 Comparison metrics

It should be noted that, because of the varying structural parameters, the makespans achieved on the aforementioned random graphs span a wide range. Therefore, the normalized schedule length (NSL) is used. It is computed for each input graph by dividing the achieved makespan (schedule length) by a lower bound. Although different lower bounds can be assumed, we chose the sum of the weights of the nodes on the computationally critical path of the graph, as in Eq. (13):

$${\text{NSL}} = \frac{\text{makespan}}{{\sum\nolimits_{{n_{i} \in {\text{CP}}}} {w_{i} } }},$$
(13)

where CP is the set of nodes on the longest computational path in the given task graph (a computational path is a regular path excluding the communication costs along it). It is worth mentioning that such lower bounds are theoretical and may not be attainable; i.e., the optimal schedule length is often somewhat larger than this bound.

Another extensively used metric in empirical studies of task scheduling is "speedup": the ratio of the cost of assigning all the tasks to a single processor (i.e., the sum of all the tasks' execution times) to the schedule length (makespan) obtained with multiple processors. In other words,

$${\text{Speedup}} = \frac{{\sum\nolimits_{i = 1}^{n} {w_{i} } }}{\text{makespan}}.$$
(14)

Obviously, a lower NSL indicates better performance, as does a higher speedup.
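Both metrics can be computed directly from a finished schedule; this small helper (names illustrative) follows the two formulas above:

```python
def nsl_and_speedup(makespan, weights, critical_path):
    """Normalized schedule length and speedup for one scheduled task graph.

    weights:       dict task -> execution time
    critical_path: tasks on the longest computation-only path
                   (communication costs excluded, per the NSL definition)
    """
    # NSL: makespan over the critical-path lower bound
    nsl = makespan / sum(weights[t] for t in critical_path)
    # speedup: single-processor cost over the parallel makespan
    speedup = sum(weights.values()) / makespan
    return nsl, speedup
```

An NSL of 1.0 means the schedule meets the theoretical lower bound, which, as noted above, is often unattainable in practice.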

6.3.3 Experimental results and comparisons

Figures 15 and 16 show the average NSLs and speedups over all the experiments conducted on the entire set of 125 random task graphs, respectively. Different numbers of processors, ranging from 2 to 64, were considered; using more processors had no noticeable effect, so higher counts were omitted. It is worth mentioning that each individual experiment was conducted ten times, and the presented results are the averages of these independent runs. As a rule, the final performance ranking is {A-COA, ACO, ETF ≈ COA, MCP ≈ DLS}; that is, A-COA is the best (especially on small-scale inputs), ACO is slightly worse, ETF and the basic COA are moderate and almost identical (their rankings alternate across experiments), and finally, MCP and DLS are the worst methods from the performance point of view. Figure 17 shows the improvement achieved by the proposed A-COA compared to the basic COA, the ACO-based approach, and the conventional DLS, ETF, and MCP schedulers, where, on average, A-COA achieved 2.70%, 0.94%, 4.35%, 2.84%, and 3.05% better performance than these methods, respectively.

Fig. 15

Average NSL of all the experiments conducted on entire 125 random task graphs

Fig. 16

Average speedup of all the experiments conducted on entire 125 random task graphs

Fig. 17

Improvement diagram of the proposed A-COA compared to the basic COA, ACO-based approach, and the other conventional MCP, ETF, and DLS schedulers

7 Conclusion

In this paper, an adaptive version of the cuckoo optimization algorithm, named A-COA, with three novelties in the egg-laying and migration phases was proposed. (1) The egg-laying radius (ELR) is reduced nonlinearly over the iterations for faster convergence, inducing a better balance between exploration and exploitation, especially in the last iterations. (2) After the egg-laying phase, eggs laid at the same location (within an ε tolerance) are recognized and killed, except one of them. While in the basic COA the epsilon coefficient is constant throughout the execution, in the A-COA this coefficient decreases linearly with the iterations, allowing more compact populations in the last iterations that aid exploitation and local search in refining the final solutions. (3) In contrast to the basic COA, in which the motion coefficient is constant during execution, the A-COA applies an adaptive motion coefficient to each cuckoo based on its distance to the global best; i.e., the farther the cuckoo, the higher its coefficient. On the one hand, this helps farther cuckoos trapped in local minima escape stagnation with a big jump toward the global best, improving convergence speed; on the other hand, it slows down the nearer cuckoos for a finer investigation around the global best, improving accuracy and avoiding premature convergence.

A set of 20 numerical unimodal and multimodal benchmark functions with various search space structures was considered to evaluate the proposed A-COA; all of them are minimization problems. Since most of these functions have a global minimum at [0, 0, …, 0]T, which some algorithms can exploit as background knowledge, we also used randomly shifted and rotated versions of them. In addition, a diverse set of stochastic optimization algorithms, i.e., not only evolutionary algorithms such as GA but also swarm intelligence-based ones such as PSO, ABC, and TLBO, was considered for a fair judgment of the performance of the proposed A-COA. Several sets of experiments were conducted, yielding different results and conclusions. Generally speaking, the proposed A-COA performed 37% and 13% better than the basic COA on the original unimodal and multimodal benchmark functions, respectively, and 29% and 23% better on the shift-rotated unimodal and multimodal ones, for a total improvement of 25.85%, which is significant for these algorithms. Moreover, the Wilcoxon rank-sum test conducted on the achieved results certified the superiority of A-COA on most of the functions with 95% confidence. Meanwhile, the convergence diagrams obtained in all the experiments revealed the superiority of the proposed A-COA over the basic COA in terms of convergence rate.

Finally, we investigated the application and performance of the proposed A-COA on the multiprocessor task scheduling problem (MTSP). To this end, we first introduced a discretized version of A-COA adapted to the MTSP. A set of 125 randomly generated task graphs with different shape parameters, i.e., size, communication-to-computation ratio (CCR), and connectivity (parallelism), was used to evaluate the performance of the proposed approach. The competitors included both heuristic approaches, namely MCP, ETF, and DLS, and metaheuristics, namely ACO and the basic COA. Based on a comprehensive set of experiments with between 2 and 64 processors, the proposed A-COA-based approach was the clear winner with excellent performance.