1 Introduction

Nature-inspired and evolutionary models have been exploited for solving a variety of hard-to-solve computational problems. One of the central benefits of these models is that they facilitate parallel processing, i.e. the simultaneous use of more than one processor core to execute different parts of the same program. The key point in designing parallel processing for evolutionary computation is twofold. On the one hand, a program needs to be divided so that separate cores can work together without interfering with each other. On the other hand, the cores should communicate the best results they produce so that each core can use the experience of the other cores in evolving solutions.

With respect to this twofold consideration, this paper presents an evolutionary metaheuristic, in which different semi-independent threads, each running on a separate CPU core, work separately and interact occasionally with one another to inform each other about the best overall solution obtained. It is this best overall solution that becomes the focus of the extra search and its vicinity is searched by all the threads. According to the classification made by [24], the proposed parallel strategy can be considered as a co-operative multi-search.

A thread operates in three separate layers including (i) a heuristic construction module to generate initial solutions, (ii) a genetic algorithm module to combine high quality solutions, and (iii) an enhancing module to further improve the solutions. The enhancing module not only improves the best solution obtained by the corresponding thread, but enhances the best overall solution obtained by all other threads as well. The presented procedure, called the Parallel 3-layer Hybrid (P3H), considers each thread as the combination of six synergetic components, namely (i) an initial solution construction method, (ii) a crossover operation, (iii) a mutation operation, (iv) a restrictedTabu component, (v) a large neighborhood scheme, and (vi) a perturb mechanism.

To explore the search space independently and effectively, a thread uses its components in a randomized manner. The benefit of such randomization is twofold. First, it circumvents the possible search redundancy which could occur because all the threads are aimed at improving the same high quality solution. Second, even for a single thread, the randomization prevents it from doing the same set of operations if a solution has been encountered more than once.

The key contribution of this study is to show the effectiveness of this metaheuristic, through the synergetic integration of these six modules, on three famous and hard-to-solve optimization problems, namely the job shop scheduling, the permutation flowshop scheduling, and the quadratic assignment problems. The rest of this paper is organized as follows. Section 2 discusses the related work for each of the problems. Section 3 presents the formulation of these three problems. The stepwise description of the P3H is outlined in Section 4, and the results of computational experiments are presented in Section 5. Section 5 also discusses the setting of the P3H parameters for each of the three problems. Concluding remarks and some suggestions for future work are given in Section 6.

2 Related work

The three problems to which the P3H is applied are the Permutation Flowshop Scheduling Problem (PFSP), the Job Shop Scheduling Problem (JSP), and the Quadratic Assignment Problem (QAP). While these problem formulations are explained in detail in the next section, the related work is outlined here. This section is divided into two subsections. The first describes work related to the components of the P3H, and the second presents work related to parallel methods for the PFSP, the JSP, and the QAP.

2.1 Related work to the components of the presented method

The most effective strategies for hard-to-solve combinatorial optimization problems are categorized into the three groups of (i) constructive methods, (ii) local search techniques, and (iii) population-based methods. Constructive methods build a solution by sequentially deciding the values of solution components, i.e. decision variables. Imitating the foraging behavior of ant colonies is the key idea behind a popular constructive method called Ant Colony Optimization (ACO) [27]. The Rollout algorithm [15] is another popular constructive method capable of producing high quality solutions.

Unlike constructive methods, local searches take a complete solution to a problem and, by checking its immediate neighbors, which are similar solutions with one or two minor differences, aim to find an improved solution [71]. Getting stuck in local optima is the main problem with local search techniques; such a situation is depicted in Fig. 1.
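To make the neighborhood idea concrete, the following Python sketch (an illustrative toy, not code from this paper; the cost function is a made-up example) runs a swap-neighborhood local search that accepts improving neighbors until none remains, i.e. until a local optimum is reached:

```python
import itertools

def local_search(perm, cost):
    """Local search over the swap neighborhood: keep replacing the
    current permutation with an improving swap-neighbor until no
    neighbor improves the cost (a local optimum)."""
    current, current_cost = list(perm), cost(perm)
    improved = True
    while improved:
        improved = False
        for i, j in itertools.combinations(range(len(current)), 2):
            neighbor = current[:]
            neighbor[i], neighbor[j] = neighbor[j], neighbor[i]
            c = cost(neighbor)
            if c < current_cost:
                current, current_cost = neighbor, c
                improved = True
    return current, current_cost

# Toy cost: total displacement of each element from its sorted position.
cost = lambda p: sum(abs(v - i) for i, v in enumerate(p))
print(local_search([3, 0, 2, 1], cost))  # → ([0, 1, 2, 3], 0)
```

On this toy instance the local optimum happens to be the global one; Fig. 1 illustrates the general case where it need not be.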

Fig. 1 A situation in which a local optimal solution has been surrounded by many other local optimal solutions

Local optimality has been addressed in different ways. Whereas in Iterated Local Search (ILS) the starting solution of the local search is derived by perturbing the previous local optimum found [44], in Variable Neighborhood Search (VNS) a set of neighborhoods of different orders is employed [35]. Tabu Search (TS), on the other hand, explicitly exploits short-term and long-term memory to guide the search [33], with short-term memory being used to keep track of recently visited solutions and long-term memory to monitor the search progress.

Genetic Algorithms [36], Particle Swarm Optimization [22], and scatter search [33] are prime examples of population-based methods. Population-based methods are composed of five main components, namely (i) an encoding/decoding scheme that maps every solution (phenotype) to a chromosome (genotype), (ii) a fitness function that assigns a goodness value to each individual, (iii) a parent selection strategy that determines which individuals are nominated as parents to produce offspring, (iv) a survival selection strategy that defines a rule for deciding which individuals survive to the next generation, and (v) reproduction operators, which specify the way two or more encodings are combined to produce an offspring encoding.

Towards making population-based methods more effective, the “go with the winners” strategy [5], Population-Based Incremental Learning (PBIL) [14], and the path relinking and scatter search concepts [34] are instrumental. In effect, path relinking has been successfully applied to the QAP [2] and the PFSP [53].

2.2 Parallel methods for the PFSP, JSP, and QAP

Since the early development of parallel processing technology in mainframes, many researchers have concentrated on solving hard-to-solve problems by integrating parallelization techniques with metaheuristics. A recent overview of parallel metaheuristics is presented in [23], where the promising performance of parallel cooperative strategies is emphasized. In cooperative strategies, semi-independent procedures occasionally synchronize by sharing some information during the search progress. These methods are typically referred to as cooperative multi-search in the literature, and their success in tackling hard, NP-complete optimization problems is further stressed in [4, 65, 67], and [24]. More recently, new techniques for parallelization using Graphics Processing Units (GPUs) have been proposed for well-known metaheuristics such as ACO and GA [20, 50].

Single Program Multiple Data (SPMD) and threads models can be seen as two commonly used parallel programming models employed in metaheuristics. SPMD is a high-level parallel programming model in which all independent tasks run their own copy of the same program simultaneously, with different input data or initial points. In the case of metaheuristics, this initial starting point could simply be a different seed value for the pseudorandom number generators. The threads model, on the other hand, is a type of shared memory programming, in which a single process can have several concurrent execution paths. Independent threads can also communicate with one another through a global (shared) memory. This is in line with parallel cooperative strategies, as the knowledge of the search space can be shared and utilized by all threads.

It should be noted that multithreading is not necessarily equivalent to parallel computing. In this study, however, a threads model is used, and implemented using OpenMP API [21]. Provided that the number of threads is less than or equal to the number of CPU cores, OpenMP API typically ensures that each thread is run on a separate CPU core [21].
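The cooperative shared-memory pattern can be sketched as follows in Python, whose threads share memory but, under CPython's global interpreter lock, do not run truly in parallel; the "search" below is a deliberately trivial toy (minimizing x² over random integers), used only to show how semi-independent threads, each with its own seed, synchronize through a lock-protected global best:

```python
import random
import threading

# Shared (global) memory: the best solution found so far by any thread,
# guarded by a lock because several threads read and update it concurrently.
best = {"cost": float("inf"), "solution": None}
lock = threading.Lock()

def search_thread(seed, iterations=2000):
    """One semi-independent search thread (toy search: minimize x**2)."""
    rng = random.Random(seed)            # per-thread seed, as in SPMD models
    local_sol, local_cost = None, float("inf")
    for _ in range(iterations):
        x = rng.randint(-100, 100)
        if x * x < local_cost:
            local_sol, local_cost = x, x * x
        # Occasional cooperation: synchronize with the shared global best.
        with lock:
            if local_cost < best["cost"]:
                best["cost"], best["solution"] = local_cost, local_sol
            elif best["cost"] < local_cost:
                local_sol, local_cost = best["solution"], best["cost"]

threads = [threading.Thread(target=search_thread, args=(s,)) for s in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()
print(best)  # {'cost': 0, 'solution': 0} with overwhelming probability
```

An OpenMP implementation replaces the explicit thread objects with a parallel region and the lock with a critical section, but the cooperation logic is the same.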

Since the early days of parallel processing technology, researchers have focused on solving the PFSP, JSP, and QAP, as three highly applicable and challenging problems, through multithreaded and multi-core-based procedures. In the following subsections, these parallel procedures are surveyed for each of these problems separately.

2.2.1 Parallel methods for the PFSP

As a source of parallelism, island models in evolutionary searches can exchange individuals throughout the entire run of the algorithm, with this exchange of individuals generally being termed “migration” [25, 73, 74]. Among several island model GAs proposed for the PFSP, we can mention the one presented in [59], where the authors conducted experiments on randomly generated instances with 40–100 jobs and 4–10 machines. Another island model, presented in [17], addresses the FSP with the total completion time criterion. In this island model, the crossover operator is performed on individuals from different islands. The authors conducted experiments on Taillard’s benchmarks [63].

The parallel simulated annealing of [75] is another related work. The authors suggest an island model parallel strategy in which cooperation occurs when the global best solution is updated. The authors compared the independent and cooperative variants of the proposed procedure with the NEH heuristic of [46] and concluded that the cooperative variant with four processors yields better results.

A parallel Tabu search has also been presented by the same authors [16], where the search threads cooperate by broadcasting the global best solution, with a specific thread responsible for managing the exchange of global best solutions. The authors experimented on a selection of Taillard’s benchmark instances [63] as well as some randomly generated instances.

Among the more recent parallel approaches are those of [70] and [52]. In [70], a cooperative island model has been proposed wherein, similar to our approach, the same algorithm is run on different islands and occasional cooperation occurs at different stages of the algorithm. In the parallel hybrid proposed by [52], different allocations of Memetic Algorithms and the Iterated Greedy (IG) procedure [56] to multiple threads have been studied.

2.2.2 Parallel methods for the JSP

Among the earliest parallel strategies, we can mention the parallel tabu search presented in [64], where the author presents a tabu search method and describes why it is more efficient than the shifting bottleneck procedure. In the same paper, a parallel variant of the tabu search has also been presented that divides a problem into k sub-problems, each containing a subset of jobs. Each of these sub-problems is then solved independently and, in the final stage, the sub-problems are aggregated to form a solution to the original problem.

Aiex et al. [3] present a hybrid of GRASP and path-relinking which operates on a pool of elite solutions. In effect, GRASP is used to generate a pool of elite solutions, and path-relinking is applied to (i) a solution produced by GRASP and (ii) an elite solution chosen from the pool. The path-relinking result is used to update the pool of elite solutions. Two parallel variants have also been proposed in [3], namely collaborative and non-collaborative. While the non-collaborative version is an SPMD approach in which each thread executes a copy of the algorithm, in the collaborative version the pool of elite solutions is shared among threads. The authors also describe why their collaborative scheme achieves a better speed-up factor than their non-collaborative scheme.

Another related work is that of [58], in which a parallel Variable Neighborhood Search (VNS) is proposed for the JSP. The authors’ VNS consists of two main components, namely the shake and LocalSearch procedures. Whereas the shake procedure has the role of perturbing a given solution, the LocalSearch procedure improves the given solution based on SWAP or INSERTION neighborhoods. Therefore, the VNS of [58] consists of three main steps in each iteration: (i) constructing an initial solution, x, (ii) performing the shake procedure on x and storing the result as x′, and (iii) performing the LocalSearch on x′.

Sevkli and Aydin [58] compared four different parallelizations of their proposed VNS algorithm. In the first scheme, a copy of the VNS is run by all processors, starting from a single initial solution. After the completion of the VNS by all processors, the best solution found among all processors acts as the initial solution for the next iteration. While the first scheme waits for all threads to complete their VNS before starting the next generation, in the second scheme the parallelization strategy is asynchronous, and as soon as a processor finishes its VNS run, its next iteration starts with the incumbent overall best solution at that time. The two other parallel schemes proposed by the authors are decentralized in the sense that the search threads communicate through a network of processors and no central synchronization occurs. The two network structures proposed are a unidirectional ring and a mesh. The authors argued that the ring topology performs best among all the schemes.

2.2.3 Parallel methods for the QAP

One of the earliest and most famous procedures for the QAP is the Robust Tabu Search (RTS) developed in [62]. In that paper, the author presents two parallelization schemes for the proposed RTS. In the first scheme, the process of evaluating all potential neighbor solutions is divided among different processors for the purpose of reducing the computation time; after each processor evaluates its portion of the neighborhood, all threads synchronize at a single point to identify the best possible neighbor. The other proposed parallel strategy of [62] is the SPMD approach of running RTS instances independently from different initial solutions.

Another related parallel procedure to our proposed algorithm is the Cooperative Parallel Tabu Search (CPTS) of [38], which incorporates the RTS of [62]. The CPTS is essentially an SPMD parallel algorithm in which a copy of RTS is run in each thread, and each thread stops and synchronizes before and after running the RTS procedure. In particular, the cooperation between processors occurs by maintaining a small set of elite solutions, named reference set. Basically, the number of solutions in the reference set is equal to the number of processors and the reference set is shared among all processors.

Among the other parallel strategies for the QAP, we can mention the parallel hybrid of ant-colony and tabu search [66], and the island model parallel genetic algorithm of [69]. In both methods, a master-slave paradigm is used and the global information regarding best solutions in islands and/or pheromone trail matrix is communicated among the processors.

The other related work is the single instruction multiple data tabu search (SIMD-TS) of [79], in which a tabu search procedure is run on each thread and certain diversification and intensification actions are probabilistically performed in each thread. It is worth noting that the SIMD-TS has specifically been designed to run on GPUs instead of ordinary CPUs. An extensive review of recent exact and heuristic methods for the QAP can be found in [29].

3 Problem formulation

The PFSP, JSP, and QAP are described in turn as follows. Since the PFSP is a special case of the Flowshop Scheduling Problem (FSP), the FSP needs to be discussed first.

The FSP is a subclass of scheduling problems in which n jobs have to be processed on m machines, with the goal of finding an optimal processing sequence of jobs on machines. The optimality criterion is mainly the completion time of the last operation on the last machine (the makespan). In the FSP, each job has to be processed on the same sequence of machines. For instance, if job 1 has to be processed on machines 4, 2, 3, and 1, one after another, then all other jobs follow the same order. The PFSP is a special case of the FSP in the sense that all machines have to process all the jobs in the same order. In the PFSP, assuming that the jobs are numbered 1,2,…,n, the goal is to find an optimal permutation of jobs π1,π2,…,πn so that the completion time of the last job on the last machine is minimized. The completion time of job πi on machine j, C(πi,j), can be calculated as follows.

$$ C(\pi_{i},j)=\max{\{C(\pi_{i-1},j),C(\pi_{i},j-1)\}}+T(\pi_{i},j) $$
(1)

where C(π0,j) = C(πi,0) = 0 and T(i,j) is the processing time of job i on machine j.
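The recurrence in Eq. (1) can be evaluated in O(nm) time with a single rolling array, since each entry depends only on the previous job's completion on the same machine and the current job's completion on the previous machine. The following Python sketch is an illustration of this computation (the two-job instance is made up for demonstration, not taken from the paper):

```python
def makespan(perm, T):
    """Completion time of the last job on the last machine, per Eq. (1).
    T[i][j] is the processing time of job i on machine j; perm is the
    job permutation pi_1, ..., pi_n (0-based job indices)."""
    m = len(T[0])
    C = [0] * m                  # C[j]: completion of the previous job on machine j
    for job in perm:
        prev = 0                 # completion of this job on machine j-1
        for j in range(m):
            C[j] = max(C[j], prev) + T[job][j]
            prev = C[j]
    return C[-1]

# Toy instance: T[job][machine]
T = [[3, 2],   # job 0
     [2, 4]]   # job 1
print(makespan([0, 1], T))  # → 9
print(makespan([1, 0], T))  # → 8, so the job order matters
```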

The PFSP has diverse applications in manufacturing and has increasingly attracted the attention of researchers to assess new algorithmic ideas. The PFSP is NP-hard [54, 55] and even some instances with a moderate number of jobs and machines have not been solved to optimality.

The second problem is the JSP. In the JSP, n jobs have to be processed on m machines and, unlike in the FSP, each job may have a different processing order on the machines. However, as in the FSP, each machine can process only one job at a time and each job can be processed on only one machine at a time. The goal is to find a schedule which minimizes the time required to process all jobs on all machines, i.e. the makespan. The JSP is one of the hardest combinatorial optimization problems [32]. A standard, well-known mathematical formulation for the JSP is the disjunctive graph formulation, which can be found in [1].

The QAP is in the class of Facility-Location problems, with diverse applications in different areas such as factory layout design as well as the problem of placing electronic components in a circuit or microchip. In the QAP, we are given a set of locations and a set of facilities, both having the same size, n. In addition, two n × n matrices, D and F are given as input, with dkl indicating pair-wise distance between locations k and l, and fij specifying pair-wise flow between facilities i and j. The objective is to find a one-to-one mapping between facilities and locations which minimizes the cost of flow, C, calculated as a summation of pair-wise flow between facilities multiplied by their distance.

A solution to the QAP can simply be represented as a permutation of facilities. For example, the permutation π = (4,3,1,2) represents the solution in which facilities 4, 3, 1, and 2 are placed in locations 1, 2, 3, and 4, respectively. Belonging to the class of NP-hard problems, the QAP is computationally demanding even with respect to finding a solution guaranteed to be within a given distance of the optimal solution. With π representing a solution and Π denoting the set of all possible permutations, the QAP objective function is formulated as follows.

$$ \min_{\pi \in {\varPi}}{C(\pi)}=\sum\limits_{i=1}^{n}\sum\limits_{j=1}^{n}{f_{ij}d_{\pi(i)\pi(j)}} $$
(2)
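Under the convention that π(i) is the location assigned to facility i, the objective of Eq. (2) can be evaluated directly as a double sum. The following Python sketch illustrates this (the 3 × 3 flow and distance matrices are a made-up toy instance, not from the paper):

```python
def qap_cost(perm, F, D):
    """QAP objective of Eq. (2): sum over all i, j of
    F[i][j] * D[perm[i]][perm[j]], where perm[i] is the (0-based)
    location assigned to facility i."""
    n = len(perm)
    return sum(F[i][j] * D[perm[i]][perm[j]]
               for i in range(n) for j in range(n))

# Toy symmetric instance with 3 facilities/locations.
F = [[0, 5, 2],
     [5, 0, 3],
     [2, 3, 0]]
D = [[0, 1, 4],
     [1, 0, 2],
     [4, 2, 0]]
print(qap_cost([0, 1, 2], F, D))  # identity assignment → 38
print(qap_cost([1, 0, 2], F, D))  # swapping facilities 0 and 1 → 42
```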

It is worth noting that the P3H can be applied as a general-purpose metaheuristic, in which any fitness or cost function and any set of constraints can be adopted. Furthermore, any constraint in the given problem can be seen as defining the feasible search space. In this paper, we identify a good solution as one having a lower cost, i.e. a minimization problem, and the constraints are satisfied by considering only valid permutations as feasible solutions.

4 The P3H

The P3H employs multi-threading; in each thread, a construction technique improves the quality of genomes in a genetic algorithm, with an effective exploration/exploitation balance being achieved through unrestricted and egalitarian parent selection. Restricted and elitist offspring selection also contributes to this balance. The threads are not fully independent in the sense that in each thread, the employed local search improves the best overall solution obtained across all threads. The three interacting layers of the P3H are depicted in Fig. 2, and the detailed stepwise description of the P3H is shown in Fig. 3.

Fig. 2 Three layers of the P3H procedure and their interactions

Fig. 3 The pseudocode of the parallel three-layer hybrid (P3H)

It should be noted that the P3H can easily be reproduced for similar optimization problems by (i) implementing the general, problem-independent modules, namely the heuristic construction, genetic algorithm, and enhancing modules, and (ii) incorporating six problem-specific modules. While the general modules are described next, the problem-specific components, outlined in Table 1, are explained in detail in the following subsections.

Table 1 The selection of P3H problem-specific modules for the PFSP, JSP, and QAP

As seen in Fig. 3, line 2 starts the threads, with the block of code instantiated for each thread being shown in lines 3 through 50. It is worth noting that since the P3H is an SPMD parallel approach, each thread runs an identical copy of the same routine.

Each thread operates in three layers. In the first layer, a number of high-quality solutions are constructed, in the sense that among all of the constructed solutions, those with lower quality are discarded and those with higher quality are kept in a list called eliteHeap. The highest-quality part of these kept solutions is used in the second layer, and the rest can be used in the third layer.

In the second layer, an initial population of solutions is formed by selecting the top solutions from the eliteHeap, and this population is evolved through the genetic algorithm. For the purpose of evolving the solutions, the algorithm works as follows. First, from the current population, whose size is denoted by populationSize, populationSize/2 pairs of parents are selected at random. Then, a problem-specific crossover operator is applied to each pair of parent genomes. This is followed by a mutation operator, so that an offspring population with the same size as the current population is generated. Finally, from the combined population of parent and offspring genomes, only the half with the highest quality enters the next generation and the other half is discarded.
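The second-layer generation scheme can be sketched as follows in Python. The crossover, mutation, and cost functions below are simplistic placeholders for the problem-specific operators (the paper's actual operators are described in Sections 4.1–4.3); the sketch only illustrates the pairing, offspring generation, and elitist survival selection described above:

```python
import random

def next_generation(population, cost, crossover, mutate,
                    mutation_prob=0.15, rng=random):
    """One generation: populationSize/2 randomly chosen parent pairs each
    produce two offspring, and only the best half of the combined
    parents + offspring survives (elitist survival selection)."""
    offspring = []
    for _ in range(len(population) // 2):
        p1, p2 = rng.sample(population, 2)        # unrestricted, egalitarian
        for child in (crossover(p1, p2), crossover(p2, p1)):
            if rng.random() < mutation_prob:      # mutation with small probability
                child = mutate(child)
            offspring.append(child)
    combined = population + offspring
    combined.sort(key=cost)                       # lower cost = higher quality
    return combined[:len(population)]

def ox(p1, p2):
    """Toy crossover: keep a prefix of p1, fill the rest in p2's order."""
    head = p1[:len(p1) // 2]
    return head + [g for g in p2 if g not in head]

def swap_mut(p):
    q = p[:]
    i, j = random.sample(range(len(q)), 2)
    q[i], q[j] = q[j], q[i]
    return q

# Toy cost: total displacement of each element from its sorted position.
cost = lambda p: sum(abs(v - i) for i, v in enumerate(p))

random.seed(42)
pop = [random.sample(range(6), 6) for _ in range(8)]
for _ in range(20):
    pop = next_generation(pop, cost, ox, swap_mut)
print(cost(pop[0]))  # best cost never increases across generations (elitism)
```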

In the third layer, a fine-tuning procedure, which works based on a point-based strategy, is run on the solutions in the constructed pool, using the solutions left in the eliteHeap. As line 27 of the pseudo-code presented in Fig. 3 shows, this point-based strategy is a problem-specific restricted tabu search, called restrictedTabu(). It is restricted in the sense that after a maximum number of iterations with no improvement, the procedure is halted and the best solution encountered is returned. As mentioned, this restricted tabu strategy is problem-specific and is described for each problem in the next subsections.

Towards its fine-tuning process, the restricted tabu procedure performs a number of actions aimed at keeping a balance between intensification and diversification. As indicated in the switch-case starting at line 29 of the pseudo-code in Fig. 3, this procedure randomly selects one of four different actions. The selection is made based on probabilities given as input parameters; the four actions, one of which is stochastically selected and performed in the switch-case, are as follows.

The first action is selected with the chance of intensificationProb and causes a problem-specific large neighborhood search to be performed on the current solution. Whenever this large neighborhood search is unable to improve the current solution, a perturbation is performed on the current solution, replacing it with one of its neighbors.

The second action is selected with the chance of concentrationProb, and replaces the current solution with the global best solution among all threads. Then, in order to prevent the algorithm from revisiting the same solution, this global best solution undergoes a perturbation. The third action is chosen with the chance of perturbationProb and only causes a perturbation to be applied to the current solution. The main purpose of applying a perturbation is to diversify the search and assist the algorithm to escape local optima.

The fourth action is selected with the chance of refreshProb. Under this action, the current solution is replaced with the unused highest-quality solution from the previous layers of the same thread. The selection of a replacement solution is accomplished in the following order. Initially, the best unused solution of the final population constructed in the second layer is considered and removed from the pool. If no element is left there, the next highest-quality solution remaining in the eliteHeap of the first layer is considered and removed from the eliteHeap. In the case where both the pool and the eliteHeap have become empty, a new solution is constructed with the problem-specific construction method specified in the first layer.

It is worth noting that the sum of the four parameters intensificationProb, concentrationProb, perturbationProb, and refreshProb is 1, implying that, depending on the setting of these parameters, different balances between intensification and diversification can be expected. Moreover, because of their constant sum, introducing only three parameters is enough, and the remaining parameter can be calculated from the given ones.
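The probabilistic selection among the four actions amounts to sampling from a categorical distribution in which the fourth probability is implied by the other three. A minimal Python sketch of this selection (the action names are illustrative labels, not identifiers from the paper's pseudo-code):

```python
import random

def pick_action(intensification_prob, concentration_prob, perturbation_prob,
                rng=random):
    """Select one of the four actions; refreshProb need not be passed,
    since the four probabilities must sum to 1."""
    r = rng.random()
    if r < intensification_prob:
        return "intensify"        # large neighborhood search (+ perturb on failure)
    r -= intensification_prob
    if r < concentration_prob:
        return "concentrate"      # jump to a perturbed copy of the global best
    r -= concentration_prob
    if r < perturbation_prob:
        return "perturb"          # diversify from the current solution
    return "refresh"              # replace with an unused elite solution

random.seed(1)
counts = {a: 0 for a in ("intensify", "concentrate", "perturb", "refresh")}
for _ in range(10000):
    counts[pick_action(0.4, 0.2, 0.3)] += 1
print(counts)  # frequencies roughly proportional to 0.4 / 0.2 / 0.3 / 0.1
```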

As shown in the pseudo-code of the P3H, the underlined modules are problem-specific and should be implemented separately for any specific problem. These problem-specific modules comprise the major components of the P3H and are as follows: (i) an initial solution construction method, (ii) a crossover operation, (iii) a mutation operation, (iv) a restrictedTabu component, (v) a large neighborhood scheme, and (vi) a perturb mechanism. The selection of these problem-specific modules is outlined in Table 1 and is described in detail for the PFSP, JSP, and QAP, respectively, in the three following subsections.

4.1 Problem-specific modules for the PFSP

The aforementioned six components have been implemented for the PFSP as follows. For the first component, the solution construction module, four different initial solution construction methods have been considered. These methods include (i) a uniformly-at-random construction method, in which all n! job permutations have an equal chance of 1/n! of being selected, (ii & iii) two re-blocking construction methods proposed in [9], and (iv) the RJP (Recursive Johnson Procedure) proposed in [6]. All of these methods have been tested in Section 5 and, based on the results of the experiments, the re-blocking mechanism has been adopted. We denote these four methods by (i) UNIFORM, (ii) REBLOCKI, (iii) REBLOCKII, and (iv) RJP, respectively.

The next two components are the crossover and mutation operators. The Longest Common Sub-sequence (LCS) crossover developed in [37] has been used as the crossover operator, and a single swap move and an insertion move, each with a chance of 0.5, have been used as the mutation operator. It is worth noting that the mutation operation is applied with a small probability (0.1–0.2), given as an input parameter.

The choice of the restrictedTabu module, as the next component, is based on a modified version of the tabu search proposed in [47]. In this modified implementation, first an ordinary insertion local search is run on the current solution. By using critical path information, this insertion local search avoids unfruitful moves. Moreover, by using the forward and backward completion time matrices, it reduces the computation time [6, 9]. The restrictedTabu module stops when no improving move has been found for a fixed number of iterations.

The last two components are the Large-Neighborhood-Improvement and the Perturb modules. The iterative decomposition procedure of [6] is used as the large neighborhood improvement. This method divides the solution into substrings, called chunks, and iteratively optimizes each chunk. The perturb module employed is the combination of a 3-replacement move and a random insertion move. The 3-replacement move selects three random indexes, i1, i2, i3, and puts the job at position i1 in position i2, the job at position i2 in position i3, and finally the job at position i3 in position i1. A parameter called perturbIntensity controls the degree of such perturbation through the number of times this 3-replacement move is applied. For instance, when this parameter is set to two, the 3-replacement move is applied twice.
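The 3-replacement move is a 3-cycle on the selected positions and can be sketched as follows in Python (an illustrative reconstruction from the description above, not the paper's implementation):

```python
import random

def three_replacement(perm, perturb_intensity=1, rng=random):
    """One 3-replacement move per repetition: the job at position i1
    moves to i2, the job at i2 moves to i3, and the job at i3 moves
    to i1. perturbIntensity sets how many times the move is applied."""
    p = perm[:]
    for _ in range(perturb_intensity):
        i1, i2, i3 = rng.sample(range(len(p)), 3)
        # The right-hand side is evaluated first, so the old values are used.
        p[i2], p[i3], p[i1] = p[i1], p[i2], p[i3]
    return p

random.seed(7)
print(three_replacement([0, 1, 2, 3, 4], perturb_intensity=2))
```

Since the three positions are distinct, a single application always changes the permutation, which is what makes the move useful for escaping local optima.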

4.2 Problem-specific modules for the JSP

The six components used for the JSP are as follows. For the first component, solution construction, three initial methods have been tested. The first method is the forward-backward Semi-Active Schedule Generator (SASG) proposed in [7], and the second method is the forward-backward Randomized Giffler and Thompson (RGT) construction method proposed in [8]. The RGT primarily generates non-delay schedules using the well-known Giffler and Thompson procedure and probabilistically improves a non-delay schedule based on a forward-backward mechanism.

The other employed construction method, called the Iterative Carlier (IC) procedure, uses the Carlier one-machine optimization procedure [19] and applies it iteratively. The process starts with an empty schedule and then, based on a random order of machines, schedules the machines incrementally. In each iteration, through solving the associated one-machine problem, a single machine is scheduled to optimality, and the result updates the current solution. As will be discussed in Section 5, based on the computational experiments, only the RGT has been adopted.

For the second and third components, the crossover and mutation operators, the Linear Order crossover (LOX) of [30] and a simple mutation operator that performs a single swap move on a random machine have been considered.

Since the encoding used for the JSP is based on simple permutations of operations on machines, the LOX has to be applied to each pair of permutations on the same machine independently. The main problem with such an independent application of the LOX, however, is that the resulting offspring solution may be infeasible. To rectify this possible infeasibility, we have used the Giffler and Thompson procedure to not only remove any infeasibility but also create an active schedule.

The fourth component, the restrictedTabu module, has been implemented based on a modified version of the Limited Tabu Search (LTS) proposed in [8]. Like the LTS, the employed module works based on the N5 and N6’ neighborhoods [48]; unlike the LTS, however, the restrictedTabu module keeps no history of solutions visited during the search process. Furthermore, similar to the LTS, for evaluating potential neighbor solutions, it uses an estimate of the makespan and avoids any exact evaluation.

The fifth component, the Large-Neighborhood-Improvement module, is the Forward-Backward Shifting Bottleneck Procedure (ForwardBackwardSBP), originally proposed in [7]. The ForwardBackwardSBP works as a solution-refinement routine which incorporates the post-optimization stage of the Shifting Bottleneck Procedure (SBP) of [1].

The Perturb module, as the sixth component, operates based on the N1 neighbourhood [72] and simply performs a random swap of two neighbouring operations on a random critical block. For this module, the perturbIntensity parameter determines the number of times a move is performed. The updating of the makespan and critical path, which needs to be done after the completion of each move, is performed by the method presented in [8].
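As a sketch (with hypothetical names, and with the block boundaries passed in rather than derived from the critical path), the Perturb step could look like the following; the makespan and critical-path update of [8] is omitted:

```cpp
#include <vector>
#include <random>
#include <algorithm>
#include <cassert>

// Illustrative sketch of the Perturb module: perform perturbIntensity
// random swaps of two neighbouring operations inside the block
// seq[first..last]. In the P3H the block would be a random critical
// block, and the makespan and critical path would be updated after
// each move.
void perturb(std::vector<int>& seq, int first, int last,
             int perturbIntensity, std::mt19937& rng) {
    if (last - first < 1) return;                 // need at least two operations
    std::uniform_int_distribution<int> pick(first, last - 1);
    for (int k = 0; k < perturbIntensity; ++k) {
        int i = pick(rng);                        // swap positions i and i+1
        std::swap(seq[i], seq[i + 1]);
    }
}
```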

4.3 Problem-specific modules for the QAP

For the QAP, the six major components of the P3H have been set as follows. For the first component, the construction module, three construction methods have been implemented. The first method, named UNIFORM, simply assigns facilities to locations uniformly at random. The second construction module, the Randomized Heuristic (RH) developed in [77], aims at assigning facilities with high interactions (flows) to locations with short distances to the others. A percentage of facilities with high interactions are randomly selected and assigned to locations with small distances from one another. Then the remaining facilities are allocated, one after another, to empty locations in such a way that the total assignment cost experiences the least increase. In effect, the second part of this module works similarly to the procedure presented in [43].

The next construction method tested was originally presented in [11] and is a modified implementation of the procedure presented in [43]. In the employed procedure, named GREEDY, a solution is incrementally constructed by deciding on the best value of each solution component, one at a time, in a greedy manner. First, a random permutation of facilities is generated; based on this order, the best location for each facility is determined and added to the partial solution. In Section 5, all three construction methods are tested and, because of its higher performance, the GREEDY method has been adopted.
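A minimal sketch of this greedy construction idea is shown below (our own illustration, not the code of [11]; the facility order is passed in as a parameter, whereas the actual GREEDY method would generate it at random):

```cpp
#include <vector>
#include <limits>
#include <cassert>

// Sketch of a GREEDY-style QAP construction: facilities are scanned
// in the given order and each is placed on the empty location that
// causes the smallest increase in the partial assignment cost.
// flow[f][g] is the flow between facilities, dist[a][b] the distance
// between locations; the result maps facility -> location.
std::vector<int> greedyConstruct(const std::vector<std::vector<int>>& flow,
                                 const std::vector<std::vector<int>>& dist,
                                 const std::vector<int>& order) {
    int n = static_cast<int>(flow.size());
    std::vector<int> loc(n, -1);
    std::vector<bool> taken(n, false);
    for (int f : order) {
        long long best = std::numeric_limits<long long>::max();
        int bestL = -1;
        for (int l = 0; l < n; ++l) {
            if (taken[l]) continue;
            long long inc = 0;                    // cost added by placing f on l
            for (int g = 0; g < n; ++g)
                if (loc[g] >= 0)
                    inc += static_cast<long long>(flow[f][g]) * dist[l][loc[g]]
                         + static_cast<long long>(flow[g][f]) * dist[loc[g]][l];
            if (inc < best) { best = inc; bestL = l; }
        }
        loc[f] = bestL;
        taken[bestL] = true;
    }
    return loc;
}
```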

As the second component, the crossover operator, the cohesive merge procedure of [28] has been implemented. The cohesive merge is a computationally intensive crossover scheme which works based on the median distance of locations (sites). In our implementation, as an initialization phase, the median distance for all locations is first calculated. Next, given two parent solutions, every location is tested as a potential pivot site, resulting in 2n different solutions. From these 2n generated solutions, only the top two are selected as the offspring.

As the third component, the mutation operator, a combination of two different methods has been considered and applied at random. The first operator is a simple swap of the locations of two facilities, and the second applies a random cycle of size 3. For instance, a cycle (f1,f2,f3) indicates that facility f1 should be re-assigned to the location of facility f2, facility f2 to the location of facility f3, and facility f3 to the location previously occupied by facility f1.
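The size-3 cycle can be sketched in a few lines (illustrative names; p[f] gives the location currently assigned to facility f):

```cpp
#include <vector>
#include <cassert>

// Size-3 cycle mutation on a facility-to-location assignment:
// f1 takes f2's location, f2 takes f3's location, and f3 takes
// the location vacated by f1.
void cycle3(std::vector<int>& p, int f1, int f2, int f3) {
    int vacated = p[f1];
    p[f1] = p[f2];
    p[f2] = p[f3];
    p[f3] = vacated;
}
```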

As the fourth component, the restrictedTabu module, the Extended N* local search procedure [77] has been adopted. This procedure starts with the simple swap neighborhood and continues using it while improvement is possible. It then switches to the Nλ neighborhood [43], in which, after each degrading move is applied, the two facilities involved in that swap are marked as tabu to prevent them from participating in further iterations.

The fifth component, Large-Neighborhood-Improvement, has been adopted from the Neighborhood Decomposition-based Search (NDS) developed in [11]. In this method, the relocation matrix associated with each QAP solution is passed to the Quick Match (QM) linear assignment solving routine [42], and the resulting linear assignment proposals are extensively evaluated for improvement.

The sixth component, the Perturb module, simply applies a number of random cycles of size 3, with the number of cycles determined by the perturbIntensity parameter; for instance, when perturbIntensity is set to 5, five such cycles are applied.

5 Computational experiments

The P3H has been coded in C++ and the computational experiments have been performed on a Quad-Core PC with a 3.4 GHz CPU. Visual C++ has been used as the compiler, and the parallelization is based on the OpenMP programming interface [21], with up to four cores of the CPU being used. Since the P3H is applied to three entirely different problems, a wide variety of benchmark instances have been involved in the experiments. For each of the three problems, the corresponding benchmark instances have been selected as follows.

For the PFSP, Taillard's repository of benchmark instances [63] has been employed. The repository consists of 120 instances in 12 classes, with 10 instances in each class. The numbers of jobs and machines of these instances are in the ranges 20–500 and 5–20, respectively.

For the JSP, 43 instances have been taken from the ORLIB website. This website is managed by Brunel University, UK, and contains a large number of benchmark instances produced for operational research problems. These instances are (i) a selection of 11 hard-to-solve instances from [41], named laxx, (ii) 3 classic instances from [31] named ft06, ft10, and ft20, (iii) 4 instances yn1-4 from [76], (iv) 10 instances, swv01-10 from [60], (v) 5 instances abz5-9 due to [1], and (vi) 10 instances orb01-10 from [12].

For the QAP, a representative set of 29 benchmark instances, having up to 90 facilities/locations, has been selected from the QAPLIB [18]. These instances include 16 instances named taixxa and taixxb, 7 instances named skoxx, and a selection of 6 representative instances, namely els19, bur26d, nug30, ste36c, lipa50a, and tai64c. The numeric literals (xx) in the instance names indicate the instance size. It is worth mentioning that these instances are among the most popular instances in the literature.

Since the initial solution construction method plays a key role in the overall quality of the solutions produced by the procedure, we first analyze the effect of selecting different initial solution construction methods for each of the three problems. Then, with respect to each problem, the parameter values set for the P3H are discussed. An analysis of the parallelization effect is provided next. Finally, a brief evaluation comparing the performance of the P3H with that of other state-of-the-art procedures is provided.

5.1 Comparing the initial solution construction methods

As mentioned in Section 4.1, four heuristic construction methods have been implemented for the PFSP, namely (i) UNIFORM, (ii) REBLOCKI, (iii) REBLOCKII, and (iv) RJP. To evaluate these methods, a set of 12 Taillard instances of different sizes has been used, and each construction method has been run 1000 times for each instance, with different random seed values. Table 2 compares these four methods. The column %DEVavg shows the average percent deviation of the solution makespan (M) from the upper bound (UB), calculated as (M − UB)/UB × 100. The column %STDEV shows the sample standard deviation as a percentage of the best known upper bound. In other words, STDEV = √( (1/(N − 1)) ∑_{i=1}^{N} (M_i − M_avg)² ) and %STDEV = STDEV/UB × 100, with N set to 1000 and M_i denoting the makespan value for run i. Also, M_avg denotes the average makespan over the 1000 runs. For each method, the total construction time for generating all 12000 solutions is also shown in Table 2.
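To make the two measures concrete, the following helper (an illustration, not the authors' code) computes %DEVavg and %STDEV from the makespans of a batch of runs:

```cpp
#include <vector>
#include <cmath>
#include <cassert>

// %DEVavg = (M_avg - UB)/UB * 100, and %STDEV = STDEV/UB * 100, with
// STDEV the sample standard deviation of the N makespans.
struct DevStats { double devAvg; double stdevPct; };

DevStats devStats(const std::vector<double>& M, double UB) {
    int N = static_cast<int>(M.size());
    double sum = 0.0;
    for (double m : M) sum += m;
    double Mavg = sum / N;
    double ss = 0.0;
    for (double m : M) ss += (m - Mavg) * (m - Mavg);
    double stdev = std::sqrt(ss / (N - 1));   // sample standard deviation
    return { (Mavg - UB) / UB * 100.0, stdev / UB * 100.0 };
}
```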

Table 2 Comparing the performance of the four different construction methods for the PFSP

Under the assumption that %DEVavg is normally distributed, Fig. 4 shows the estimated probability density functions for all construction methods. The probability that a construction method generates a solution with zero percentage deviation from the best known solution (upper bound), i.e. P(X ≤ 0) = F_X(0), can be calculated for the four methods as 1.5 × 10⁻¹¹, 2.8 × 10⁻⁹, 8.3 × 10⁻¹², and 4.8 × 10⁻²⁸, respectively. This indicates that the second method, the first re-blocking strategy, has the highest chance of generating solutions with zero percentage deviation from the best known solution.
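Under this normality assumption, P(X ≤ 0) can be evaluated directly from the fitted mean (%DEVavg) and standard deviation (%STDEV) via the standard normal CDF; a small helper might read:

```cpp
#include <cmath>
#include <cassert>

// P(X <= 0) for X ~ Normal(mu, sigma): the estimated probability
// that a single construction run reaches zero deviation from the
// best known solution.
double probZeroDeviation(double mu, double sigma) {
    double z = (0.0 - mu) / sigma;                   // standardise
    return 0.5 * std::erfc(-z / std::sqrt(2.0));     // standard normal CDF at z
}
```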

Fig. 4
figure 4

The estimated normal distribution functions for the four different construction methods for the PFSP

In effect, the first method, the closest competitor of the winner, would have to be at least 100 times faster than the winner to outperform it, whereas a comparison of construction times shows that it is only 3 times faster. Hence, based on these observations, the first re-blocking method has been adopted as the construction module for the PFSP.

For selecting the best construction method for the JSP, all three presented methods have been considered, namely SASG, RGT, and IC. Each of these methods has been run with 1000 different random seeds for each of the 43 JSP benchmark instances mentioned. As can be seen in Table 3, IC and RGT show superior performance over SASG. However, it should be noted that IC is considerably slower than both SASG and RGT. Figure 5 shows the estimated normal probability density functions for the three methods. The probabilities P(X ≤ 0) for SASG, RGT, and IC can be calculated as 8.4 × 10⁻¹⁰, 2.3 × 10⁻⁶, and 1.7 × 10⁻⁶, respectively. Despite the fact that IC produces the highest quality solutions, its excessive computational time prevents it from being selected; in effect, IC is more than 100 times slower than RGT and more than 60 times slower than SASG. Based on these considerations, RGT has been adopted as the construction module for the JSP.

Table 3 Comparing the performance of the three different construction methods for JSP
Fig. 5
figure 5

The estimated normal distribution functions for the three different construction methods for the JSP

For the QAP, all three construction methods have also been compared and the best one adopted. The results are shown in Table 4 and Fig. 6. In this table, %DEVavg shows the average deviation of the solution cost (C) from the best known solution (BKS), computed as (C − BKS)/BKS × 100. Similarly, %STDEV has been calculated as √( (1/(N − 1)) ∑_{i=1}^{N} (C_i − C_avg)² ) × 100/BKS. The columns under the titles UNIFORM, RH, and GREEDY correspond to the uniform, RH, and greedy methods, respectively, described in Section 4.3. As can be seen, the GREEDY method, despite being relatively slower, shows superior performance. Likewise, assuming a normal distribution, the values of P(X ≤ 0) for UNIFORM, RH, and GREEDY are 1.6 × 10⁻⁹, 4.9 × 10⁻¹¹, and 6.1 × 10⁻⁶, respectively. Therefore, based on these considerations, GREEDY has been adopted as the construction module for the QAP.

Table 4 Comparing the performance of the three different construction methods for the QAP
Fig. 6
figure 6

The estimated normal distribution functions for the three different construction methods for the QAP

It should be noted that all the comparisons of initial solution construction methods for the PFSP, JSP, and QAP reported in this subsection have been performed in the same running environment.

5.2 Parameter settings

Regardless of the problem the P3H is applied to, it has a total of 11 parameters to be adjusted. In setting these parameters, it has been noted that exploiting some high quality neighborhoods without effectively exploring the solution space, and exploring the solution space without effectively exploiting high quality neighborhoods, is the easiest trap that an inappropriate parameter setting for the P3H can fall into. This principle is the main basis for the manual setting of the P3H parameters. It is worth noting that, while an adaptive (automatic) parameter tuning method [40] could have been adopted, a manual parameter tuning, guided by balancing the exploration and exploitation forces of the P3H, has been used and is explained as follows.

In effect, in setting these parameters for each of the three problems, a small number of instances with varying degrees of complexity, with respect to their characteristics and sizes, have been selected, and different values of the parameters have been tested. The set of values which performed best has then been chosen. In these parameter setting experiments, we found that many different settings achieved the same high quality performance.

This similar performance can mainly be explained by the robustness of the procedure. In setting the parameters, we noticed that the appropriate setting of a particular parameter depends on the values of the other parameters. In effect, because of this interdependence, whereas in some settings of the other parameters lower values of a given parameter contribute to high quality solutions, in other settings the reverse is true. This is partly due to the facts that (i) the performance of the P3H mainly depends on how it strikes a balance between exploration and exploitation, and (ii) it is the combination of parameters that affects this balance.

In cases where increasing one parameter, like perturbationProb or mutation, contributes to further exploration, and increasing another, like intensificationProb or concentrationProb, contributes to further exploitation, the setting of the first group of parameters can offset that of the second group and vice versa. Table 5 shows the parameter settings for each of the three problems, and includes a UNIVERSAL column proposing initial starting values for applying the P3H to other optimisation problems. This column is guided by our preliminary experiments as well as by the average values for the three problems under study. It is suggested that these universal starting values be adjusted based on both the exploration/exploitation considerations explained in Section 4 and any relevant problem-specific knowledge.

Table 5 The setting of parameters for the PFSP, JSP, and QAP

5.3 Analyzing the effect of parallelization

Given our Quad-Core processor and the capability of the P3H to work with different numbers of threads, we analyzed four variants of the P3H, denoted P3Hc, in which c is the number of active threads. Although there is no limitation on the number of threads employed in the P3H, the Quad-Core processor has enforced a limit of four threads.

Each variant has been run 10 times with a problem-specific time limit for each run. The time limits have been set so that the P3H has a maximum allowed running time equal or close to that of other algorithms in the literature. In this direction, for the PFSP and JSP, the time limit has been set to n·m/10 and max(1.0, (n/m) × (9n − 60)) seconds per run, respectively, where n and m are the numbers of jobs and machines. In solving the QAP, the time limit has been set to (1/10) × 2^(N/10) minutes, where N is the instance size. For each variant, the best and average percent deviations from the best known solution (tightest upper bound), %DEVbest and %DEVavg, are reported. Additionally, to provide a basic estimate of the time complexity of the P3Hc, its maximum allowed run times for various sample problem sizes are reported in Table 6.

Table 6 Maximum allowed running times of the P3H, for the PFSP, JSP, and QAP, respectively

Table 7 shows the results of running all four variants on the 12 classes of Taillard instances. As can be seen, there is an improving trend as the number of cores increases. For instance, we observe a 23% improvement in %DEVavg, from 0.64 with the P3H1 to 0.49 with the P3H4.

Table 7 Analyzing the effect of the P3H parallelization on %DEVbest and %DEVavg for the PFSP

As shown in Table 8, similar comparisons have been made for the JSP. For some JSP instances, increasing the number of cores from 2 to 3 has a non-improving or degrading effect on %DEVavg and %DEVbest. This may be due to the unpredictable randomized nature of metaheuristics in general, and of parallel metaheuristics in particular, which can lead to unexpected behaviors. In effect, the issue is more critical for parallel methods because of the parallel computational overhead and possible race conditions on the shared variables. It should be noted that, despite the increase of %DEVavg for some individual instances, the total %DEVavg of the P3H2 equals that of the P3H3, and %DEVbest decreases from 0.34 to 0.30, indicating a significant improvement on the rest of the instances.

Table 8 Analyzing the effect of the P3H parallelization on %DEVbest and %DEVavg for the JSP

For the QAP instances, Table 9 shows the related results. Again, an improving trend can be observed in both %DEVavg and %DEVbest. In effect, %DEVavg improves by nearly 25% as the number of cores increases from 1 to 4.

Table 9 Analyzing the effect of the P3H parallelization on %DEVbest and %DEVavg for the QAP

Finally, in order to compare different variants of the P3H in the same running environment, three variants have been implemented and run on the same CPU, each using a single thread/core with an identical maximum running time. These variants are P3HGA, P3HENHANCE, and P3H1. In P3HGA, the enhancing module of the P3H is disabled and only the genetic algorithm layer is active; in P3HENHANCE, only the enhancing layer is active and the GA layer is deactivated. In P3H1, both the GA and enhancing layers are active, and the procedure operates on a single thread/core. The results are shown in Table 10. As can be seen, P3HENHANCE and P3H1 show superior performance compared to P3HGA. This highlights the crucial contribution of the enhancing module to the overall success of the P3H.

Table 10 Comparing the performance of the three different variants of the P3H

5.4 Comparison with other metaheuristics

In this section, we compare the P3H4 with some of the highest-performing metaheuristics in the literature. The comparisons are based on %DEVavg and the running time. It is worth noting that, since different programming approaches, computational environments, and numbers of threads have been employed, the reported running times serve informative purposes only, and any running time comparison needs to account for the discrepancies involved, from the number of threads through the type of processors to the programming environments.

Table 11 shows the result of comparing the P3H4 with several state-of-the-art metaheuristics. These methods include the Simulated Annealing algorithm, SAOP, of [49], the iterated local search (ILS) of [61], two Ant Colony Optimization algorithms, PACO and M-MMAS, proposed in [51], the Hybrid genetic algorithm, HGA_RMA, of [57], the Particle swarm optimization algorithm, PSOVNS of [68], the hybrid variable neighborhood search, NEGAVNS of [80], the Re-blocking Adjustable Memetic Procedure, RAMP, of [9], and the Refining Decomposition-based Integrated Search, RDIS, of [6].

Table 11 Comparison of the %DEVavg of P3H4 to that of other metaheuristics for the PFSP

As can be seen in Table 11, the P3H4 outperforms all other approaches on 20 × 20 and 50 × 20 problem groups. In addition, for 20 × 5, 20 × 10 and 100 × 20 instances, the P3H4 performance is equal to that of top performing methods. Furthermore, except for three problem groups with the deviation percentages of 1.3%, 1.69%, and 1.22%, the average deviation from the best known solution is always less than 0.6%.

Concerning the processor speeds, it should be mentioned that HGA_RMA, NEGAVNS, and PSOVNS have been run on PCs with 2.6 GHz, 2.4 GHz, and 2.8 GHz clock speeds respectively. Also both RDIS and RAMP have been run on a 2.2 GHz CPU. Table 12 shows the maximum allowed running time, in seconds, for different approaches. As can be seen, NEGAVNS, RDIS, RAMP, and the P3H4 have been allowed an equal time limit and HGA_RMA has the lowest running time. It should be noted that SAOP, ILS, M-MMAS, and PACO have all been re-implemented in [57], and hence they are run on the same processor, and share the same maximum allowed running time, as that of HGA_RMA.

Table 12 Comparison of the maximum allowed running times of the P3H4 to that of other metaheuristics for the PFSP

Further, since NEGAVNS, RAMP, RDIS, and the P3H4 have identical maximum allowed running times, a Friedman test has been performed on their relative ranks, presented in Table 13. The interested reader is referred to [26] for different tests applied to metaheuristic performance. It should be noted that the ranks are calculated based on the average deviation from the best known solution, %DEVavg; in the case of tied %DEVavg values, the average of the tied ranks is used. As can be seen, whereas RAMP and the P3H4 have the lower ranks on smaller instances, NEGAVNS shows better performance on the larger instance groups. The Friedman test, performed on the ranked data set, shows a statistically significant difference in the ranks, with χ2(2) = 15.286 and p = 0.002.
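For reference, the Friedman statistic used here can be computed from the rank table directly; the following is a sketch without the tie correction, where ranks[b][j] is assumed to hold the rank of method j on instance group b:

```cpp
#include <vector>
#include <cmath>
#include <cassert>

// Friedman chi-square statistic: with N blocks (instance groups) and
// k methods, chi2 = 12/(N*k*(k+1)) * sum_j Rj^2 - 3*N*(k+1), where
// Rj is the rank sum of method j over all blocks.
double friedmanChi2(const std::vector<std::vector<double>>& ranks) {
    int N = static_cast<int>(ranks.size());
    int k = static_cast<int>(ranks[0].size());
    double sumSq = 0.0;
    for (int j = 0; j < k; ++j) {
        double Rj = 0.0;
        for (int b = 0; b < N; ++b) Rj += ranks[b][j];
        sumSq += Rj * Rj;
    }
    return 12.0 * sumSq / (N * k * (k + 1)) - 3.0 * N * (k + 1);
}
```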

Table 13 The ranks of NEGAVNS, RAMP, RDIS, and P3H4 based on average deviation from the best known solution

The performance of the procedure on the JSP has also been compared with that of several metaheuristics, as shown in Table 14. These methods are (i) the Small-Large Embedded Neighborhood Search (SLENP) of [7], (ii) the Tabu-based Genetic Algorithm (TGA) of [8], (iii) the Tabu Search and Simulated Annealing (TSSA) hybrid of [78], and (iv) the Iterative Ejections of Bottleneck Operations (IEBO) procedure of [45]. The column Tavg shows the average running time, in seconds, to find the best solution for each method. Both SLENP and TGA have been run on a PC with a 2.2 GHz CPU, TSSA on a 3.0 GHz CPU, and IEBO on a cluster with 2.93 GHz CPU cores.

Table 14 Comparison of the %DEVavg and Tavg of the P3H4 to that of other metaheuristics for the JSP

As shown in Table 14, in terms of deviation from the best known solution, the P3H4 outperforms both SLENP and TGA, with an overall %DEVavg of 0.56%, compared to the 1.11% and 1.28% deviations of SLENP and TGA, respectively. Moreover, on the orb instances, the P3H4 outperforms SLENP, TGA, and TSSA, with a %DEVavg of 0.00% over all instances. IEBO has the lowest %DEVavg on the swv and yn instances, at the expense of significantly higher running times compared to the three other methods.

With respect to the QAP, the performance of the P3H4 has been compared with that of the Progressive Adjusting Structural Solver (PASS) of [10], the Neighborhood Decomposition-based Search (NDS) of [11], the Self-Adaptive Facility Interchange (SAFI) of [77], the Diversified Tabu Search (DivTS) of [39], and the Cooperative Parallel Tabu Search (CPTS) of [38]. Since each study, except for CPTS, has reported performance only on a subset of the instances under study, it is difficult to draw any conclusion based on overall averages. However, it can be seen that the P3H4 outperforms PASS, NDS, SAFI, and ACO-GA/LS on the sko instances. Overall, the CPTS seems to be the highest-performing algorithm for the QAP to date. The CPTS uses a parallel approach and has been run on 10 CPUs, each with a 1.3 GHz clock speed. In Table 15, the average running time (Tavg) of every algorithm except the P3H4 is reported in minutes. However, since the P3H4 works based on a time-limit stopping criterion, its maximum allowed time, Tmax, is reported instead. As can be seen, while having an overall performance close to that of the CPTS, the P3H4 is significantly faster.

Table 15 Comparing the performance of the P3H4 with that of other metaheuristics with respect to %DEVavg and running time for the QAP

6 Concluding remarks

With respect to solving three hard-to-solve combinatorial optimization problems, the P3H has proved both robust and effective, achieving overall average best deviations (%DEVbest) of 0.35%, 0.25%, and 0.07% for the PFSP, JSP, and QAP, respectively. This is due to two complementary facts. First, the employed threads use the components in random order and thereby explore the solution space effectively. Second, the threads cooperate by improving a common solution and thereby exploit high quality areas of the solution space. It is this balance between exploration and exploitation which has significantly contributed to the robustness and efficiency of the P3H. It is worth noting that the success of the P3H in solving the flowshop scheduling problem could have multiple practical implications and benefits with respect to total energy consumption in production systems [13].

Directions for overcoming limitations and further enhancing the procedure are as follows. First, the parallel strategy employed can be enhanced by creating a pool of high quality solutions and giving all threads access to its elements; in this way, the threads fill and cooperatively improve the same pool. Second, a feedback mechanism can be incorporated into the procedure to take responsibility for further balancing exploration with exploitation. This balancing, which can be of paramount importance, can be achieved by embedding a learning capability in the procedure to adjust the parameters while a problem is being solved.