1 Introduction

In engineering practice, people often face problems with multiple objectives, which are called multi-objective problems (MOPs). MOPs with at least four objectives are informally known as many-objective problems (MaOPs) [1]. MaOPs frequently arise in real life, for example in water distribution system design [2], intrusion detection [3] and the green vehicle routing problem [4]. Multi-objective evolutionary algorithms (MOEAs) are usually used to solve MOPs and MaOPs, and they can be divided into decomposition-based [5,6,7,8], indicator-based [9, 10] and Pareto-based [11,12,13] algorithms. However, it should be pointed out that MOEAs confront three challenges when handling MaOPs, namely the dominance resistance (DR) phenomenon, the curse of dimensionality and the difficulty of visualization [1]. To address the first challenge efficiently, three kinds of methods have been introduced: modification of the Pareto-dominance relation, indicator-based approaches and enhanced diversity management.

The first method modifies the Pareto-dominance relation; examples include grid dominance [14], θ-dominance [15], fuzzy Pareto-dominance [16] and angle dominance [17]. Besides, Bao et al. [18] presented an augmented penalty boundary intersection (APBI) dominance relation to reach a balance between convergence and diversity. The second method replaces the Pareto-dominance relation with an indicator function that evaluates the quality of solutions. This indicator-based approach includes hypervolume-based MOEAs such as SMS-EMOA [19]. Recently, Li et al. [20] proposed a two-stage algorithm in which the R2 indicator was used for the first selection stage, with noticeable success. Moreover, the inverted generational distance has been used to evaluate the quality of solutions [21]. Even though these methods can deal with MOPs effectively, they still carry a high computational burden. The third approach is to enhance diversity management. For example, the NSGA-II [22] algorithm managed the activation and deactivation of the crowding distance to maintain diversity. Another example is the shift-based density estimation (SDE) strategy [23], which penalizes non-dominated solutions with poor convergence. In addition, Cai et al. [24] proposed a diversity indicator based on reference vectors to estimate diversity, and the Two-Archive algorithm 2 [25] improved convergence and diversity by adopting two separate archives. Further recent solutions can be found in [26,27,28,30].

As one of the Pareto-based algorithms, NSGA-III [31] has achieved great success in practical applications. It replaced the crowding-distance operator of NSGA-II with a clustering operator and used a set of well-distributed reference points to guarantee diversity. Although NSGA-III can achieve good diversity, its performance still needs to be improved by remedying deficiencies or expanding its applications, and many related studies have been developed. For example, not all reference points may be associated with a well-dispersed Pareto-optimal set during the search process of NSGA-III, and selecting reference points with a number of associated solutions may not cover all solutions uniformly over the entire Pareto front [32]. Therefore, Jain and Deb proposed A-NSGA-III [32] to overcome this drawback and extended it to constrained problems. θ-DEA [15], based on a new dominance relation, was presented to balance convergence and diversity in many-objective optimization. Moreover, Cai et al. [33] combined clustering with NSGA-III to build a clustering-ranking evolutionary algorithm (crEA). In NSGA-III-UE [34], the convergence and diversity of solutions were guaranteed by two separate archives, with good results. NSGA-III-SE [35] adopted selection and elimination operators to maintain convergence and diversity. Maha [36] integrated ISC-Pareto dominance into the C-NSGA-III algorithm to solve constrained optimization problems. In addition, a novel niche-preservation procedure was used in NSGA-III to alleviate the class-imbalance problem, giving higher classification accuracy for classes with fewer instances [3]. Yi et al. [37] introduced an improved NSGA-III algorithm with an adaptive mutation operator to deal with big-data optimization problems. Tavana [38] combined NSGA-III with MOPSO to design X-bar control charts. Furthermore, niche-elimination and worse-elimination operations were introduced into NSGA-III to improve convergence [39], and an elimination operator was presented to promote the convergence of the NSGA-III algorithm [40].

Although the above algorithms have been successful in some respects, further study is still needed. The NSGA-III algorithm still faces the problem that Pareto dominance cannot provide enough selection pressure. To solve this problem, mating selection and environmental selection (or the diversity maintenance mechanism) are two important factors to consider. For the former, the underlying idea is that individuals with better convergence should have a better chance of producing offspring. For the latter, since most individuals are non-dominated in the evolutionary process, it is necessary to weaken the original diversity information or to consider additional convergence information. Therefore, how to determine the convergence and diversity of individuals, and how to integrate convergence information into the diversity maintenance mechanism, are two key issues. This paper promotes the convergence of NSGA-III by modifying its environmental selection: we improve NSGA-III with a fine final level selection and propose the NSGA-III-FS algorithm. In the modified algorithm, non-dominated sorting is first used to divide the population during environmental selection. Next, a new dominance relation is utilized to classify the individuals in the critical layer. Then favor convergence and shift-based density estimation are exploited to assess individuals, and the individuals with good performance are finally selected. Compared with the improvements of NSGA-III in the existing literature, the algorithm in this paper offers the following innovations. Since most algorithms still use fast non-dominated sorting, critical-layer selection becomes very important. A more detailed and rigorous solution-selection process is adopted, and solutions are associated with reference points only in the critical layer, resulting in less computation. A more elaborate case-by-case selection process then ensures the quality of the solutions; choosing a suitable selection mechanism for each case from different aspects is not well studied by the algorithms developed above. Although the selection process may require more computation, combined with the reduced reference-point association, the overall algorithm does not incur too much extra computation.

The innovation and contribution of this paper are as follows:

  (1) The integration of θ-dominance, favor convergence and the density-estimation indicator ISDE forms a fine final level selection mechanism to optimize solutions. In the fine final level selection, the θ-dominance relation mainly subdivides the solutions in the critical layer; favor convergence and the ISDE indicator are used for selecting solutions when the number of candidate solutions falls short of or exceeds the demand. Solutions with good convergence are preserved by this careful selection process.

  (2) The fine final level selection mechanism is applied to the NSGA-III algorithm to improve convergence. A variant of NSGA-III is formed by combining fine final level selection with a different normalization process, and the effectiveness of the proposed algorithm is evaluated through multiple experiments.

The rest of this paper is organized as follows. Section 2 reviews the NSGA-III algorithm and the background knowledge used later. Section 3 describes the framework of the proposed algorithm together with its details. The experimental designs are given in Section 4. Experimental results and analysis on many-objective optimization are given in Section 5. Section 6 concludes this paper and describes future work.

2 Preliminary work

2.1 NSGA-III

Compared to NSGA-II, one major difference of NSGA-III is that a uniform set of reference points is used to maintain diversity. Figure 1 shows the detailed flow of NSGA-III. During the evolutionary process, NSGA-III first generates a random initial parent population Pt (size = N) and a set of uniform reference points. The offspring population Qt (size = N) is obtained through recombination and mutation. Next, the parent population Pt and the offspring population Qt are combined into a new population Rt (size = 2N). NSGA-III divides Rt into different non-dominated levels (F1, F2, ⋯, Fl) using the non-dominated sorting method. Then it collects individuals starting from level F1 and continues with F2, ⋯, until the size of the resulting set St is equal to or greater than N for the first time. If |St| = N, St directly serves as the parent population Pt+1 for the next iteration. Generally, however, Pt+1 accepts all individuals from levels F1, ⋯, Fl−1 and only some individuals from the critical layer (i.e., the Fl layer): the algorithm chooses the remaining K = N − |Pt+1| individuals from Fl. The following steps describe how the K individuals are chosen. (1) The objective values of the individuals in St are normalized. (2) Reference lines are defined by connecting the origin to the reference points on the hyperplane. (3) The perpendicular distances between the individuals in St and the reference lines are calculated. (4) Each individual in St is associated with a reference point according to the minimum perpendicular distance. (5) The niche count of each reference point (i.e., the number of individuals in St associated with it) is calculated. (6) K individuals are selected based on the calculated niche counts [31].
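To make steps (1)–(4) above concrete, the following minimal sketch (ours, not code from [31]) performs the association in vectorized NumPy; it assumes the objective vectors have already been normalized and that every reference line passes through the origin:

```python
import numpy as np

def associate(objs, ref_points):
    """Associate each (normalized) solution with its nearest reference line.

    objs:       (n, m) array of normalized objective vectors of St
    ref_points: (r, m) array of reference points on the hyperplane
    Returns the index of the associated reference point and the
    perpendicular distance for every solution.
    """
    # Reference lines pass through the origin, so the unit direction
    # of line j is ref_points[j] / ||ref_points[j]||.
    dirs = ref_points / np.linalg.norm(ref_points, axis=1, keepdims=True)
    # Projection length of every solution onto every line: shape (n, r).
    proj = objs @ dirs.T
    # Perpendicular distance via Pythagoras: ||f||^2 = proj^2 + d_perp^2.
    sq_norm = np.sum(objs ** 2, axis=1, keepdims=True)
    d_perp = np.sqrt(np.maximum(sq_norm - proj ** 2, 0.0))
    pi = np.argmin(d_perp, axis=1)  # associated reference point per solution
    return pi, d_perp[np.arange(len(objs)), pi]
```

The niche counts of step (5) are then simply the occurrence counts of the returned indices (e.g., np.bincount(pi, minlength=len(ref_points))).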

Fig. 1 Flow chart of the NSGA-III algorithm

2.2 θ-Dominance relation

In this section, several figures and basic definitions are used to describe the θ-dominance relation. Figure 2 shows the process of θ-dominance, and Figure 3 visualizes the distances dj,1(x) and dj,2(x), where dj,1(x) is the distance from the origin to the projection of solution x on the j-th reference line and dj,2(x) is the perpendicular distance from x to that line; the fitness used by θ-dominance is Fj(x) = dj,1(x) + θj·dj,2(x) [15]. The concepts of θ-dominance are defined as follows:

  Definition 1: For two solutions x, y ∈ St, x is said to θ-dominate y, denoted by x ≺θ y, iff x ∈ Cj, y ∈ Cj and Fj(x) < Fj(y).

  Definition 2: A solution x ∈ St is θ-optimal iff there is no other solution y ∈ St that θ-dominates x.

  Definition 3: All the θ-optimal solutions in St form the θ-optimal solution set (θ-OS), and the mapping of θ-OS to the objective space is the θ-optimal front (θ-OF).

Fig. 2 Process of θ-dominance

Fig. 3 Schematic diagram of dj,1(x) and dj,2(x)

Each reference direction is assigned a θ-value as follows: if only one component of the reference direction is greater than 10−4 (i.e., the direction lies along an objective axis), its θ-value is set to 106; otherwise, the θ-value is 5. The characteristics of the θ-dominance relation and more details are available in [15].
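As an illustration of this rule and of Fj(x) = dj,1(x) + θj·dj,2(x), here is a small sketch of our reading of [15]; the inputs are one normalized objective vector and the reference direction of its cluster:

```python
import numpy as np

def theta_fitness(f, lam, theta_axis=1e6, theta_default=5.0, eps=1e-4):
    """Compute F_j(x) = d_{j,1}(x) + theta_j * d_{j,2}(x) for one solution.

    f:   normalized objective vector of a solution x in cluster C_j
    lam: reference direction lambda_j of cluster C_j
    """
    unit = lam / np.linalg.norm(lam)
    d1 = float(f @ unit)                        # distance along the line
    d2 = float(np.linalg.norm(f - d1 * unit))   # perpendicular distance
    # Axis (boundary) directions get a huge theta so convergence dominates.
    theta = theta_axis if np.sum(lam > eps) == 1 else theta_default
    return d1 + theta * d2
```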

3 The proposed NSGA-III-FS algorithm

Algorithm 1 describes the main loop of NSGA-III-FS. The proposed algorithm differs from NSGA-III in Steps 6–7 of Algorithm 1, while the remaining steps are consistent with the original NSGA-III algorithm. The proposed algorithm is introduced step by step in the following.

Algorithm 1 Main loop of NSGA-III-FS

Step 1: Reference-point generation

In this paper, reference-point generation is divided into an inner layer and an outer layer. This generation method is the same as in the original algorithm, so the original NSGA-III paper [31] can be consulted for a detailed description.

Step 2: Initialization

Like most algorithms, a random population is produced in the first generation and is then optimized through effective strategies.

Step 3: Identify the ideal point and nadir point

The initial ideal point and nadir point are assigned the minimum and maximum objective values respectively, and these two points are updated in Step 6.

Step 4: Produce offspring

The offspring are generated by genetic operators, namely adaptive simulated binary crossover and polynomial mutation.

Step 5: Combine populations

The combined population consists of a parent population and its offspring, so its size becomes 2N.

Step 6: Update the ideal point and the nadir point

The essence of the normalization process is to resolve the inconsistency of objective scales. However, the normalization process of the original NSGA-III constructs a hyperplane in the transformed objective space, which does not fully reflect the entire Pareto front. Thus, the normalization process used in the proposed algorithm constructs a hyperplane in the original objective space, which is more reflective of the whole Pareto front.

The main difference between NSGA-III-FS and NSGA-III lies in the identification of the nadir point Znad during normalization. The ideal point Z* contains the minimum objective values found so far. The Znad is difficult to estimate because it must take into account the whole Pareto front. Suppose the individuals in a population S are to be normalized; the extreme point ej on the objective axis fj is found as the solution x ∈ S minimizing the achievement scalarizing function given in formula (1) [11].

$$ ASF\left(x,{W}_j\right)={\max}_{i=1}^m\left\{\frac{1}{W_{j,i}}\left|\frac{f_i(x)-{Z}_i^{\ast }}{Z_i^{nad}-{Z}_i^{\ast }}\right|\right\} $$
(1)

Where Wj = (Wj,1, Wj,2, ⋯, Wj,m)T is the axis direction of objective axis fj: in formula (1), Wj,i = 1 if i = j and Wj,i = 0 otherwise, and every Wj,i = 0 is then replaced by Wj,i = 10−6. The \( {Z}_i^{nad} \) used in formula (1) is the estimate from the previous generation, and the extreme point is ej = f(x). Finally, m extreme points {e1, e2, ⋯, em} are obtained after considering all m objective axes, and an m-dimensional hyperplane is constructed from these extreme points. a1, a2, ⋯, am represent the intercepts of the hyperplane with the directions \( {\left(1,{Z}_1^{\ast },\cdots, {Z}_m^{\ast}\right)}^T,{\left({Z}_1^{\ast },1,\cdots, {Z}_m^{\ast}\right)}^T,\cdots, {\left({Z}_1^{\ast },\cdots, {Z}_{m-1}^{\ast },1\right)}^T \). The \( {Z}_i^{nad} \) is then updated with the calculated intercept ai.
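The following sketch outlines this nadir-point update under the definitions above; the linear solve for the intercepts and the fallback for a degenerate hyperplane are standard choices of ours and not necessarily the paper's exact implementation:

```python
import numpy as np

def update_nadir(objs, z_star, z_nad_prev):
    """Estimate the nadir point via extreme points and hyperplane intercepts.

    objs:       (n, m) objective vectors of the population S
    z_star:     current ideal point, shape (m,)
    z_nad_prev: nadir point estimated in the previous generation, shape (m,)
    """
    m = objs.shape[1]
    # Axis-direction weights; zero entries replaced by 1e-6 as in Eq. (1).
    w = np.full((m, m), 1e-6)
    np.fill_diagonal(w, 1.0)
    # ASF(x, W_j) = max_i |(f_i(x) - z*_i) / (z^nad_i - z*_i)| / W_{j,i}
    norm = (objs - z_star) / np.maximum(z_nad_prev - z_star, 1e-12)
    asf = np.max(np.abs(norm)[:, None, :] / w[None, :, :], axis=2)  # (n, m)
    extremes = objs[np.argmin(asf, axis=0)]   # one extreme point per axis
    # Intercepts of the hyperplane through the extreme points:
    # solve (E - z*) b = 1, then intercept a_i = z*_i + 1 / b_i.
    try:
        b = np.linalg.solve(extremes - z_star, np.ones(m))
        intercepts = z_star + 1.0 / b
    except np.linalg.LinAlgError:
        intercepts = objs.max(axis=0)         # degenerate case: fall back
    return intercepts
```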

To give an intuitive understanding of the differences in the normalization processes of NSGA-III and NSGA-III-FS, in addition to the detailed description of Step 6 above, Fig. 4 compares the specific calculations and flow of NSGA-III and NSGA-III-FS, and Fig. 5 is a schematic diagram of the three-dimensional hyperplanes of the NSGA-III and NSGA-III-FS algorithms.

Fig. 4 The normalization of the NSGA-III and NSGA-III-FS algorithms

Fig. 5 3D hyperplane and intercept diagram of NSGA-III and NSGA-III-FS

Step 7: Environmental selection process

This paper proposes a fine final level selection mechanism for environmental selection. The design motivation of the new selection method is as follows. In NSGA-III, after the non-dominated sorting of the population Rt, each solution is first associated with the reference point whose reference line it is closest to; selection then mainly favors the reference points with the fewest associated solutions. This does not take full advantage of the information carried by individuals, while modifying the environmental selection is a way to improve convergence. Thus, this study designs a final level selection to improve the convergence of NSGA-III.

More details about the environmental selection of the proposed NSGA-III-FS algorithm can be found in Algorithm 2. In environmental selection, the non-dominated sorting method is first utilized to sort the population Rt. A population St is then constructed starting from F1 until the size of St is equal to or greater than N for the first time. If the size of St is equal to N, St is output as the next generation. Otherwise, the fine final level selection is performed in Steps 7-7 to 7-21 of Algorithm 2. The key steps of the fine final level selection are described in detail below. Its principle within environmental selection is as follows: to guarantee the quality of solutions, θ-non-dominated sorting is applied in the critical layer (the Fl layer). When the number of candidate solutions in the first θ-non-dominated layer does not meet the requirement, all individuals in the first θ-non-dominated layer are kept and some solutions with better favor convergence in the remaining θ-non-dominated layers are chosen. If the number of candidate solutions exceeds the requirement, the solutions with balanced performance in the first θ-non-dominated layer are selected. Figure 6 visually illustrates an example of solution selection under the fine final level selection.

Fig. 6 Example of the proposed fine final level selection mechanism. Suppose there are 13 solutions associated with 5 reference points in the critical layer. According to the vertical distance from each solution to a reference line (the line between a reference point and the origin), the solutions are divided into 5 clusters, namely C1, C2, C3, C4, C5 (Fig. 6a). The solutions with Rank 1 form the first θ-dominance layer (the green dots in the figures). If the number of individuals in the first layer is greater than the required number, the ISDE indicator is used to select solutions from the first layer (Fig. 6b); if it is less than the required number, all individuals in the first layer are retained and favor convergence is used to select solutions from the individuals not in the first layer (Fig. 6c).

Applying the θ-non-dominated sorting method in the critical layer takes advantage of the guidance of reference points. Then favor convergence fully considers the convergence information, and ISDE favors solutions with balanced properties. This makes the mechanism more likely to retain better solutions and to output satisfactory results.
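The branching logic just described can be summarized in the following sketch; theta_nondominated_sort, isde and favor_convergence are hypothetical helpers standing in for the procedures defined in Steps 7-9, 7-13 and 7-16 below:

```python
def fine_final_level_selection(critical_layer, K):
    """Pick K solutions from the critical layer F_l (a sketch; the helpers
    are assumed to implement Section 3's definitions: theta-layers,
    a per-solution I_SDE score and a per-solution FC score)."""
    fronts = theta_nondominated_sort(critical_layer)  # list of theta-layers
    first = fronts[0]
    if len(first) == K:
        return first
    if len(first) > K:
        # Too many candidates: keep the K with the largest I_SDE values.
        return sorted(first, key=isde, reverse=True)[:K]
    # Too few: keep all of the first theta-layer, then fill with the
    # best-converging solutions (smallest FC) from the remaining layers.
    rest = [s for front in fronts[1:] for s in front]
    rest.sort(key=favor_convergence)
    return first + rest[:K - len(first)]
```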

Algorithm 2 Environmental selection of NSGA-III-FS

Step 7-8: Normalization of population members

For a solution x, the normalized objective value \( {f}_i^{\prime }(x) \) is calculated by formula (2), where fi(x) is the objective value, \( {Z}_i^{\ast } \) is the ideal point, \( {Z}_i^{nad} \) is the nadir point as mentioned above, and m is the number of objectives [11].

$$ {f}_i^{\prime }(x)=\frac{f_i(x)-{Z}_i^{\ast }}{Z_i^{nad}-{Z}_i^{\ast }},i\in \left\{1,2,\cdots, m\right\} $$
(2)
Step 7-9: θ-non-dominated sorting

The principle of the θ-dominance relation has already been introduced in Section 2; the θ-non-dominated sorting method is based on this relation.

Step 7-13: ISDE indicator calculation

As is well known, shift-based density estimation (SDE) [23] is a successful strategy for alleviating the dominance resistance phenomenon in high-dimensional optimization for Pareto-based algorithms. Therefore, this advantage is incorporated into the process of choosing solutions. In order to reduce computation, a slight change is made to the original SDE strategy to form the indicator ISDE, which can be calculated by formulas (3)–(5) [9].

$$ {I}_{SDE}\left(x,y\right)=\sqrt{\sum \limits_{1\le i\le m} sd{\left({f}_i(x),{f}_i(y)\right)}^2} $$
(3)
$$ {I}_{SDE}(x)=\underset{y\in P,y\ precedes\ x}{\min}\left\{{I}_{SDE}\left(x,y\right)\right\} $$
(4)
$$ \mathrm{where}\kern1.25em sd\left({f}_i(x),{f}_i(y)\right)=\left\{\begin{array}{cc}{f}_i(y)-{f}_i(x), & \mathrm{if}\ {f}_i(x)<{f}_i(y)\\ {}0, & \mathrm{otherwise}\end{array}\right. $$
(5)

Where "y precedes x" means that the original position of y in the population P is smaller than that of x, and x and y are two different solutions; fi(x) and fi(y) represent the i-th objective values of x and y respectively. Solutions with larger ISDE values are preferentially selected.
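A compact sketch of formulas (3)–(5) follows (our vectorization); it assumes the rows of the objective matrix follow the population order of P, and it assigns +∞ to the first solution, which has no predecessor:

```python
import numpy as np

def isde(objs):
    """I_SDE of every solution, following Eqs. (3)-(5).

    objs: (n, m) normalized objective matrix, ordered as in the population P;
          solution y 'precedes' x when its row index is smaller.
    Returns an array where larger values indicate a less crowded solution.
    """
    n = objs.shape[0]
    vals = np.full(n, np.inf)      # the first solution has no predecessor
    for x in range(1, n):
        # Shifted distance to every preceding y: only objectives where
        # f_i(x) < f_i(y) contribute, per Eq. (5).
        sd = np.maximum(objs[:x] - objs[x], 0.0)
        vals[x] = np.sqrt((sd ** 2).sum(axis=1)).min()   # Eqs. (3)-(4)
    return vals
```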

Step 7-16: Favor convergence calculation

The favor convergence (FC) function is based on the Chebyshev function and preference weights, and is designed to measure the convergence of individuals. For an individual x, the favor convergence value is calculated by formula (6) [41].

$$ FC(x)=\underset{1\le j\le M}{\mathit{\max}}\left\{{w}_{x,j}\left|{f}_j(x)-{Z}_j^{\mathrm{min}}\right|\right\} $$
(6)

Where M is the number of objectives, \( {z}^{\mathrm{min}}=\left({z}_1^{\mathrm{min}},{z}_2^{\mathrm{min}},\cdots, {z}_M^{\mathrm{min}}\right) \) is the ideal point of the population, and wx = (wx,1, wx,2, ⋯, wx,M) is the favorable weight for x, calculated by formula (7) [41]. A smaller FC value means better convergence.

$$ {w}_{x,j}=\left\{\begin{array}{cc}0, & {f}_j(x)={z}_j^{\mathrm{min}}\\ {}1, & {f}_j(x)\ne {z}_j^{\mathrm{min}},\exists i:{f}_i(x)={z}_i^{\mathrm{min}}\\ {}\frac{1}{f_j(x)-{z}_j^{\mathrm{min}}}{\left[\sum \limits_{i=1}^M\frac{1}{f_i(x)-{z}_i^{\mathrm{min}}}\right]}^{-1}, & \mathrm{otherwise}\end{array}\right. $$
(7)
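The sketch below implements formulas (6)–(7) under our reading of the last branch of formula (7) as normalized reciprocal distances (which makes FC small near the ideal point, consistent with the text); the epsilon tolerance is an assumption of ours:

```python
import numpy as np

def favor_convergence(f, z_min, eps=1e-12):
    """FC(x) per Eqs. (6)-(7); smaller values mean better convergence.

    f:     objective vector of solution x
    z_min: ideal point of the population
    """
    diff = f - z_min
    if np.all(diff < eps):
        return 0.0                     # x coincides with the ideal point
    w = np.empty_like(diff)
    on_axis = diff < eps               # objectives where f_j(x) = z_j^min
    if on_axis.any():
        w[on_axis], w[~on_axis] = 0.0, 1.0   # first two branches of Eq. (7)
    else:
        # Normalized reciprocal distances (our reading of the last branch).
        w = (1.0 / diff) / np.sum(1.0 / diff)
    return float(np.max(w * np.abs(diff)))   # Eq. (6)
```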

4 Experimental designs

4.1 Test problems

The widely used Deb-Thiele-Laumanns-Zitzler (DTLZ) [42] and Walking-Fish-Group (WFG) [43] test suites are selected for the experiments. Table 1 lists the main characteristics of all benchmark functions, and Tables 2 and 3 list the mathematical formulas of the DTLZ and WFG test suites respectively.

Table 1 The main characteristics of all test problems
Table 2 The mathematical formulas [42] for the DTLZ1–4 test problems
Table 3 The mathematical formulas [43] for the WFG1–9 test problems

4.2 Comparing algorithms and parameter settings

NSGA-III-FS is compared with three types of algorithms: algorithms with reference points and no extra parameters, algorithms with reference points (or weight vectors) and extra parameters, and algorithms with neither reference points nor extra parameters, namely hpaEA [44], RPD-NSGAII [45], θ-DEA [15], NSGA-III [31], MOEA/D-DU [11], RVEA [46], 1by1EA [47], VaEA [48] and PDMOEAWS [13].

Table 4 lists the number of decision variables n and the maximum number of evaluations. Because problems with different numbers of objectives differ in difficulty, the evaluation budget differs as well: each budget corresponds to an objective number M, and the settings follow the NSGA-III paper. The same test functions are run with objective numbers M = {3, 5, 8, 10, 15}. Table 5 lists the numbers of reference points and the population sizes for all algorithms, which are kept the same as in [15]. Following [11, 46], the parameters used in the algorithms are listed in Table 6.

Table 4 The number of decision variables and evaluations
Table 5 The number of reference points (H) and population size (N) for all algorithms
Table 6 Setting of cross variation parameters and algorithm parameters

4.3 Performance metrics and experimental environment

In this study, the following widely used performance metrics are adopted.

  1) Inverted generational distance (IGD) [49] measures both the convergence and the diversity of a solution set; the smaller the IGD value, the better the algorithm.

  2) Hypervolume (HV) [50] is the volume of the region dominated by P and bounded by z, where P is the set of non-dominated points obtained in the objective space and z is a reference point; it evaluates both convergence and uniformity, and higher values represent better solutions.

  3) Generational distance (GD) [51] evaluates convergence by calculating the distance between the obtained Pareto front and the true Pareto front; smaller values represent better solutions. A minimal computation sketch of GD and IGD follows this list.
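As referenced in item 3), here is a minimal sketch of the usual Euclidean forms of GD and IGD; the exact definitions follow [49, 51] and may include additional normalization:

```python
import numpy as np

def gd(approx, true_pf):
    """Generational distance: mean distance from each obtained point
    to its nearest true Pareto-front point (smaller is better)."""
    d = np.linalg.norm(approx[:, None, :] - true_pf[None, :, :], axis=2)
    return d.min(axis=1).mean()

def igd(approx, true_pf):
    """Inverted GD: mean distance from each true-front reference point
    to its nearest obtained point (smaller is better)."""
    d = np.linalg.norm(true_pf[:, None, :] - approx[None, :, :], axis=2)
    return d.min(axis=1).mean()
```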

The experimental environment is a PC with a 1.8 GHz Intel(R) Core(TM) i7-8550U CPU, 16 GB RAM and a Windows 10 64-bit operating system. All the algorithms are implemented on version 2.5 of the PlatEMO [52] platform, a general multi-objective optimization tool. To compare the performance of the algorithms, the mean value and standard deviation of each indicator are obtained from 50 independent runs. The Wilcoxon rank-sum test [53] with a significance level of 0.05 is adopted for statistical analysis of the experimental results, where the symbols '+', '−' and '=' indicate that the result obtained by an algorithm is significantly better than, significantly worse than, or statistically similar to that obtained by the NSGA-III-FS algorithm respectively. The detailed procedure of the statistical test can be found in [45]. The best results among all algorithms are highlighted in bold.
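One possible way to reproduce the '+/−/=' labels is sketched below, assuming a smaller-is-better metric such as IGD collected over the 50 runs; the helper name and the use of medians to decide the direction are our assumptions:

```python
import numpy as np
from scipy.stats import ranksums

def compare(metric_a, metric_b, alpha=0.05):
    """Label algorithm A against B ('+', '-' or '=') from paired lists of
    independent-run results of a smaller-is-better metric (e.g., IGD)."""
    stat, p = ranksums(metric_a, metric_b)
    if p >= alpha:
        return '='          # no significant difference at level alpha
    return '+' if np.median(metric_a) < np.median(metric_b) else '-'
```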

5 Experimental results and analysis

5.1 Strategies validation and analysis

The method of updating the nadir point in the proposed algorithm differs from that of the original NSGA-III algorithm, which results in a different normalization. Furthermore, the environmental selection in the proposed algorithm differs from the original because the fine final level selection mechanism is introduced. The DTLZ1–4 and WFG1–9 problems are selected to compare these strategies between the original NSGA-III algorithm and the proposed NSGA-III-FS algorithm, and the IGD metric is used to evaluate the results.

Table 7 compares the IGD indicator for the original and modified normalization processes. It can be observed from Table 7 that the normalization process in the proposed algorithm obtains better results than the original algorithm on 47 of the 65 test cases, which indicates the effectiveness of the modified normalization process.

Table 7 Average and standard deviation of the IGD (in parentheses) values on DTLZ1–4, WFG1–9 test problems for the original and modified normalization processes

The performance of the fine final level selection is verified in Table 8. The original NSGA-III algorithm achieves the best IGD on 24 of the 65 test instances, while the proposed algorithm does so on 41. The overall results show that the modified environmental selection is advantageous.

Table 8 Average and standard deviation of the IGD (in parentheses) values on DTLZ1–4, WFG1–9 test problems for the original and modified environmental selection

5.2 Algorithm performance analysis

Tables 9, 10 and 11 show the IGD results of all algorithms on the DTLZ1–4 and WFG1–WFG9 problems. As can be seen from these tables, the overall performance of the VaEA and hpaEA algorithms is remarkable among the ten algorithms, while NSGA-III-FS comes third. To be more specific, DTLZ1 and DTLZ3 introduce a large number of local Pareto fronts, while DTLZ2 presents a spherical Pareto front. The hpaEA algorithm is the best when the number of objectives is 3–5. Furthermore, 1by1EA is better on DTLZ2, DTLZ3 and DTLZ4 with 10–15 objectives and on DTLZ1 with 15 objectives. NSGA-III-FS obtains the best value on DTLZ1 with 10 objectives and DTLZ2 with 8 objectives, and RVEA is superior to the other algorithms on the DTLZ1 and DTLZ3 problems with 8 objectives. In contrast, the VaEA algorithm is outstanding on the WFG2 problem with 8–15 objectives and the WFG4–WFG9 problems with 10–15 objectives. In comparison, hpaEA shows excellent performance on 3–5 objectives of the WFG5 and WFG7–WFG9 problems, and PDMOEAWS performs well on all objective numbers of the WFG3 problem. The proposed NSGA-III-FS algorithm is best suited to the WFG test problems with 3 or 8 objectives.

Table 9 Average and standard deviation of the IGD (in parentheses) values on DTLZ1-DTLZ4 test problems with different number of objectives
Table 10 Average and standard deviation of the IGD (in parentheses) values on WFG1-WFG5 test problems with different number of objectives
Table 11 Average and standard deviation of the IGD (in parentheses) values on WFG6-WFG9 test problems with different number of objectives

Tables 12, 13 and 14 show the HV values for the comparison of algorithm performance on the DTLZ1–4 and WFG1–WFG9 problems. The MOEA/D-DU algorithm demonstrates superiority over the other algorithms on the DTLZ1–4 problems. The PF of the WFG1 problem has a flat bias and a mixed structure: the capability of NSGA-III-FS is satisfactory with 5 objectives, while the MOEA/D-DU algorithm is better than the rest with 8 and 10 objectives. WFG2 tests the ability of algorithms to handle a disconnected Pareto front; the MOEA/D-DU algorithm exceeds the other algorithms with 5–10 objectives, NSGA-III performs well on 15 objectives, and NSGA-III-FS surpasses the other algorithms on 3 objectives. The WFG3 problem is difficult to solve because its PF is degenerate and its decision variables are non-separable; PDMOEAWS obtains higher HV values on 3–8 objectives, while NSGA-III-FS performs better than the other algorithms with 10–15 objectives. RPD-NSGA-II is more advantageous than the other algorithms on 5–15 objectives of the WFG4 problem. For the WFG6 and WFG7 problems, the difficulty is that the decision space is multimodal, deceptive, non-separable, and so on; NSGA-III-FS is superior to the other algorithms in most cases, which shows that NSGA-III-FS has better capability on these problems according to the HV indicator. The hpaEA and RPD-NSGA-II algorithms are better suited to the WFG8 problem, while PDMOEAWS is better on the WFG9 problem with 5–10 objectives.

Table 12 Average and standard deviation (in parentheses) of the HV values on DTLZ1-DTLZ4 test problems with different number of objectives
Table 13 Average and standard deviation (in parentheses) of the HV values on WFG1-WFG5 test problems with different number of objectives
Table 14 Average and standard deviation (in parentheses) of the HV values on WFG6-WFG9 test problems with different number of objectives

Since we aim to improve the convergence of the NSGA-III algorithm, a convergence indicator (GD) is used to evaluate algorithm performance. Tables 15, 16 and 17 give the convergence evaluation results of the ten algorithms on the DTLZ1–4 and WFG1–WFG9 problems. The hpaEA algorithm achieves smaller values on the 5-objective DTLZ1 and DTLZ3 problems as well as on 3–5 objectives of the DTLZ4 problem. MOEA/D-DU obtains the best results on 3–8 objectives of DTLZ1 and on the 3-, 8- and 15-objective DTLZ3 problem. NSGA-III-FS is better than the other algorithms on the 15-objective DTLZ2 and DTLZ4 problems. RVEA demonstrates superiority on the DTLZ1 problem with 10–15 objectives, and PDMOEAWS shows good convergence on the DTLZ2 problem with 5–8 objectives. 1by1EA performs well on the WFG2 problem with 5–15 objectives and on 5–8 objectives of the WFG1 problem. Besides, NSGA-III-FS obtains the best convergence on all objective numbers of the WFG3 test problem. The PDMOEAWS algorithm is superior to the other algorithms on 3 and 8 objectives of the WFG4–WFG7 and WFG9 problems, and also performs well on 8 objectives of the WFG8 problem. NSGA-III-FS obtains unmatched convergence performance on the WFG5 and WFG6 problems with 10–15 objectives and on the WFG7 problem with 15 objectives. In contrast, RVEA is more likely to obtain better convergence on the WFG9 problem.

Table 15 Average and standard deviation (in parentheses) of the GD values on DTLZ1-DTLZ4 test problems with different number of objectives
Table 16 Average and standard deviation (in parentheses) of the GD values on WFG1-WFG5 test problems with different number of objectives
Table 17 Average and standard deviation (in parentheses) of the GD values on WFG6-WFG9 test problems with different number of objectives

The statistical analysis results comparing the other algorithms with the proposed NSGA-III-FS algorithm are shown in Table 18. For the IGD index, VaEA performs best on 18 of the 65 test instances, hpaEA obtains 15 best results, and NSGA-III-FS gets 8 best values. Judging from the HV results, the NSGA-III-FS and MOEA/D-DU algorithms perform best. The statistical analysis indicates that 1by1EA and VaEA are worse than NSGA-III-FS on up to 58 test instances in terms of HV, and that hpaEA, PDMOEAWS, RPD-NSGA-II, NSGA-III, RVEA, MOEA/D-DU and θ-DEA have 46, 43, 39, 39, 34, 34 and 15 worse HV values respectively compared with NSGA-III-FS; these data highlight the excellence of NSGA-III-FS on the HV indicator. As can be observed, NSGA-III-FS is more competitive than the remaining algorithms, achieving the best values on 16 of the 65 test instances for the GD indicator. This shows that the convergence performance of NSGA-III-FS is predominant and testifies that the proposed algorithm improves convergence. In short, the statistical analysis results show the superiority of the NSGA-III-FS algorithm.

Table 18 The statistic results of algorithms on 3 indicators by testing on DTLZ1-DTLZ4, WFG1-WFG9 problems

To explain the results more directly, figures are drawn for DTLZ1 with 3 objectives and WFG7 with 8 objectives. The reasons for choosing these two representatives are as follows. From Table 1 we can see that the DTLZ1 and WFG3 problems share the linear characteristic; concavity is a common feature of the DTLZ2–DTLZ4 and WFG4–WFG9 problems; scaling is common to the WFG1–WFG9 problems; and the DTLZ4, WFG1 and WFG5–WFG9 problems share the biased feature. The characteristic of the DTLZ1 problem is linear, and the characteristics of the WFG7 problem are concave, biased and scaled. Through the above analysis, DTLZ1 and WFG7 can represent these problems respectively, so the results of these two examples are shown.

Figure 7 shows the non-dominated solutions obtained by each algorithm on DTLZ1 with 3 objectives. The sparse solutions obtained by 1by1EA show poor distribution, and their range is very far from the true Pareto front, which indicates very poor convergence. The solutions obtained by the RPD-NSGA-II, VaEA and PDMOEAWS algorithms are very aggregated, which shows that their distribution is bad. The effectiveness of hpaEA, RVEA, θ-DEA, MOEA/D-DU, NSGA-III and NSGA-III-FS is almost equivalent; however, the hpaEA algorithm has more solutions at the corners.

Fig. 7 The parallel coordinates of the non-dominated front obtained by each algorithm on 3-objective DTLZ1 in the run associated with the median IGD value

Figure 8 shows the results of each algorithm on the 8-objective WFG7 problem. The objective ranges of all algorithms are consistent. Parallel coordinates, which map an M-dimensional front onto a 2D graph, cannot fully reflect the coverage of the solution sets. The hpaEA result might be uniform because its curves look dense. RPD-NSGA-II, θ-DEA, NSGA-III, NSGA-III-FS and PDMOEAWS may have good convergence because their curves are flat compared with the other algorithms. The convergence indicator (GD values) shows that PDMOEAWS converges best and that NSGA-III-FS has the best comprehensive performance (IGD and HV), but these differences can hardly be told from the graph; conclusions can only be drawn when the gaps in the graphs are particularly obvious. The difficulty of visualization has always been a major challenge for many-objective optimization.

Fig. 8 The parallel coordinates of the non-dominated front obtained by each algorithm on 8-objective WFG7 in the run associated with the median IGD value

To show the performance of the proposed algorithm from multiple aspects, the CPU time of all algorithms is shown in Tables 21, 22 and 23 of Appendix 1. RPD-NSGA-II takes the shortest time on almost all objective numbers of the DTLZ1–DTLZ3 problems, while the RVEA algorithm has the shortest running time on almost all test instances of the DTLZ4 and WFG1–WFG9 problems; this demonstrates that RVEA has a great time advantage among all the algorithms, although its evaluation results are not superior. Generally speaking, solution quality is pursued first, and solution speed is improved as far as possible under the premise of ensuring quality. On the 65 test cases, the proposed algorithm takes slightly longer than NSGA-III; except for the WFG3 problem, where the time difference between NSGA-III and NSGA-III-FS is large, the difference is comparatively small on the other problems. Although the runtime of NSGA-III-FS is not the shortest among all algorithms, it is not bad: for example, hpaEA, MOEA/D-DU, 1by1EA, VaEA and PDMOEAWS spend more time on most test cases than the proposed NSGA-III-FS algorithm. Combined with the previously obtained indicator results, the time cost of NSGA-III-FS is within an acceptable range among all the compared algorithms, which proves that NSGA-III-FS has certain advantages.

5.3 Comparison of existing modified NSGA-III algorithms

To further verify the performance of the proposed algorithm, it is compared with an existing improved NSGA-III algorithm. Bi et al. [40] proposed the NSGA-III-EO algorithm, which uses an elimination operator, a penalty distance and the idea of eliminating inferior solutions to improve the convergence of NSGA-III. After study and analysis, the NSGA-III-EO algorithm is chosen for comparison with the NSGA-III-FS algorithm proposed in this paper. Since both NSGA-III-EO and the proposed NSGA-III-FS aim to improve the convergence of NSGA-III, the GD index, which evaluates convergence, and a comprehensive performance index (IGD) are used to assess the two algorithms.

Tables 19 and 20 show the GD and IGD values of the NSGA-III-EO and NSGA-III-FS algorithms respectively. For GD, NSGA-III-EO obtains 18 best values on the 65 test cases, whereas NSGA-III-FS obtains 47. Comparing the comprehensive performance of the two algorithms, NSGA-III-EO and NSGA-III-FS obtain 20 and 45 best IGD values respectively. The results show that the NSGA-III-FS algorithm proposed in this paper is competitive.

Table 19 Average and standard deviation (in parentheses) of the GD values obtained by NSGA-III-EO and NSGA-III-FS algorithms
Table 20 Average and standard deviation (in parentheses) of the IGD values obtained by NSGA-III-EO and NSGA-III-FS algorithms

6 Conclusion and future work

This paper introduces a modified version of the Non-dominated Sorting Genetic Algorithm III that alters the environmental selection to improve convergence for many-objective optimization. A fine final level selection is implemented in the proposed NSGA-III-FS algorithm. Experiments are conducted on the DTLZ1–4 and WFG1–9 benchmark problems with the number of objectives varying from 3 to 15, and the experimental results show the superiority of the proposed algorithm over the selected comparison algorithms.

Based on the evaluation indicator results on the problem sets, the proposed algorithm is competitive in the IGD, HV and GD indexes compared with the nine selected algorithms. For the IGD index, the proposed algorithm is comparable to hpaEA, RPD-NSGA-II, RVEA, θ-DEA, MOEA/D-DU, VaEA, 1by1EA, NSGA-III and PDMOEAWS in 56.9%, 84.6%, 47.7%, 15.4%, 76.9%, 56.9%, 81.5%, 55.4% and 73.8% of the 65 cases respectively. For the HV results, the proposed algorithm improves on the nine compared algorithms in 70.8%, 60.0%, 52.3%, 23.1%, 52.3%, 89.2%, 89.2%, 60.0% and 66.2% of the 65 cases respectively. For the GD indicator, the proposed algorithm is better than hpaEA, RPD-NSGA-II, RVEA, θ-DEA, MOEA/D-DU, VaEA, 1by1EA, NSGA-III and PDMOEAWS in 56.9%, 69.2%, 61.5%, 38.5%, 52.3%, 89.2%, 73.8%, 63.1% and 66.2% of the 65 cases respectively. Consequently, we are more convinced that the proposed method can be very helpful against dominance resistance.

In the future, we will focus on the computational speed of the algorithm while preserving its good performance, for example by designing an effective way to generate new individuals. In addition, we expect to extend the algorithm to practical problems, whose solution is also challenging work.