Abstract
While the handling of optimization variables directly expressed by numbers (continuous, discrete, or integer) is abundantly investigated in the literature, the use of nominal variables is generally overlooked, despite its practical interest in plenty of scientific and industrial applications. For example, in civil engineering, the designers of a structure made out of beams might have to select the best cross-section shapes among a list of available geometries (square, circular, rectangular, etc.), which can be modeled by nominal data. Therefore, in the context of single- and multi-objective evolutionary optimization for mixed variables, this study investigates three genetic encodings (binary, real, and real-simplex) for the representation of mixed variables involving both continuous and nominal parameters. The comparison of the genotypes combined with the instances of crossover is performed on six analytical benchmark test functions, as well as on the multi-objective design optimization of a six-storey rigid frame, showing that for mixed variables, real (and to a lesser extent: real-simplex) coding provides the best results, especially when combined with a uniform crossover.
Access provided by Autonomous University of Puebla. Download chapter PDF
Similar content being viewed by others
Keywords
1 Introduction
Real-life engineering applications are often characterized by data of versatile natures. Formally, design variables in parameterization/optimization can be divided into five classes:
-
continuous variables are defined over an interval \(I_c \subseteq \mathbb {R}\) (e.g. length, curvature radius);
-
discrete variables are continuous variables available only within a finite set \(I_d = \{d_1,\dots ,d_n\}\) where all \(d_i \in \mathbb {R}\) (e.g. cross-section areas from a catalog of beam profiles). It is interesting to note that gradient-based optimizers can be adapted to discrete variables, as developed for instance by [3] through dual formulations and subgradient-based algorithms;
-
integer variables are defined over an interval \(I_i \subseteq \mathbb {N}\) or \(\mathbb {Z}\) (e.g. number of holes in a plate). In engineering problems, they differ from discrete variables by the fact that no intermediate values between two integer variables can be defined (e.g. a plate can contain two or three holes, but not \(2.5\)), which has an impact both on the simulation and optimization sides. Binary variables are a particular case of integer variables;
-
ordinal categorical variables, or simply ordinal variables, take their values (called attributes) among non-numerical values endowed with a ranking, as in the set {‘tiny’, ‘small’, ‘medium-sized’, ‘large’, ‘huge’};
-
nominal categorical variables, or simply nominal variables, are non-numerical variables characterized by no explicit ordering, as the shape of a beam profile: , the choice of a material: {‘steel’, ‘aluminum’, ‘titanium’}, etc.
According to the nature of the variables, distinct fields of optimization have been developed; they can be roughly classified as continuous versus combinatorial optimization. Interestingly, due to their flexibility in data representation, evolutionary optimization algorithms are ideally suited when both types of data are involved. Additionally, since the categorical variables are characterized by non-numerical entries, they represent a challenging issue for the parameterization—and eventually for the optimization—since they require a careful investigation of the conversion procedure onto a chromosome. The optimization algorithms proposed in the literature for mixed variables have been summarized by the authors in [8], confirming that bio-inspired algorithms are an efficient option for single- and multi-objective optimization, as illustrated for example by [12]. However, a systematic examination of data structures for mixed variables is still missing.
Based on all these considerations, the goal of this paper is to investigate three genotypes (binary, real, and real-simplex) in combination with three types of crossover (line, uniform, and two-site) in order to extract information about the mutual interaction between encoding and genetic operators.
The paper is organized as follows: first, a systematic representation of mixed variables is introduced in Sect. 20.2. Then, the genetic operators used in the evolutionary algorithms are described in Sect. 20.3. Afterwards, six analytical test cases including continuous and nominal variables are thoroughly studied (Sect. 20.4), followed by the multi-objective design optimization of a rigid frame (Sect. 20.5). Finally, the conclusions and future prospects are discussed in Sect. 20.6.
2 Data Structures for Mixed Variables
Historically, the initial conversion of design variables to genetic encoding has been done through binary coding [4, 9], as illustrated in Fig. 20.1. After conversion, the converted binary strings for all variables can seamlessly be concatenated in a chromosome, regardless of the various natures of the actual variables.
Another popular and straightforward encoding consists in modeling all variables through real values, as shown in Fig. 20.2.
While these operations are straightforward for numerical values (not only continuous, but also discrete and integer), they require an arbitrary mapping for nominal variables, as shown in Table 20.1.
Indeed, since there is no intrinsic ordering between attributes, both mappings mentioned in Table 20.1 are legitimate, but they might not exhibit the same behavior in the genetic algorithm. Therefore, another conversion is proposed here for nominal values, constraining the attributes of a nominal variable to be equidistant. This constraint can be ensured by assigning to each of the \(n\) attributes the coordinates of the vertex of a regular simplex in a \((n-1)\)-space, as depicted in Fig. 20.3. Largely used in learning and classification theory [1] with categorical data, this approach was first introduced by the authors for approximation purposes [7].
To summarize, three encodings are analyzed in this paper:
-
1.
binary;
-
2.
real;
-
3.
real-simplex, i.e. conversion to real numbers for continuous, discrete, integer, and ordered categorical variables, and regular simplex mapping for the nominal parameters.
Since the relation between coding and crossover is critical in genetic algorithms, the next section lists the types of crossovers investigated in this study.
3 Genetic Operators for Mixed Variables
In all three genotypes proposed here above, the conversion finally leads to a real-valued vector (the chromosome), on which three popular types of crossover can be applied [2]:
-
1.
line crossover (LCX): for each pair of parents (\(p1\), \(p2\)) selected among the population of the previous generation, the offspring are created as follows:
$$\begin{aligned} \left\{ \begin{array}{lll} b_i^{(c1)} &{}= &{}b_i^{(p1)} + \alpha _1 \left( b_i^{(p2)} - b_i^{(p1)} \right) \\ ~ \\ b_i^{(c2)} &{}= &{}b_i^{(p1)} + \alpha _2 \left( b_i^{(p2)} - b_i^{(p1)} \right) \end{array} \right. \end{aligned}$$(20.1)where \(b_i\) are the components of the chromosome (a bit or a real value depending on the coding), \(p1\), \(p2\) refer to the parents, \(c1\), \(c2\) to the children, and \(\alpha _1\), \(\alpha _2\) are random numbers uniformly sampled in \([0,1]\), and constant for the whole chromosome. Geometrically speaking, the children are generated on the line joining the parents (in a space depending on the data structure, viz. the type of coding);
-
2.
uniform crossover (UCX): each child \(c\) is generated as follows:
$$\begin{aligned} b_i^{(c)} = \left\{ \begin{array}{ll} b_i^{(p1)} &{}\mathrm{{if}}~\alpha < 0.5 \\ ~ \\ b_i^{(p2)} &{}\mathrm{{if}}~\alpha \ge 0.5 \end{array} \right. \end{aligned}$$(20.2)where \(\alpha \) is sampled independently for each component of the chromosome;
-
3.
two-site crossover (TSX): the children are generated by swapping pieces of parental chromosomes between two sites randomly chosen within the chromosome.
After the crossover operation, if values are found that do not correspond to available values (depending on the coding and/or crossover implemented, this problem can happen for discrete, integer, or categorical variables), they are repaired in the chromosome by replacing the wrong value by the closest existing value. The remainder of the paper is devoted to an empirical analysis of the interaction between data structure and crossover on several test cases. Nevertheless, the following conditions can already be devised, and serve eventually as guidelines to analyze the results obtained:
-
Condition of exploration: the crossover should be able to produce children different from their parents.
-
Condition of invariance: the children generated from crossover have to be independent from the ranking of the attribute values.
4 Analytical Benchmark Test Cases
The first six test functions aim at showing the relation between genotypes and types of crossover for real, nominal, and mixed (real-nominal) variables. Their analytical expression is detailed in Table 20.2. For each test function, three situations are examined: (1) ten real variables, no nominal variables (\(nz=10\), \(nc=0\)), (2) ten nominal variables, no real variables (\(nz=0\), \(nc=10\)), and (3) five real variables, five nominal variables (\(nz=5\), \(nc=5\)).
Since the goal is to compare the interactions between the type of crossover and the data structure (coding), the probability of mutation is set to \(0\) (no mutation). Moreover, a tournament selection is used; the population size is set to \(100\), and the number of generations to \(50\). \(10\) independent runs are performed for each configuration.
The results are collected in Tables 20.3, 20.4 and 20.5, showing the normalized error between the worst and best solutions (averaged over the ten independent runs). In other words, on each line, \(0\) corresponds to the best solution found (in average), while \(1\) refers to the worst.
In the six analytical test functions, it appears that the conversion to real numbers is the most effective for the cases with either continuous or nominal variables only. As intuition would have suggested, the line crossover allows for a better exploration of the design space for continuous variables, while the uniform crossover is more effective with nominal variables, where the swapping of variable values is performed independently and randomly for each variable. The binary coding with line crossover provides the worst results, which could be expected since the linear combination of parental chromosomes is not a meaningful operation for values converted into binary digits. Finally, the real-simplex coding generally provides reasonably good results in comparison with the real coding.
5 Application: Structural Design of a Rigid Frame
To analyze the efficiency of the proposed algorithmic instances on a structural design example, a 3D rigid frame is investigated [11]. The quantities of interest are the mass and the compliance, the latter being post-processed from a finite element linear analysis with beam elements [6]. The loads are derived from Eurocode 3 [10], and consist in:
-
the dead load of the beams and columns;
-
the gravity load on the floors (19.16 kPa);
-
the lateral load due to the wind (110 kN).
The beams or columns are classified in five groups of common cross-sections (as depicted in Fig. 20.4):
-
group 1: \(\{4, 5, 9, 10, 14, 15, 18, 21, 24, 28, 29, 33, 34, 38, 39, 42, 45, 48\}\);
-
group 2: \(\{49, 51, 52, 54, 55, 57, 58, 60, 62\}\);
-
group 3: \(\{50, 53, 56, 59, 61, 63\}\);
-
group 4: \(\{1, 2, 3, 6, 7, 8, 11, 12, 13, 25, 26, 27, 30, 31, 32, 35, 36, 37\}\);
-
group 5: \(\{16, 17, 19, 20, 22, 23, 40, 41, 43, 44, 46, 47\}\).
Ten design variables are necessary to parameterize a given structure:
-
for each of the five groups of profiles, a categorical variable \(c\) defines the cross-section geometry among seven attributes: ;
-
for all groups of profile, one continuous bounded variable defines the maximum length \(l\) of the cross-section (either height or diameter) with 0.09 m \(\le \) l 0.11 m, and with a fixed thickness (when applicable) set to 0.0025 m. For the rectangular cross-section, the width is defined as half of the height; for the —section, the width is equal to the height.
The geometry of the cross-section is typically a nominal variable, since no ordering of the available cross-section types can be made a priori. The choice of the cross-section has a direct impact on the calculation of the quantities (area, moments of inertia) necessary to get the normal efforts, shear forces, and bending moments.
The multi-objective implementation of the genetic algorithm is based on the second version of the Nondominated Sorting Genetic Algorithm, or NSGA-II [5]. The corresponding Pareto fronts obtained for the nine possible combinations of coding and type of crossover are depicted in Fig. 20.5.
In this problem, the uniform and two-site crossovers clearly outperfom the line crossover to obtain dense and nondominated Pareto fronts. In particular, the uniform crossover provides comparable results for all codings.
In terms of designs obtained, a close examination of the Pareto set reveals that the optimal cross-sections are mostly circular or I-shaped , and square to a lesser extent. The tubular shapes constitute the best compromise between lightness and stiffness.
6 Conclusions and Future Prospects
In this paper, three genotypes, along with three types of crossover, are analyzed in order to extract their properties in the modeling of mixed (continuous \(+\) nominal) variables. For the analytical test functions, the real coding furnishes the best results in almost all configurations (real, nominal, or mixed variables), while the real-simplex performs fairly, and the binary coding poorly. The type of crossover also has a significant influence on the results, and this aspect is mostly visible in the multi-objective design optimization of a rigid frame, where the uniform crossover consisting in randomly swapping genes in the parental chromosome is the most effective one to find dense and widely distributed Pareto fronts.
Future studies are guided by the need for better accounting for the relations between attributes of nominal variables. In the simplex mapping representation of nominal variables, the hypothesis that all attributes are equidistant is acceptable when no information is known a priori about their correlations with the output quantities of interest (objectives and constraints). However, when knowledge about the physics is available, a non-regular simplex might be preferable, with distinct pairwise distances between attributes. For instance, the square and circular cross-section shapes of beam profiles might exhibit a closer behavior than the I-shaped profiles, which might be taken into account directly in the coding. This crucial step (under investigation) requires a clustering phase to identify these pairwise relationships to be performed before the optimization process (see Fig. 20.6), by using physical insight from the input-output model.
References
Agresti A (1996) An introduction to categorical data analysis. Wiley, New York
Bäck T, Fogel DB, Michalewicz Z (1997) Handbook of evolutionary computation. Oxford University Press, New York
Beckers M (2000) Dual methods for discrete structural optimization problems. Int J Numer Meth Eng 48:1761–1784
De Jong KA (ed) (2006) Evolutionary computation: a unified approach. MIT Press, Massachusetts
Deb K, Pratap A, Agarwal S, Meyarivan T (2002) A fast and elitist multiobjective genetic algorithm: NSGA-II. IEEE Trans Evol Comput 6(2):182–197
Ferreira AJM (2009) MATLAB codes for finite element analysis: Solids and structures., solid mechanics and its applicationsSpringer Science+Business Media B.V., The Netherlands
Filomeno Coelho R (2012) Extending moving least squares to mixed variables for metamodel-assisted optimization. In: 6th European congress on computational methods in applied sciences and engineering (ECCOMAS 2012), Vienna, Austria, 10–14 September
Filomeno Coelho R (2013) Metamodels for mixed variables based on moving least squares-application to the structural analysis of a rigid frame. Optim Eng
Goldberg DE (1989) Genetic algorithms in search, optimization and machine learning. Addison-Wesley Longman, New York
Papadrakakis M, Lagaros ND, Plevris V (2005) Design optimization of steel structures considering uncertainties. Eng Struct 27:1408–1418
Richard Liew JY, Chen H, Shanmugam NE, Chen WF (2000) Improved nonlinear plastic hinge analysis of space frame structures. Eng Struct 22(10):1324–1338
Tang X, Bassir DH, Zhang W (2011) Shape, sizing optimization and material selection based on mixed variables and genetic algorithm. Optim Eng 12:111–128
Acknowledgments
This work has been supported by Innoviris (Brussels-Capital Region, Belgium) through a BB2B project entitled “Multicriteria optimization with uncertainty quantification applied to the building industry”. The authors also acknowledge support by the Basic Project Foundation of Northwestern Polytechnical University (JC20120241), and by the National Natural Science Fundation of China (Grants No.11302173 and No.51275424).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2015 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Filomeno Coelho, R., Xiao, M., Guglielmetti, A., Herrera, M., Zhang, W. (2015). Investigation of Three Genotypes for Mixed Variable Evolutionary Optimization. In: Greiner, D., Galván, B., Périaux, J., Gauger, N., Giannakoglou, K., Winter, G. (eds) Advances in Evolutionary and Deterministic Methods for Design, Optimization and Control in Engineering and Sciences. Computational Methods in Applied Sciences, vol 36. Springer, Cham. https://doi.org/10.1007/978-3-319-11541-2_20
Download citation
DOI: https://doi.org/10.1007/978-3-319-11541-2_20
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-11540-5
Online ISBN: 978-3-319-11541-2
eBook Packages: EngineeringEngineering (R0)