1 Introduction

Evolutionary algorithms are successfully used for solving optimization problems of different types. Their limitation stems from the fact that they are controlled by a special set of parameters. Some of these parameters can be successfully set exogenously, based on the philosophy of the algorithm; however, there is no deeper theoretical basis for adjusting certain parameters (e.g. the parameters determining the rate of stochasticity), although their (im)proper setting can radically affect the quality of the obtained results.

Based on various tests one can conclude that SOMA is even more sensitive to the parameter settings than other algorithms [3]. The control parameters are usually set on the basis of experimental results [3, 5]. Some of the control parameters are given directly by the nature of the problem and can be changed only by its reformulation. An example of such a parameter is the dimensionality (Dim). The setting of other parameters can be derived from a simple geometric interpretation of SOMA. Such a parameter is PathLength, whose recommended setting is 3–5. Parameters PopSize and Migrations determine “the size and length” of the simulation, and their setting can follow the philosophy “more is better” (however, increasing these parameters increases the time needed for the calculation and is thus dependent on the user’s hardware). The parameter MinDiv can be set e.g. to a negative value if it is desired to carry out all iterations, or to a positive number if one wants to watch the convergence of the calculation. Parameters Step and PRT are also responsible for the quality of the results. This chapter is devoted to some statistical methods that may be helpful in clarifying their settings. To adjust the control parameters it can be suitable, before the final calculation, to carry out several simulations with e.g. a smaller population size and a lower number of iterations (which are not time consuming) and with different values of the other control parameters. Further on, besides basic descriptive statistics (e.g. average, mode, median), which provide an initial idea of the parameter settings, various statistical methods can also be used, e.g. single and multiple-factor analysis of variance.

The chapter is organized as follows. The first part is devoted to the theoretical description of some statistical methods. The second part gives an illustrative example of setting the control parameters.

2 Single and Multiple-Factor Analysis of Variance—Theory

Analysis of variance (ANOVA) is a technique which enables one to identify whether there is any difference between groups on some variable (the so-called factor). When two or more groups are being compared, the characteristic that distinguishes the groups from one another is called the factor under investigation. Considering evolutionary techniques, an experiment might be carried out to compare different values of the control parameters of an algorithm from the perspective of the obtained fitness of the best individual (usually the value of the objective function).

Further on, the following notation will be used: a population is the set of all observations of interest and a sample is any subset of observations selected from the population. Let N be the total number of observations in the data set. Consider k levels of the factor under investigation and a sample for each factor level, so that the sample size at the jth factor level, j = 1, 2, …, k, is designated as n_j, \( \sum\nolimits_{j = 1}^{k} {n_{j} } = N \). Then the ith observation at the jth factor level can be designated as x_ij, j = 1, 2, …, k, i = 1, 2, …, n_j. Whether the null hypothesis of a single-factor analysis of variance should be rejected depends on how substantially the samples from the different populations differ from one another. Let µ_j, j = 1, 2, …, k, be the mean of the population at the corresponding factor level.

A single-factor analysis of variance problem involves a comparison of all k group means. The objective is to test the null hypothesis (H_0):

$$ H_{0} {:}\,\mu_{1} = \mu_{2} = \cdots = \mu_{k} $$
(1)

against the alternative hypothesis (H_a):

$$ H_{a} {:}\;{\text{at least two of the }} \mu_{j}{\text{'s}},\quad j = 1, 2, \ldots ,k, {\text{ are different}} $$
(2)

A measure of disparity among the sample means is the between-group sum of squares, denoted by SSB and given by

$$ SSB = \sum\limits_{j = 1}^{k} {n_{j} } \left( {\bar{x}_{j} - \bar{\bar{x}}} \right)^{2} $$
(3)

where \( \bar{x}_{j} \) is the sample mean of the jth group and \( \bar{\bar{x}} \) is the overall mean (the sum of all observations divided by the total number of observations in the data set). SSB has associated degrees of freedom df_1 = k − 1.

A measure of variation within the k samples, called error sum of squares and denoted by SSE, is given by

$$ SSE = \sum\limits_{j = 1}^{k} {\left( {n_{j} - 1} \right)s_{j}^{2} } $$
(4)

where \( s_{j}^{2} \) is the sample variance of the jth group. SSE has associated degrees of freedom df_2 = N − k.

Total sum of squares, denoted by SST, is given by

$$ SST = \sum\limits_{j = 1}^{k} {\sum\limits_{i = 1}^{{n_{j} }} {\left( {x_{ij} - \bar{\bar{x}}} \right)^{2} } } $$
(5)

with associated degrees of freedom df = N − 1.

The relationship between these three sums of squares is called the fundamental identity; for the single-factor analysis of variance it is SST = SSB + SSE.

A mean square is a sum of squares divided by its degree of freedom. In particular:

  • between-group mean square: \( MSB = \frac{SSB}{k - 1} \)

  • within-group mean square: \( MSE = \frac{SSE}{N - k} \).

The test statistic (F) of the single-factor analysis of variance follows the Fisher (F) distribution with df_1 = k − 1 and df_2 = N − k degrees of freedom and is given by the formula \( F = \frac{MSB}{MSE} \).
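To make the computations concrete, the following minimal sketch (in Python, using NumPy and SciPy) evaluates formulas (3)–(5), the mean squares and the F ratio, and cross-checks the result against SciPy's built-in one-way ANOVA. The sample data are purely hypothetical and serve only to illustrate the layout.

```python
import numpy as np
from scipy import stats

# Hypothetical fc values for k = 3 levels of a control parameter (illustration only)
groups = [
    np.array([10.2, 11.1, 10.8, 10.5]),
    np.array([12.0, 11.7, 12.3, 11.9]),
    np.array([10.9, 11.4, 11.0, 11.2]),
]

k = len(groups)
N = sum(len(g) for g in groups)
grand_mean = np.concatenate(groups).mean()                          # overall mean

ssb = sum(len(g) * (g.mean() - grand_mean) ** 2 for g in groups)    # formula (3)
sse = sum((len(g) - 1) * g.var(ddof=1) for g in groups)             # formula (4)
sst = sum(((g - grand_mean) ** 2).sum() for g in groups)            # formula (5)
assert np.isclose(sst, ssb + sse)              # fundamental identity SST = SSB + SSE

msb = ssb / (k - 1)                            # between-group mean square
mse = sse / (N - k)                            # within-group mean square
f_ratio = msb / mse
p_value = stats.f.sf(f_ratio, k - 1, N - k)    # P-value of the F ratio

f_check, p_check = stats.f_oneway(*groups)     # cross-check with SciPy
print(f_ratio, p_value, f_check, p_check)
```

If the printed P-value is below 0.05, the null hypothesis (1) would be rejected at the 5 % significance level.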

The validity of the analysis of variance test requires some assumptions. Peck et al. [4] present the following:

  1. Each of the k group or population distributions is normal.

  2. The k normal distributions have identical standard deviations.

  3. The observations in the sample from any particular one of the k groups or populations are independent of one another.

  4. When comparing group or population means, k random samples are selected independently of one another.

The statistical significance of the F ratio is most easily judged by its P-value. If the P-value is less than 0.05, the null hypothesis of equal means is rejected at the 5 % significance level. This does not imply that every mean is significantly different from every other mean. It only implies that the means are not all the same.

All the sums of squares, degrees of freedom, mean squares and F ratio with its P-value are entered in a general format of an analysis of variance table (Table 1).

Table 1 General format for an analysis of variance table

Peck, Olsen and Devore also claim that, in practice, the test based on these assumptions works well as long as the assumptions are not too badly violated. If the sample sizes are reasonably large, normal probability plots or boxplots of the data in each sample are helpful in checking the assumption of normality. Often, however, the sample sizes are too small for such plots to be very informative. The authors suggest that the F test can safely be used if the largest of the sample standard deviations is at most twice the smallest one.

When the null hypothesis is rejected by the F test, it can be stated that there are differences among the k group or population means. Several procedures, called multiple-comparison procedures, exist to determine which sample means are significantly different from the others. Dowdy et al. [2] discuss five different approaches: Fisher’s least significant difference, Duncan’s new multiple-range test, Student–Newman–Keuls’ procedure, Tukey’s honestly significant difference and Scheffé’s method.

Next, Fisher’s least significant difference (LSD) procedure will be presented. Fisher’s LSD procedure can be based on the t test statistic used for the two-population case. Equivalently, and more conveniently, one can determine how large the difference between the sample means must be in order to reject the null hypothesis.

In this case, following Anderson et al. [1], the test statistic is the difference \( \bar{x}_{j} - \bar{x}_{l} \), where j, l = 1, 2, …, k, j ≠ l, and the objective is to test the null hypothesis (H_0):

$$ H_{0} {:}\,\mu_{j} = \mu_{l} ;\quad j,l = 1,2, \ldots k; \, j \ne l $$
(6)

against the alternative hypothesis (H_a):

$$ H_{a} {:}\, \mu_{j} \ne \mu_{l} ;\quad \, j,l = 1,2, \ldots k; \, \;\;j \ne l $$
(7)

The null hypothesis should be rejected if \( \left| {\bar{x}_{j} - \bar{x}_{l} } \right| \ge LSD \), where the least significant difference is given by

$$ LSD = t_{\alpha /2} \sqrt {MSE\left( {\frac{1}{{n_{j} }} + \frac{1}{{n_{l} }}} \right)} ;\quad \, j,l = 1,2, \ldots k; \, \;\;j \ne l $$
(8)

where α denotes the significance level and t_{α/2} denotes the critical value of Student’s t distribution with N − k degrees of freedom.
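The pairwise comparison by formula (8) can be sketched as follows (Python with NumPy and SciPy; the groups and the significance level are hypothetical, chosen only for illustration):

```python
import numpy as np
from itertools import combinations
from scipy import stats

# Hypothetical samples for k = 3 groups (illustration only)
groups = {
    "level 1": np.array([10.2, 11.1, 10.8, 10.5]),
    "level 2": np.array([12.0, 11.7, 12.3, 11.9]),
    "level 3": np.array([10.9, 11.4, 11.0, 11.2]),
}

k = len(groups)
N = sum(len(g) for g in groups.values())
# Within-group mean square MSE, as in the single-factor ANOVA
mse = sum((len(g) - 1) * g.var(ddof=1) for g in groups.values()) / (N - k)

alpha = 0.05
t_crit = stats.t.ppf(1 - alpha / 2, N - k)        # t_{alpha/2} with N - k degrees of freedom

for (name_j, g_j), (name_l, g_l) in combinations(groups.items(), 2):
    lsd = t_crit * np.sqrt(mse * (1 / len(g_j) + 1 / len(g_l)))   # formula (8)
    diff = abs(g_j.mean() - g_l.mean())
    significant = diff >= lsd                      # rejection rule |mean_j - mean_l| >= LSD
    print(name_j, "vs", name_l, round(diff, 3), round(lsd, 3), significant)
```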

Dowdy, Weardon and Chilko recall that Fisher’s test has a drawback; it requires that the null hypothesis be rejected in the analysis of variance procedure by the F test. These authors also discuss the presented assumptions. First, the normality of the treatment groups can be roughly checked by constructing histograms of the sample from each group. According to them, the analysis of variance leads to valid conclusions even in some cases where there are departures from normality. For small sample sizes the treatment groups should be symmetric and unimodal. For large samples, more radical departures are acceptable due to the central limit theorem. Dowdy, Weardon and Chilko assume that the conditions on independence are usually satisfied if the experimental units are randomly chosen and randomly assigned to the treatments. If the treatment groups already exist, the experimenter does not have the opportunity to assign the subjects at random to the treatments; in such cases random samples from each treatment group are used.

The last of the assumptions underlying the analysis of variance is that the variances of the populations from which the samples come are the same. Dowdy, Weardon and Chilko state that the F tests are robust with respect to departures from homogeneity; that is, moderate departures from equality of variances do not greatly affect the F statistic. If the experimenter fears a large departure from homogeneity, several procedures are available to test the equality of variances.

The F_max test was developed by Hartley. Hartley’s test may be used when all treatment groups have the same size n; it compares the largest sample variance with the smallest sample variance. The null hypothesis (H_0) of the test (where σ_j² is the population variance of the jth group, j = 1, 2, …, k) is:

$$ H_{0} {:}\,\sigma_{1}^{2} = \sigma_{2}^{2} = \cdots = \sigma_{k}^{2} $$
(9)

against the alternative hypothesis (H_a):

$$ H_{a} {:}\;{\text{at least two of the }} \sigma_{j}^{2}{\text{'s}},\quad j = 1, 2, \ldots ,k, {\text{ are different}} $$
(10)

when each of the k populations is normal and there is a random sample of size n from each population. Then the sample variances \( s_{j}^{2} \), j = 1, 2, …, k, can be computed and one can calculate

$$ F_{\hbox{max} } = \frac{{\hbox{max} \left\{ {s_{j}^{2} ,j = 1,2, \ldots k} \right\}}}{{\hbox{min} \left\{ {s_{j}^{2} ,j = 1,2, \ldots k} \right\}}} $$
(11)

The statistic F_max is significant if it exceeds the value given in Fisher’s table with degrees of freedom df_1 = k and df_2 = n − 1. Dowdy, Weardon and Chilko state that, because of the sensitivity of Hartley’s test to departures from normality, a significant F_max indicates either unequal variances or a lack of normality.

Two other commonly used tests of homogeneity of variances are those of Cochran and Bartlett. In most situations, Cochran’s test is equivalent to Hartley’s. Cochran’s test compares the maximum within-sample variance to the average within-sample variance. After computing the sample variances \( s_{j}^{2} \), j = 1, 2, …, k, one calculates

$$ C = \frac{{\hbox{max} \left\{ {s_{j}^{2} ,\,j = 1,2, \ldots k} \right\}}}{{\sum\nolimits_{j = 1}^{k} {s_{j}^{2} } }} $$
(12)

and the statistic C is significant if the value

$$ A = \left( {k - 1} \right)\frac{C}{1 - C} $$
(13)

exceeds the value given in Fisher’s table with degrees of freedom \( df_{1} = \frac{n}{k} - 1 \) and \( df_{2} = \left( {\frac{n}{k} - 1} \right)\left( {k - 1} \right) \).

Bartlett’s test has a more complicated test statistic but has two advantages over the other two: it can be applied to groups of unequal sample sizes, and it is more powerful. Bartlett’s test compares a weighted average of the within-sample variances to their geometric mean. The test statistic is

$$ B = \frac{1}{D}\left[ {\left( {\sum\limits_{j = 1}^{k} {\left( {n_{j} - 1} \right)} } \right)\ln \left( {\frac{1}{{\sum\nolimits_{j = 1}^{k} {\left( {n_{j} - 1} \right)} }}\sum\limits_{j = 1}^{k} {\left( {n_{j} - 1} \right)} s_{j}^{2} } \right) - \sum\limits_{j = 1}^{k} {\left( {n_{j} - 1} \right)} \ln \left( {s_{j}^{2} } \right)} \right] $$
(14)

where

$$ D = 1 + \frac{1}{{3\left( {k - 1} \right)}}\left[ {\sum\limits_{j = 1}^{k} {\left( {\frac{1}{{n_{j} - 1}}} \right) - \frac{1}{{\sum\nolimits_{j = 1}^{k} {\left( {n_{j} - 1} \right)} }}} } \right] $$
(15)

The statistic B is significant if it exceeds the critical value of the chi-squared distribution with (k − 1) degrees of freedom.

The last presented test of homogeneity of variances is Levene’s test. This test performs a one-way analysis of variance on the variables \( z_{ij} = \left| {x_{ij} - \bar{x}_{j} } \right| \), j = 1, 2, …, k, where \( \bar{x}_{j} \) is either the mean or the median of the jth group. First, the variables z_ij are computed; then the F statistic of the single-factor analysis of variance is obtained for these variables. Levene’s statistic is significant if it exceeds the value given in Fisher’s table with degrees of freedom df_1 = k − 1 and df_2 = N − k.
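A short sketch of these homogeneity-of-variance checks follows: Hartley’s F_max and Cochran’s C are computed directly from (11) and (12), while Bartlett’s and Levene’s tests are taken from SciPy. The samples are again hypothetical.

```python
import numpy as np
from scipy import stats

# Hypothetical equal-sized samples: k = 3 groups with n = 5 observations each
groups = [
    np.array([10.2, 11.1, 10.8, 10.5, 10.9]),
    np.array([12.0, 11.7, 12.3, 11.9, 12.4]),
    np.array([10.9, 11.4, 11.0, 11.2, 11.6]),
]
variances = [g.var(ddof=1) for g in groups]       # sample variances s_j^2

f_max = max(variances) / min(variances)           # Hartley's statistic, formula (11)
c = max(variances) / sum(variances)               # Cochran's statistic, formula (12)

# Bartlett's test (14)-(15) and Levene's test are available directly in SciPy;
# center="median" gives the median-based (more robust) variant of Levene's test.
bartlett_stat, bartlett_p = stats.bartlett(*groups)
levene_stat, levene_p = stats.levene(*groups, center="median")

print(f_max, c)
print(bartlett_stat, bartlett_p)
print(levene_stat, levene_p)
```

The statistics F_max and C still have to be compared with the tabulated critical values described above, whereas SciPy reports P-values for Bartlett’s and Levene’s tests directly.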

Some statistical software also presents the results of a set of two-sample F tests that compare the standard deviations for each pair of levels. This makes sense only if the initial overall test shows significant differences amongst the variances (and standard deviations). Any pair with a small P value would be a pair whose standard deviations were significantly different.

An alternative to the standard analysis of variance that compares level medians instead of means is the Kruskal-Wallis test. This test is much less sensitive to the presence of outliers than a standard one-way analysis of variance and should be used whenever the assumption of normality within levels is not reasonable. The procedure below follows Dowdy, Weardon and Chilko [2].

First, it is necessary to rank the data from 1 (the smallest observation) to N (the largest observation), irrespective of the group in which they are found. If two or more observations are tied at the same numerical value, the average of the ranks for which they are tied is assigned. Then, when all treatment groups have the same size n, the average rank of each group, denoted by \( \bar{r}_{j} \), is computed. Finally, the test statistic is:

$$ H = n\frac{{\left[ {\sum\nolimits_{j = 1}^{k} {\left( {\bar{r}_{j} - \frac{N + 1}{2}} \right)^{2} } } \right]}}{{\frac{{N\left( {N + 1} \right)}}{12}}} $$
(16)

The null hypothesis (H_0) of the test is

$$ H_{0}{:}\,E\left( {\bar{r}_{j} } \right) = \frac{N + 1}{2}\quad {\text{for all }}j $$
(17)

against the alternative hypothesis (H_a):

$$ H_{a}{:}\,E\left( {\bar{r}_{j} } \right) \ne \frac{N + 1}{2} \, \quad {\text{for some }}j $$
(18)

where \( E\left( {\bar{r}_{j} } \right) \) denotes the expected value of \( \bar{r}_{j} \).

The statistic H is significant if it exceeds the critical value of the chi-squared distribution with (k − 1) degrees of freedom; a significant value indicates that there are significant differences amongst the level medians.
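The rank-based statistic (16) can be sketched as follows; the equal-sized samples are hypothetical and contain no ties, so the result can be cross-checked against SciPy's kruskal, which implements the general form of the statistic.

```python
import numpy as np
from scipy import stats

# Hypothetical equal-sized samples: k = 3 groups of n = 4 observations (no ties)
groups = [
    np.array([10.2, 11.1, 10.8, 10.5]),
    np.array([12.0, 11.7, 12.3, 11.9]),
    np.array([10.9, 11.4, 11.0, 11.2]),
]

n = len(groups[0])
N = sum(len(g) for g in groups)

ranks = stats.rankdata(np.concatenate(groups))            # joint ranking, average ranks for ties
group_ranks = np.split(ranks, np.cumsum([len(g) for g in groups])[:-1])
r_bar = [r.mean() for r in group_ranks]                   # average rank of each group

h = n * sum((rb - (N + 1) / 2) ** 2 for rb in r_bar) / (N * (N + 1) / 12)   # formula (16)
p_value = stats.chi2.sf(h, df=len(groups) - 1)            # chi-squared with k - 1 df

h_check, p_check = stats.kruskal(*groups)                 # SciPy's version (with tie correction)
print(h, p_value, h_check, p_check)
```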

However, in some experiments it is desirable to draw conclusions about more than one variable or factor. The term factorial is used because the experimental conditions include all possible combinations of the factors. For example, for a levels of factor A and b levels of factor B, the experiment involves collecting data on ab combinations. In experimental design terminology, a sample size of r for each combination of groups means that there are r replications, so abr observations are needed. Additional replications (2r, 3r) and a larger sample size make the statistical conclusions more precise.

This situation brings a new effect, the interaction effect. If the interaction effect has a significant impact, it can be concluded that the effect of factor A depends on the level of factor B. There are three sets of hypotheses in the two-way ANOVA.

At first, the objective is to test the null hypothesis comparing the a group means µ_Ai, i = 1, 2, …, a, for different values of factor A (H_0):

$$ H_{0} {:}\,\mu_{A1} = \mu_{A2} = \cdots = \mu_{Aa} $$
(19)

against the alternative hypothesis (H_a):

$$ H_{a} {:}\;{\text{at least two of the }} \mu_{Ai}{\text{'s}},\quad i = 1, 2, \ldots ,a, {\text{ are different}} $$
(20)

and also to compare the b group means µ_Bj, j = 1, 2, …, b, for different values of factor B:

$$ H_{0}{:}\,\mu_{B1} = \mu_{B2} = \cdots = \mu_{Bb} $$
(21)

against the alternative hypothesis (H_a):

$$ H_{a}{:}\;{\text{at least two of the }} \mu_{Bj}{\text{'s}},\quad j = 1, 2, \ldots ,b, {\text{ are different}} $$
(22)

The second objective is the comparison of the ab group means µ_AiBj, i = 1, 2, …, a, j = 1, 2, …, b, for the different combinations of the values of A and B:

$$ H_{0}{:}\,\mu_{A1B1} = \mu_{A1B2} = \cdots = \mu_{AaBb} $$
(23)

against the alternative hypothesis (H_a):

$$ H_{a}{:}\;{\text{at least two of the }} \mu_{AiBj}{\text{'s}},\quad i = 1, 2, \ldots ,a,\; j = 1, 2, \ldots ,b, {\text{ are different}} $$
(24)

The analysis of variance procedure for the two-factor factorial experiment requires us to partition the total sum of squares into sum of squares for factor A, sum of squares for factor B, sum of squares for interaction and sum of squares due to error.

Sum of squares for factor A is denoted by SSA and given by

$$ SSA = br\sum\limits_{i = 1}^{a} {\left( {\bar{x}_{i} - \bar{\bar{x}}} \right)^{2} } $$
(25)

where b is the number of levels of factor B, a is the number of levels of factor A, \( \bar{x}_{i} \) is the sample mean for the ith level of factor A, \( \bar{\bar{x}} \) is the overall mean and r is the number of replications.

Sum of squares for factor B is denoted by SSB and given by

$$ SSB = ar\sum\limits_{j = 1}^{b} {\left( {\bar{x}_{j} - \bar{\bar{x}}} \right)^{2} } $$
(26)

Sum of squares for interaction is denoted by SSAB and given by

$$ SSAB = r\sum\limits_{i = 1}^{a} {\sum\limits_{j = 1}^{b} {\left( {\bar{x}_{ij} - \bar{x}_{i} - \bar{x}_{j} + \bar{\bar{x}}} \right)^{2} } } $$
(27)

where \( \bar{x}_{ij} \) is the sample mean for the observations corresponding to the combination of level i of factor A and level j of factor B, \( \bar{x}_{i} \) is the sample mean for the ith level of factor A and \( \bar{x}_{j} \) is the sample mean for the jth level of factor B.

Error sum of squares (SSE) and total sum of squares (SST) are given by the same relations as in the case of single-factor analysis of variance (4) and (5). All the sums of squares, degrees of freedom, mean squares and F ratios with their P-values are presented in the analysis of variance table (Table 2).

Table 2 General format for an analysis of variance table
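The decomposition (25)–(27) for a balanced two-factor design can be sketched with pandas as follows. The data frame below is synthetic and only illustrates the layout (one row per observation, columns for the two factors and a response y); the factor levels and the response model are invented for the example.

```python
import numpy as np
import pandas as pd
from scipy import stats

# Synthetic balanced design: a x b factor-level combinations, r replications each
rng = np.random.default_rng(1)
levels_a, levels_b, r = [1, 2, 3, 4], [1, 2, 3, 4], 8
df = pd.DataFrame(
    [{"A": ai, "B": bj, "y": rng.normal(100 + 2 * bj, 2)}
     for ai in levels_a for bj in levels_b for _ in range(r)]
)

a, b = len(levels_a), len(levels_b)
grand = df["y"].mean()
mean_a = df.groupby("A")["y"].mean()                      # level means of factor A
mean_b = df.groupby("B")["y"].mean()                      # level means of factor B
mean_ab = df.groupby(["A", "B"])["y"].mean()              # cell means

ssa = b * r * ((mean_a - grand) ** 2).sum()               # formula (25)
ssb = a * r * ((mean_b - grand) ** 2).sum()               # formula (26)
ssab = r * sum((mean_ab[(i, j)] - mean_a[i] - mean_b[j] + grand) ** 2
               for i in levels_a for j in levels_b)       # formula (27)
sst = ((df["y"] - grand) ** 2).sum()
sse = sst - ssa - ssb - ssab                              # error sum of squares

df_err = a * b * (r - 1)                                  # error degrees of freedom
for name, ss, dfree in [("A", ssa, a - 1), ("B", ssb, b - 1),
                        ("AB", ssab, (a - 1) * (b - 1))]:
    f_ratio = (ss / dfree) / (sse / df_err)
    print(name, f_ratio, stats.f.sf(f_ratio, dfree, df_err))
```

The same table is also produced by standard statistical software, for example by passing an ordinary least squares model with an interaction term to an ANOVA routine; that is how tables such as Table 17 below are typically obtained.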

3 Parameters Setting of SOMA

Next, the setting of the control parameters of SOMA will be presented, based on an illustrative example of solving the traveling salesman problem. Consider the matrix of shortest distances between eight cities (Table 3).

Table 3 Matrix of shortest distances between eight cities

The traveling salesman needs to find the shortest route through all the cities so that each city is visited exactly once. The problem was solved using the natural representation (a city is represented directly by its index in an individual). A simple penalty approach was used when infeasible solutions appeared. Some of the control and termination parameters were set as follows: parameter PopSize was set to 80 and parameter Migrations was set to 300. The parameter MinDiv was set to a negative value so that all iterations are carried out (the small size of the instance makes it possible to run all iterations in a relatively short time). The parameter PathLength was set to the value 3. The settings of parameters Step and PRT were determined on the basis of the above-mentioned statistical methods. Let the value of the shortest route (denoted as fc) be the response variable. Further on, we can specify the impact of the factors’ levels (the levels of parameters Step and PRT) on the variability of the response variable.

Parameter PRT can take values from 0 (purely stochastic behavior of the algorithm) to 1 (purely deterministic behavior). At first, the levels of parameter PRT were set to 0.2, 0.4, 0.6 and 0.8. Parameter Step can take values from 0.1 up to the value of parameter PathLength, which equals 3. From previous simulations it was found that values of Step greater than 1 increased the probability of obtaining an extremely “bad” outcome. Therefore, the values 0.3, 0.5, 0.7 and 0.9 were used as the levels of parameter Step in testing.
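The replicated design described above could be organized, for instance, as in the following sketch. The function run_soma and its signature are hypothetical placeholders for an actual SOMA implementation and are not part of the original text; only the parameter grid and the number of replications follow the experiment described here.

```python
import itertools
import random
import pandas as pd

def run_soma(prt, step, pop_size=80, migrations=300, path_length=3):
    """Hypothetical placeholder: replace with a call to a real SOMA implementation.
    It should return the length of the best route found (fc)."""
    return 200 + random.random() * 50 * (1 - prt)   # dummy value, for illustration only

prt_levels = [0.2, 0.4, 0.6, 0.8]
step_levels = [0.3, 0.5, 0.7, 0.9]
replications = 8

records = []
for prt, step in itertools.product(prt_levels, step_levels):
    for rep in range(replications):                 # 4 x 4 x 8 = 128 simulations
        fc = run_soma(prt=prt, step=step)
        records.append({"PRT": prt, "Step": step, "rep": rep, "fc": fc})

results = pd.DataFrame(records)                     # one row per simulation
print(results.groupby("PRT")["fc"].describe())      # descriptive statistics per PRT level
```

The resulting data frame can be grouped by PRT, by Step, or by both, which is exactly the layout required by the tests described in Sect. 2.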

The first tested hypothesis is the comparison of 4 group means \( \overline{fc}_{1} ,\overline{fc}_{2} ,\overline{fc}_{3} ,\overline{fc}_{4} \) for different values of parameter PRT (0.2, 0.4, 0.6 and 0.8) according to (1) and (2). The experiment is balanced: for each PRT-Step pair the same number of simulations, eight replications, was realized. In the first phase a total of 128 simulations were thus implemented (Table 4).

Table 4 Results of simulations

The summary of the descriptive statistics for every value of parameter PRT is given in Table 5.

Table 5 Summary statistics—data grouped by PRT

There is a big difference between the smallest and the largest standard deviation. This may cause problems, since the analysis of variance assumes that the standard deviations at all levels are equal. The results are also presented by the box and whisker plot (Fig. 1).

Fig. 1
figure 1

Box and whisker plot—data grouped by PRT

Some significant non-normality is evident in the data, which violates the assumption that the data come from normal distributions. One may wish to transform the values of fc to remove any dependence of the standard deviation on the mean. The analysis of variance decomposes the variance of fc into two components: a between-group component and a within-group component (Table 6).

Table 6 Analysis of variance table—data grouped by PRT

The F ratio, which equals 33.0788, is the ratio of the between-group estimate to the within-group estimate. Since the P-value of the F test is less than 0.05, there is a statistically significant difference amongst the mean fc values from one level of PRT to another at the 5 % significance level. Then, Fisher’s least significant difference procedure was used to determine which means are significantly different from which others (Table 7).

Table 7 Comparison procedure of Fisher’s least significant difference—data grouped by PRT

Now, one can see a significant difference between the group of simulations where PRT equals 0.2 and the other groups. It can also be stated that there is a large departure from homogeneity, so all the tests of equality of variances are used (Table 8).

Table 8 Tests of homogeneity of variances—data grouped by PRT

The statistics displayed in Table 8, and also the P-values, show that there is a statistically significant difference amongst the standard deviations of the groups. This violates one of the important assumptions underlying the analysis of variance.

The comparison of the standard deviations for each pair of samples is given in Table 9. All P-Values below 0.05 indicate statistically significant differences between standard deviations of every pair of groups.

Table 9 Comparison of the standard deviations for each pair of groups—data grouped by PRT

The situation is clear: statistically different values of averages and standard deviations were obtained for the groups of fc values corresponding to different values of parameter PRT. Due to the failure of the assumptions, the results of the analysis of variance cannot be taken into account. Finally, despite all the previous conclusions, the decision is to use the Kruskal-Wallis test as an alternative to the standard analysis of variance, comparing the medians instead of the means (Table 10).

Table 10 Kruskal-Wallis test—data grouped by PRT

The null hypothesis of the Kruskal-Wallis test is that the medians of fc within each of the four levels of PRT are the same (17). Since the P-value is less than 0.05, there is a statistically significant difference amongst the medians.

It is evident (from the results of the tests and also from the box and whisker plot) that the median of the group where PRT equals 0.2 is significantly different from the others. It seems that the values 0.6 or 0.8 for the parameter PRT are the appropriate choice. The latter value is preferred in order to eliminate possible outliers.

The second tested hypothesis is the comparison of 4 group means \( \overline{fc}_{1} ,\,\overline{fc}_{2} ,\,\overline{fc}_{3} ,\,\overline{fc}_{4}\) for different values of parameter Step (0.3, 0.5, 0.7 and 0.9) according to (1) and (2).

The summary of the descriptive statistics for every value of Step is given in Table 11.

Table 11 Summary statistics—data grouped by Step

In this case the difference between the smallest and the largest standard deviation is not as big as in the previous case. From the box and whisker plot (Fig. 2), some significant non-normality is again seen in the data, which violates the assumption that the data come from normal distributions. Recall that the normal distribution is symmetric, with the median lying in the middle of the box bounded by the first and the third quartile. This is not the case here, because in three groups the median coincides with the smallest value, and the remaining group contains a typical outlier.

Fig. 2
figure 2

Box and whisker plot—data grouped by Step

The analysis of variance decomposes the variance of fc once again into two components: a between-group component and a within-group component, but now the data are grouped by parameter Step (Table 12).

Table 12 Analysis of variance table—data grouped by Step

The F ratio, which equals 0.0414313, is the ratio of the between-group estimate to the within-group estimate. Since the P-value of the F test is greater than 0.05, there is not a statistically significant difference amongst the mean fc values from one level of Step to another at the 5 % significance level. Fisher’s test requires that the null hypothesis be rejected in the analysis of variance procedure by the F test, which is not the case; nevertheless, its results are shown (Table 13).

Table 13 Comparison procedure of Fisher’s least significant difference—data grouped by Step

Evidently, there is not a significant difference in means between groups.

All the statistics displayed in Table 14, and also the P-values greater than or equal to 0.05, show that there is not a statistically significant difference amongst the standard deviations of the groups.

Table 14 Tests of homogeneity of variances—data grouped by Step

The comparison of the standard deviations for each pair of samples is given in Table 15. It can be stated that there are no statistically significant differences between the standard deviations of any pair of groups.

Table 15 Comparison of the standard deviations for each pair of groups—data grouped by Step

The situation is different from the previous case; we did not obtain statistically different values of averages and standard deviations for the groups of fc values corresponding to different values of parameter Step. Again, we decided to use an alternative to the standard analysis of variance, the Kruskal-Wallis test, to compare the medians instead of the means (Table 16).

Table 16 Kruskal-Wallis test—data grouped by Step

The null hypothesis of the Kruskal-Wallis test is that the medians of fc within each of the four levels of Step are the same (17). Since the P-value is greater than 0.05, there is not a statistically significant difference amongst the medians.

Hence, the results of the tests and also the box and whisker plot show that the means, medians and standard deviations of all four samples can be considered equal. From a statistical point of view, there is no difference between the tested values of the parameter Step. Despite this, the values 0.7 or 0.9 for the parameter Step seem to be an appropriate choice, since calculations are usually faster for larger values of Step. These values generate equivalent results with similar outliers. The latter value is preferred because of its smaller interquartile range.

The last tested hypothesis is the comparison of 4 group means \( \overline{fc}_{A1} ,\overline{fc}_{A2} ,\overline{fc}_{A3} ,\overline{fc}_{A4} \) for different values of parameter Step (factor A) according to the test (19) and (20), where the levels of Step were set to 0.3, 0.5, 0.7 and 0.9, and also the comparison of 4 group means \( \overline{fc}_{B1} ,\overline{fc}_{B2} ,\overline{fc}_{B3} ,\overline{fc}_{B4} \) for different values of parameter PRT (factor B) according to the test (21) and (22), where the levels of PRT were set to 0.2, 0.4, 0.6 and 0.8, as well as the comparison of 16 group means \( \overline{fc}_{A1B1} ,\overline{fc}_{A2B1} , \ldots ,\overline{fc}_{A4B4} \) for the mentioned values of Step and PRT according to (23) and (24).

The ANOVA table (Table 17) decomposes the variability of fc into contributions due to both factors Step and PRT. The contribution of each factor is measured with the effect of the other factor removed. Since the P-value of factor PRT is less than 0.05, this factor has a statistically significant effect on fc at the 5 % significance level. Significant interaction effects between the analysed factors have not been confirmed. The results of the multiple-factor analysis of variance confirmed the conclusions obtained using the single-factor analysis of variance. Different values of parameter Step did not result in statistically different values of fc. In contrast, different values of parameter PRT resulted in statistically different values of fc.

Table 17 Multiple analysis of variance table

It is evident (Fig. 3) that the small values of PRT (0.2 and 0.4) result in large variability of the fc values regardless of the value of Step. The interaction plot (Fig. 4) gives the mean values of fc depending on the combination of the mentioned factors. Based on that, it seems to be an appropriate choice to set the parameter PRT to 0.6 or 0.8.

Fig. 3
figure 3

Box and whisker plot—data grouped by interaction of Step and PRT

Fig. 4
figure 4

Interaction plot
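An interaction plot of this kind can be produced, for example, with pandas and matplotlib, as in the sketch below; the results data frame is synthetic and merely stands in for the collected simulation outputs.

```python
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt

# Synthetic stand-in for the simulation results (columns PRT, Step, fc)
rng = np.random.default_rng(0)
results = pd.DataFrame(
    [{"PRT": p, "Step": s, "fc": rng.normal(200 + 40 * (1 - p), 5)}
     for p in [0.2, 0.4, 0.6, 0.8]
     for s in [0.3, 0.5, 0.7, 0.9]
     for _ in range(8)]
)

# Mean fc for every Step-PRT combination, drawn as one line per PRT level
cell_means = results.groupby(["Step", "PRT"])["fc"].mean().unstack("PRT")
ax = cell_means.plot(marker="o")
ax.set_xlabel("Step")
ax.set_ylabel("mean fc")
ax.legend(title="PRT")
plt.show()
```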

Further on, one more analysis was realized in order to choose between the two values of parameter PRT, and a one-way analysis of variance for parameter PRT was conducted. The tested hypothesis is the comparison of 5 group means \( \overline{fc}_{1} ,\overline{fc}_{2} ,\overline{fc}_{3} ,\overline{fc}_{4} ,\overline{fc}_{5} \) for different values of parameter PRT on the levels 0.5, 0.6, 0.7, 0.8 and 0.9 according to (1) and (2). Parameter Step was set to the value 0.9. The experiment is balanced: for each value of PRT the same number of simulations, eight replications, was realized. A total of 40 simulations were thus implemented; their results are summarized in Table 18.

Table 18 Results of simulations

The summary of the descriptive statistics for every value of parameter PRT can be seen in Table 19.

Table 19 Summary statistics—data grouped by PRT

There is again a big difference between the smallest and the largest standard deviation. Remember that this may cause problems, since the analysis of variance assumes that the standard deviations at all levels are equal. This is evident also from the box and whisker plot of the results (Fig. 5).

Fig. 5
figure 5

Box and whisker plot—data grouped by PRT

It is evident that there is some significant non-normality in the data, which violates the assumption that the data come from normal distributions. The analysis of variance decomposes the variance of fc into two components: a between-group component and a within-group component (Table 20).

Table 20 Analysis of variance table—data grouped by PRT

The F ratio, which equals 3.3753, is the ratio of the between-group estimate to the within-group estimate. Since the P-value of the F test is less than 0.05, there is a statistically significant difference amongst the mean fc values from one level of PRT to another at the 5 % significance level. Fisher’s least significant difference procedure is used to determine which means are significantly different from which others (Table 21).

Table 21 Comparison procedure of Fisher’s least significant difference—data grouped by PRT

A significant difference is seen between the group of simulations where PRT equals 0.5 and the groups where PRT equals 0.7, 0.8 and 0.9. A large departure from homogeneity is evident, so next all the tests of equality of variances are used (Table 22).

Table 22 Tests of homogeneity of variances—data grouped by PRT

The statistics displayed in this table, and also the P-values, show that there is a statistically significant difference amongst the standard deviations of the groups. This violates one of the important assumptions underlying the analysis of variance.

The comparison of the standard deviations for each pair of samples is given in Table 23. P-values below 0.05 indicate statistically significant differences between the standard deviations of those pairs of groups.

Table 23 Comparison of the standard deviations for each pair of groups—data grouped by PRT

The situation is the same as in the first analysis of parameter PRT; there are statistically different values of averages and standard deviations for the groups of fc values corresponding to different values of parameter PRT. Due to the failure of the assumptions, the results of the analysis of variance cannot be taken into account. Finally, despite all the previous conclusions, we decided to use the Kruskal-Wallis test to compare the medians instead of the means (Table 24).

Table 24 Kruskal-Wallis test—data grouped by PRT

The null hypothesis of the Kruskal-Wallis test (17) is that the medians of fc within each of the five levels of PRT are the same. Since the P-value is less than 0.05, there is a statistically significant difference amongst the medians. It seems that the medians of the groups where PRT equals 0.5 and 0.6 are significantly different from the others. Based on the above, the values 0.7–0.9 for the parameter PRT are considered the appropriate choice.

4 Conclusions

Evolutionary algorithms are considered to be a universal and effective tool for solving various optimization problems. Their effectiveness is limited by the fact that they are generally controlled by a special set of parameters. Although some of the parameters can be successfully set exogenously, based on the philosophy of the algorithm or according to the type of the solved problem, there is no deeper theoretical basis for adjusting all the parameters. This chapter focuses on the possibility of using some statistical methods that may be helpful for determining effective values of some parameters of SOMA.

Based on various tests one can conclude that SOMA is even more sensitive to the parameter settings than other algorithms [3, 5]; thus an efficient setting may significantly affect the quality of the results. The setting of the control parameters can be supported by statistical methods, especially those aimed at determining whether the level of some parameter makes a difference in the results. A brief overview of the corresponding statistical methods (single-factor analysis of variance, Levene’s test, Cochran’s test, Bartlett’s test, Hartley’s test, two-way analysis of variance) is given in the first half of the chapter. The second half is aimed at an example of practical use based on illustrative data for the traveling salesman problem.