1 Introduction

Optimization problems are ubiquitous in the real world, and therefore methods for solving them have long been a hot research topic. However, optimization problems are becoming increasingly complicated with the development of science and technology, and traditional gradient-based methods are inefficient and inconvenient for such problems because they require substantial gradient information, depend on a well-defined starting point, and need a large amount of enumeration memory. On the other hand, meta-heuristic algorithms, such as Genetic Algorithms (GAs) [1], Differential Evolution [2], Particle Swarm Optimization (PSO) [3], and Ant Colony Optimization (ACO) [4], have shown better results on various complex problems such as feature selection [5], controller design [6], and node placement in wireless sensor networks [7]. Encouraged by these achievements, more and more researchers have devoted themselves to the design and application of meta-heuristics.

The well-known No Free Lunch theorem states that any two optimization algorithms are equivalent when their performance is averaged across all possible problems. This implies that one algorithm can outperform others on a particular class of problems, which has been demonstrated by previous works. Thus, developing new meta-heuristics that solve various problems more efficiently and effectively has drawn increasing attention, and because of the great success of GAs, PSO, and ACO, which are inspired by biological systems, exploring biologically inspired meta-heuristics has become one of the hottest topics in the evolutionary computation community. During the last decade, a variety of biosystem-based meta-heuristics, such as Artificial Fish Swarm Algorithms (AFSA) [8], Artificial Bee Colony Optimization (ABC) [9], Bat Algorithms (BA) [10], Hunting Search Algorithms [11], Harmony Search (HS) [12], Fruit Fly Optimization Algorithms (FOA) [13], Firefly Algorithms [14], Shuffled Frog-leaping Algorithms [15], and Cuckoo Search [16], have been developed and applied to different problems. Humans, owing to their powerful learning ability, can tackle a large number of complicated problems that other living beings, such as birds and ants, cannot solve. Therefore, it is natural to presume that a meta-heuristic based on the learning mechanisms of human beings may have advantages over algorithms based on other biological systems for the optimization problems of daily life. In fact, many human learning activities resemble the search process of meta-heuristics. For example, to master a new skill, people repeatedly practice and evaluate each attempt to update the experience that guides their subsequent study, which is analogous to meta-heuristics iteratively yielding new candidate solutions and calculating the corresponding fitness values to adjust their subsequent search.
Motivated by this idea, Wang et al. [17] recently presented a new meta-heuristic algorithm called Human Learning Optimization (HLO). However, HLO assumes that all individuals have the same learning ability, which is not true. Herrnstein argued in his famous book “The Bell Curve” that Intelligence Quotient (IQ) scores follow a Gaussian distribution [18], and previous research also showed that IQ test scores have significantly increased and will continue to rise with the development of society and technology [19, 20]. Inspired by these facts, this paper proposes an improved HLO algorithm, called Diverse Human Learning Optimization (DHLO), in which the learning ability of individuals follows a Gaussian distribution and is dynamically adjusted to improve the search ability of the algorithm.

The rest of the paper is organized as follows. Section 2 presents the concept, operators, and implementation of DHLO in detail. Then the parameter study of DHLO is performed and discussed in Sect. 3. Section 4 verifies the performance of DHLO on benchmark functions as well as knapsack problems, and the results are compared with the standard HLO as well as eight other meta-heuristic algorithms. Finally, conclusions are drawn in Sect. 5.

2 Diverse human learning optimization

The human learning process is extremely complicated; its study spans neuropsychology, educational psychology, learning theory, and pedagogy. For ease of implementation, DHLO, like HLO [17], uses three learning operators, i.e. the random learning operator, the individual learning operator, and the social learning operator, to update the population and search for the optimal solution, which emulates the behaviors of random learning, individual learning, and social learning in human learning activities. For example, when a person learns to play basketball, he or she may try new skills randomly for lack of prior knowledge (random learning), learn from his or her former experience (individual learning), and pick up useful methods from his or her coach or related books (social learning).

DHLO adopts a binary-coding framework, that is, each individual of DHLO is represented as a binary string, in which each bit of a solution is analogous to a basic element of the knowledge that humans need to learn. Assuming that there is no prior knowledge of the problem at the beginning, an individual is initialized with “0” or “1” randomly as Eq. (1)

$$\begin{aligned} x_i =\left[ {x_{i1}\,\; x_{i2}\,\; \ldots \,\; x_{ij}\,\; \ldots \,\; x_{iN} } \right] ,\;x_{ij} \in \{0,1\},1\le i\le M,1\le j\le N \end{aligned}$$
(1)

where \(x_{ij} \) is the jth bit of the ith individual, and M and N denote the number of individuals in the population and the length of solutions, respectively.
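As a concrete illustration, the initialization of Eq. (1) can be sketched as follows; the function name and the NumPy-based representation are our own choices, not from the paper:

```python
import numpy as np

def init_population(M, N, seed=None):
    """Initialize M binary individuals of length N as in Eq. (1):
    each bit is set to 0 or 1 with equal probability, reflecting the
    assumption of no prior knowledge of the problem."""
    rng = np.random.default_rng(seed)
    return rng.integers(0, 2, size=(M, N))

pop = init_population(M=5, N=8, seed=42)
```

Each row of `pop` is one individual \(x_i\) and each column one bit position \(j\).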

2.1 Random learning operator

As Cziko [21] argued, human learning is the result of random variation and universal selection, so randomness always exists in the process of human learning. At the beginning of learning, humans usually learn through random acts since there is no prior knowledge of a new problem. As study proceeds, people still perform random learning because of various factors such as forgetting, disturbance, and knowing only part of the problem. Besides, human beings keep exploring new strategies to learn better, in which random learning is unavoidable. DHLO performs the random learning operator to mimic these phenomena as Eq. (2),

$$\begin{aligned} x_{ij} =RE(0,1)=\left\{ {\begin{array}{l} 0,\;rand\le 0.5 \\ 1,\;else \\ \end{array}} \right. \;\;\; \end{aligned}$$
(2)

where rand is a random number uniformly distributed in [0, 1].

2.2 Individual learning operator

Individual learning is the ability of humans to gain knowledge through the individual reflection on external stimuli [22]. People memorize the useful experience during their study and use it when they face the same or similar problems and therefore they can avoid mistakes and learn more efficiently. To simulate this learning behavior, each individual in DHLO stores its personal best solutions in the individual knowledge database (IKD) represented as Eq. (3)

$$\begin{aligned} {\textit{IKD}}_i =\left[ {\begin{array}{l} {ikd}_{i1} \\ {ikd}_{i2} \\ \vdots \\ {ikd}_{ir} \\ \vdots \\ {ikd}_{iP} \\ \end{array}} \right] =\left[ {\begin{array}{llllll} {ik}_{i11} &{}{ik}_{i12} &{}\cdots &{}{ik}_{i1j} &{}\cdots &{} {ik}_{i1N} \\ {ik}_{i21} &{}{ik}_{i22} &{}\cdots &{}{ik}_{i2j} &{}\cdots &{}{ik}_{i2N} \\ \vdots &{}\vdots &{}&{}\vdots &{}&{} \vdots \\ {ik}_{ir1} &{}{ik}_{ir2} &{}\cdots &{}{ik}_{irj} &{}\cdots &{}{ik}_{irN} \\ \vdots &{}\vdots &{}&{}\vdots &{}&{} \vdots \\ {ik}_{iP1} &{}{ik}_{iP2} &{}\cdots &{}{ik}_{iPj}&{} \cdots &{}{ik}_{iPN} \\ \end{array}} \right] ,1\le r\le P \end{aligned}$$
(3)

where \({\textit{IKD}}_{i}\) denotes the individual knowledge database of person \(i\), \({ikd}_{ir}\) stands for the rth best solution of person i, and P is the size of the IKDs. When DHLO executes the individual learning operator, it chooses a random solution in the IKD and then copies the corresponding value as Eq. (4),

$$\begin{aligned} x_{ij} ={ik}_{irj} \end{aligned}$$
(4)

where r is a random integer between 1 and P.

2.3 Social learning operator

However, when problems become extremely complicated, it would be impossible or very time-consuming for a single person to solve them. In a social environment, humans directly or indirectly transfer their knowledge and thereby improve the efficiency and effectiveness of study through social learning [23]. Previous works demonstrate that population-based meta-heuristics have an advantage on complicated problems because of the sharing of knowledge among individuals. Therefore, social learning is simulated in DHLO to enhance the search ability of the algorithm, and the best solutions found by all the individuals are archived in the social knowledge database (SKD) as Eq. (5) for sharing experience in the population,

$$\begin{aligned} {\textit{SKD}}=\left[ {\begin{array}{l} {skd}_1 \\ {skd}_2 \\ \vdots \\ {skd}_s \\ \vdots \\ {skd}_Q \\ \end{array}} \right] =\left[ {\begin{array}{llllll} {sk}_{11} &{}{sk}_{12} &{}\cdots &{}{sk}_{1j}&{} \cdots &{}{sk}_{1N} \\ {sk}_{21} &{}{sk}_{22} &{}\cdots &{} {sk}_{2j} &{}\cdots &{}{sk}_{2N} \\ \vdots &{}\vdots &{}&{} \vdots &{}&{} \vdots \\ {sk}_{s1} &{}{sk}_{s2}&{} \cdots &{}{sk}_{sj} &{}\cdots &{}{sk}_{sN} \\ \vdots &{}\vdots &{}&{} \vdots &{}&{} \vdots \\ {sk}_{Q1} &{}{sk}_{Q2} &{}\cdots &{}{sk}_{Qj} &{}\cdots &{}{sk}_{QN} \\ \end{array}} \right] ,1\le s\le Q \end{aligned}$$
(5)

where \({skd}_{s}\) denotes the sth solution in the SKD and Q is the size of the SKD. Based on the knowledge in the SKD, DHLO performs the social learning operator to generate a new solution as Eq. (6),

$$\begin{aligned} x_{ij} ={sk}_{sj} \end{aligned}$$
(6)

where s is a random integer between 1 and Q.

2.4 Gaussian-distribution and dynamic updating of the learning ability

DHLO, as well as HLO, generates new solutions by performing the random learning operator, the social learning operator, and the individual learning operator. In general, the implementation of these three learning operators can be formulated as Eq. (7),

$$\begin{aligned} x_{ij} =\left\{ {\begin{array}{ll} RE(0,1), &{} 0\le rand\le p_r \\ {ik}_{irj} , &{} p_r <rand\le p_i \\ {sk}_{sj} , &{} else \\ \end{array}} \right. \end{aligned}$$
(7)

where \(p_{r}\) and \(p_{i}\) are two control parameters that determine the probabilities of running the operators. Specifically, \(p_{r}\) determines the probability of random learning while (\(p_{i}-p_{r})\) and (\(1-p_{i})\) are the rates of individual learning and social learning, respectively. In the standard HLO these two parameters, i.e. \(p_{r}\) and \(p_{i}\), are both set as constants, and the recommended values are 5/N and \(0.85+2/N\), where N is the length of solutions. Therefore, all the individuals of HLO have the same learning capability, which is not true in a real human population. For instance, the IQ scores of humans [24], as well as some other factors influencing human learning, follow a Gaussian distribution, which results in different learning abilities among people; consequently, the scores on an exam usually follow an approximately Gaussian distribution. In addition, Flynn points out that IQ test scores keep rising. Inspired by these facts, the Gaussian-distributed learning ability and the dynamic adjusting strategy are developed in DHLO.
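A per-bit sketch of Eq. (7) in Python may clarify how the three operators interleave; the array layout and all names below are our own, not the paper's:

```python
import numpy as np

def generate_candidate(ikd_i, skd, p_r, p_i, rng):
    """Produce one new solution bit by bit via Eq. (7).

    ikd_i: (P, N) array of individual i's stored best solutions.
    skd:   (Q, N) array, the social knowledge database.
    For each bit, random learning fires with probability p_r, individual
    learning with probability p_i - p_r, and social learning otherwise."""
    N = ikd_i.shape[1]
    x = np.empty(N, dtype=int)
    for j in range(N):
        u = rng.random()
        if u <= p_r:                           # random learning, RE(0, 1)
            x[j] = rng.integers(0, 2)
        elif u <= p_i:                         # individual learning, Eq. (4)
            x[j] = ikd_i[rng.integers(ikd_i.shape[0]), j]
        else:                                  # social learning, Eq. (6)
            x[j] = skd[rng.integers(skd.shape[0]), j]
    return x
```

Note that a fresh row index is drawn for every bit, matching the per-bit form of Eqs. (4) and (6).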

Taking a closer look at the learning operators of HLO, it is obvious that the random learning operator performs a random search in which no knowledge is taken into account. Considering that only two values, i.e. 0 and 1, exist in binary space, the function of the random learning operator is similar to the mutation operator of Genetic Algorithms. Thus it is sensible that the suggested value of \(p_{r}\) is very small, since the contribution of the random learning operator is to keep the diversity of the population and perform a local search; otherwise the random search may impair the learning mechanisms of HLO and significantly spoil the performance of the algorithm. Compared with the random learning operator, the individual learning operator and the social learning operator are the two main learning operators, which update the population according to the individual experience and the knowledge of the population, respectively. Therefore, \(p_{i}\) plays a very important role since it directly determines the balance between individual learning and social learning. For example, if \(p_{i} =1\), HLO would lose the ability of social learning, and consequently the efficiency and effectiveness of the algorithm would be ruined since the advantage of knowledge sharing no longer exists. On the other hand, if \(p_{i}=p_{r}\), which means that individual learning is abandoned, HLO would degrade into a local search around the global best solution. Unfortunately, the optimal \(p_{i}\) is problem-dependent, and thus it is almost impossible to set the optimal value without prior knowledge. To tackle this problem, the Gaussian distribution and the dynamic updating of the parameter \(p_{i}\) are introduced in DHLO to tune \(p_{i}\) and improve the search ability.

First, when the algorithm is initialized, each individual of DHLO is given a different personal \(p_{i}\), instead of the single value shared by all the individuals in HLO, which follows a Gaussian distribution as Eq. (8),

$$\begin{aligned} p_i \sim N(\mu ,\sigma ^{2}) \end{aligned}$$
(8)

where \(\mu \) and \(\sigma \) are the mean and standard deviation, respectively. The advantages of using a Gaussian distribution are: (1) the majority of \(p_{i}\) values are generated in the range determined by \(\mu \) and \(\sigma \), and therefore a fair performance of DHLO can be guaranteed; (2) compared with HLO, which uses only one value of \(p_{i}\), the robustness of DHLO is enhanced by searching with various reasonable \(p_{i} \) values; (3) differences in the performance of individuals will emerge from the different \(p_{i}\) values, which can be exploited to dynamically update \(p_{i}\) and improve the search ability further.
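The sampling of Eq. (8) can be sketched as follows; the default values echo the parameter study of Sect. 3 (\(3\sigma =0.02\) and a mean near the recommended \(p_{i}\)), while the clipping to [0, 1] is our own safeguard and is not described in the paper:

```python
import numpy as np

def init_learning_abilities(M, mu=0.85, three_sigma=0.02, seed=None):
    """Draw a personal p_i for each of the M individuals from
    N(mu, sigma^2) as in Eq. (8); three_sigma is the 3*sigma spread
    used as the tuning variable in the paper's parameter study."""
    rng = np.random.default_rng(seed)
    p = rng.normal(mu, three_sigma / 3.0, size=M)
    return np.clip(p, 0.0, 1.0)  # safeguard: keep probabilities valid
```

With \(3\sigma =0.02\), about 99.7 % of the sampled \(p_{i}\) values fall in \([\mu -0.02,\mu +0.02]\).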

Then the dynamic updating of \(p_{i}\) is executed every DG generations, where DG is a pre-defined constant. When dynamic updating is performed, \(\mu \), i.e. the mean of the Gaussian distribution, is set as Eq. (9)

$$\begin{aligned} \mu =p_i^*\end{aligned}$$
(9)

where \(p_i^*\) is the \(p_{i}\) value of the individual with the best fitness. The \(p_{i}\) value of each individual is adjusted as Eq. (10) if the global optimum found by DHLO has been updated in the latest DG generations,

$$\begin{aligned} p_{i,j} =p_{i,j} +rand\times (p_i^*-p_{i,j} ) \end{aligned}$$
(10)

where \(p_{i,j} \) is the \(p_{i}\) value of the jth individual; therefore the \(p_{i}\) of all individuals moves toward a better value to improve the performance in the following search. Otherwise, all the values of \(p_{i}\) are re-initialized with \(\sigma \) and the updated \(\mu \).
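The two branches of the updating rule, Eqs. (9) and (10), can be sketched together as follows; the function and argument names are ours, and the caller is assumed to track whether the global best improved over the last DG generations:

```python
import numpy as np

def adjust_learning_abilities(p, best_idx, improved, three_sigma=0.02, seed=None):
    """Dynamic updating of the personal p_i values (Sect. 2.4).

    mu is set to the p_i of the currently best individual (Eq. (9)).
    If the global best improved during the last DG generations, every
    p_i is pulled a random fraction of the way toward mu (Eq. (10));
    otherwise all p_i are re-sampled from N(mu, sigma^2)."""
    rng = np.random.default_rng(seed)
    mu = p[best_idx]                               # Eq. (9)
    if improved:
        return p + rng.random(p.shape) * (mu - p)  # Eq. (10)
    return rng.normal(mu, three_sigma / 3.0, size=p.shape)

p_new = adjust_learning_abilities(np.array([0.80, 0.85, 0.90]), best_idx=1,
                                  improved=True, seed=0)
```

In the `improved` branch every \(p_{i,j}\) lands between its old value and \(p_i^*\), so the population's learning abilities contract toward the currently best-performing one.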

2.5 Updating of the IKD and the SKD

After a new population is generated, the fitness of the candidates is calculated according to the fitness function and used to update the IKDs and the SKD, which is analogous to the process by which humans evaluate their performance through practice to refresh their knowledge for further study. For the updating of the IKD, if the number of solutions in the current IKD is less than P, i.e. the pre-defined size of the IKD, the new candidate is stored in the IKD regardless of its fitness. Otherwise, the new candidate is kept and used to replace the solution with the worst fitness in the IKD only when it has a better fitness. For the updating of the SKD, the same strategies as for the IKD are applied. However, DHLO permits only one solution in the SKD to be replaced in each generation, to keep the diversity and avoid premature convergence of the algorithm.
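The shared updating rule can be sketched as a single helper, assuming a maximization problem; the names are illustrative, and the paper's extra restriction of at most one SKD replacement per generation is left to the caller:

```python
def update_database(solutions, fitnesses, candidate, cand_fit, capacity):
    """Update an IKD (or, with capacity Q, the SKD) in place.

    If the database is not yet full, the candidate is always stored;
    otherwise it replaces the worst stored solution only if its
    fitness is strictly better."""
    if len(solutions) < capacity:
        solutions.append(candidate)
        fitnesses.append(cand_fit)
    else:
        worst = min(range(len(fitnesses)), key=fitnesses.__getitem__)
        if cand_fit > fitnesses[worst]:
            solutions[worst] = candidate
            fitnesses[worst] = cand_fit
```

Because a stored solution is only ever replaced by a better one, the best fitness in each database is non-decreasing over generations.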

2.6 Implementation of DHLO

In summary, the procedure of DHLO can be summarized as follows:

  • Step 1: initialize the population randomly, yield the initial values of \(p_{i}\) for each individual following Gaussian distribution, and set the other parameters of DHLO such as \(p_{r}\) and the maximal generation;

  • Step 2: calculate the fitness of initial individuals and initialize the IKDs and SKD;

  • Step 3: yield new candidates by performing the three learning operators as Eq. (7);

  • Step 4: compute the fitness of all the new solutions;

  • Step 5: update the IKDs and SKD according to the updating rules;

  • Step 6: every DG generations, set the mean \(\mu \) of the Gaussian distribution as Eq. (9), and then adjust the value of \(p_{i}\) of each individual as Eq. (10) if the global optimum has been updated; otherwise re-initialize the \(p_{i}\) of each individual with the updated \(\mu \);

  • Step 7: if the termination conditions are met, output the best solution; otherwise go to step 3.
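The seven steps above can be tied together in a compact, self-contained sketch. The toy OneMax fitness, the parameter values, and all names below are illustrative assumptions; IKD and SKD sizes are fixed to 1, and, for simplicity, the SKD may be replaced more than once per generation:

```python
import numpy as np

def dhlo_onemax(N=12, M=20, gens=300, p_r=0.1, mu=0.85, three_sigma=0.02,
                DG=50, seed=0):
    """Minimal DHLO sketch maximizing a toy OneMax fitness (number of
    1-bits); parameter values are illustrative, not the paper's tuned
    settings."""
    rng = np.random.default_rng(seed)
    fitness = lambda x: int(x.sum())

    # Steps 1-2: population, personal p_i values, IKDs and SKD.
    pop = rng.integers(0, 2, size=(M, N))
    p_i = np.clip(rng.normal(mu, three_sigma / 3.0, size=M), p_r, 1.0)
    ikd = pop.copy()
    ikd_fit = np.array([fitness(x) for x in pop])
    b = int(ikd_fit.argmax())
    skd, skd_fit = ikd[b].copy(), int(ikd_fit[b])

    improved = False
    for g in range(1, gens + 1):
        for i in range(M):
            # Step 3: random / individual / social learning, Eq. (7).
            u = rng.random(N)
            x = np.where(u <= p_r, rng.integers(0, 2, size=N),
                         np.where(u <= p_i[i], ikd[i], skd))
            # Steps 4-5: evaluate and update the knowledge databases.
            f = fitness(x)
            if f > ikd_fit[i]:
                ikd[i], ikd_fit[i] = x, f
            if f > skd_fit:
                skd, skd_fit, improved = x.copy(), f, True
        # Step 6: dynamic updating of p_i every DG generations.
        if g % DG == 0:
            mu_g = p_i[int(ikd_fit.argmax())]             # Eq. (9)
            if improved:
                p_i += rng.random(M) * (mu_g - p_i)       # Eq. (10)
            else:
                p_i = np.clip(rng.normal(mu_g, three_sigma / 3.0, size=M),
                              p_r, 1.0)
            improved = False
    # Step 7: return the best solution found.
    return skd, skd_fit

best_x, best_f = dhlo_onemax()
```

Since the SKD is only ever replaced by a strictly better solution, the returned fitness is guaranteed to be at least the best fitness of the initial population.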

3 Parameter analysis of DHLO

To apply the strategies of Gaussian distribution and dynamic updating efficiently, a parameter study on these two strategies was carried out, and two functions, i.e. F2 and F9, chosen from the CEC05 benchmark functions [25], were adopted for testing. The characteristics of these two functions, as well as of the other 13 functions used as benchmarks for evaluating DHLO in the next section, are listed in Table 1.

Table 1 The CEC05 benchmark functions

The Gaussian distribution involves two parameters, i.e. the mean \(\mu \) and the standard deviation \(\sigma \). In DHLO \(\mu \) is dynamically adjusted by the algorithm, thus only the standard deviation \(\sigma \) needs to be set manually. It is well known that about 99.7 % of the random numbers generated by a Gaussian distribution fall within three standard deviations of the mean, i.e. in \([\mu -3\sigma ,\mu +3\sigma ]\). Therefore, \(3\sigma \) was adopted as the variable in the parameter study for simplification. As for the dynamic updating strategy, the variable is DG. A set of \(3\sigma \) and DG values, i.e. {0.005, 0.01, 0.02, 0.05, 0.08, 0.1, 0.15} and {100, 200, 500, 1000, 1500, 3000}, respectively, was used to solve the 2-dimensional and 30-dimensional F2 and F9. For the 2-dimensional functions, the population size was set to 50 and the maximal generation was 3000. For the 30-dimensional functions, the population size and the maximal generation number were increased to 100 and 6000, respectively. Each variable was encoded by 30 bits, and each function was run 50 times independently. The results, including the best fitness value (BFV), the mean fitness value (MFV), and the standard deviation (STD), are given in Tables 2 and 3. The best results of the algorithms are marked in bold-face in the corresponding tables.

Table 2 The results of parameter study on F2
Table 3 The results of parameter study on F9

Tables 2 and 3 show that the optimal \(3\sigma \) and DG are problem-dependent and that these two parameters also interact with each other. However, with a very large \(3\sigma \), for instance a value bigger than 0.1, \(p_{i}\) spreads over a wide range and greatly deviates from the recommended value, which consequently spoils the exploration-exploitation trade-off of the algorithm. On the other hand, a very small \(3\sigma \) is also improper since it reduces or even eliminates the advantage of the Gaussian distribution. As for DG, a large DG decreases the influence of dynamic updating since it reduces the chance of performing the operation, while a small DG can enhance the effect of dynamic updating and improve the performance of the algorithm. For example, DHLO obtains a better result on F2 with \(3\sigma =0.005\) and DG=100 than any other result yielded with \(3\sigma =0.005\) and \(\hbox {DG}>100\). However, setting a very small DG is risky, as DHLO then becomes very sensitive to \(3\sigma \) and the algorithm is likely to be unstable. Due to the randomness of DHLO, the best solutions found during the search process might be far away from the real optimal solution, so the temporary best solutions may mislead the dynamic updating operation, especially when a small DG is applied. Consequently, the performance of DHLO might become worse, which can be observed from the data of the 30-dimensional F2.

In general, it is more reasonable to choose a moderate \(3\sigma \) and a large DG so that the former can effectively improve the search ability and the latter can decrease the negative effect from the “wrong” best solutions. Based on the comprehensive analysis of the results in Tables 2 and 3, 0.02 and 1000 are chosen as the default values of \(3\sigma \) and DG, respectively.

4 Experimental results and discussions

To evaluate the performance, DHLO, as well as the standard HLO [17] and the other eight binary-coded meta-heuristics, i.e. the Binary Differential Evolution algorithm (BDE) [26], the Simplified Binary Artificial Fish Swarm Algorithm (S_bAFSA) [27], the Adaptive Binary Harmony Search (ABHS) [28], the Binary Gravitational Search Algorithm (BGSA) [29], the Binary Bat Algorithm (BBA) [30], the Binary Artificial Bee Colony (BABC) [31], the Bi-Velocity Discrete Particle Swarm Optimization (BVDPSO) [32], and the Modified Binary Particle Swarm Optimization (MBPSO) [33], was applied to solve the 15 CEC05 benchmark functions listed in Table 1 and knapsack problems. For a fair comparison, the recommended parameter values of these algorithms were adopted, which are given in Table 4. As the CEC05 benchmarks and knapsack problems studied in this paper are the single-objective problems, the sizes of the IKDs and the SKD were both set to 1 as recommended in [17]. Besides, the IKDs of DHLO were re-initialized if the individual best solution was not updated in 100 successive generations to prevent the algorithm from being trapped in the local optima. The other parameters of DHLO, such as the population size and the maximal generation, were the same as those used in Sect. 3.

4.1 Benchmark functions

4.1.1 Low-dimensional functions

The numerical results and the Wilcoxon signed-rank test (W-test) results on the 2-dimensional functions are given in Table 5, in which “1” denotes that DHLO significantly outperforms the compared algorithm at the 95 % confidence level, “\(-1\)” denotes that DHLO is significantly worse than the compared algorithm, and “0” indicates that the results achieved by DHLO and the compared algorithm are not statistically different. For clear analysis and comparison of the performance, the rankings and the W-test results of all the algorithms are summarized in Tables 6 and 7, respectively.

Table 4 The recommended parameter values of all the algorithms
Table 5 The results of the 2-dimensional functions

Tables 6 and 7 show that DHLO has better performance on the low-dimensional functions. Specifically, DHLO achieves the best numerical results on all the functions. The performance ranking of all the algorithms, sorted in descending order, is DHLO, HLO, BVDPSO, S_bAFSA, ABHS, BDE, BGSA, MBPSO, BBA, and BABC. The W-test results demonstrate that DHLO is significantly better than HLO and the other eight algorithms on 12 and 14 out of 15 functions, respectively, while it is inferior to them on none.

4.1.2 High-dimensional functions

The optimization results on the 30-dimensional functions are given in Table 8. Likewise, the rankings and the W-test results of all the algorithms are summarized in Tables 9 and 10 for a clear review of their performance. The results on the high-dimensional functions also indicate that DHLO has an advantage over the other nine algorithms. Table 9 shows that DHLO obtains the optimal numerical result on 14 out of 15 functions and is only inferior to S_bAFSA on F10. The performance ranking of all the algorithms on the high-dimensional functions, sorted in descending order, is DHLO, HLO, BDE, S_bAFSA, ABHS, BVDPSO, MBPSO, BBA, BGSA, and BABC. The W-test results in Table 10 indicate that DHLO significantly surpasses ABHS, BGSA, BBA, BABC, BVDPSO, and MBPSO on all the functions. Compared with HLO, BDE, and S_bAFSA, DHLO has significantly better results on 13, 12, and 13 out of 15 functions and yields statistically similar results on the other 2, 3, and 2 functions, respectively.

Table 6 The rankings of all the algorithms on the 2-dimensional functions
Table 7 The summary of the W-test results between DHLO and the other meta-heuristics on the 2-dimensional functions
Table 8 The results of the 30-dimensional functions
Table 9 The rankings of all the algorithms on the 30-dimensional functions
Table 10 The summary of the W-test results between DHLO and the other meta-heuristics on the 30-dimensional functions

4.2 Knapsack problems

Previous work [34] shows that the rankings of compared optimizers are sensitive to the benchmark set, and therefore the performance of DHLO is further evaluated on knapsack problems for a comprehensive comparison. Knapsack problems are combinatorial optimization problems that have been studied intensively in the last few decades because of their simple structure, which, on the one hand, allows the exploitation of a number of combinatorial properties and, on the other hand, allows more complex optimization problems to be solved through a series of knapsack-type sub-problems [35]. Indeed, many real application problems, such as cargo loading, cutting stock, project selection, and budget control, can be formulated as knapsack problems. In this work, DHLO and the other meta-heuristic algorithms are adopted to solve 0-1 knapsack problems (0-1 KP) and multidimensional knapsack problems (MKP).

4.3 0-1 knapsack problems

In a given set of N items, each item has a weight \(w_{j}\) and a profit \(p_{j}\). The 0-1 knapsack problem is to select a subset of the N items such that the overall profit is maximized without exceeding a preset weight capacity C, which can be mathematically formulated as Eq. (11)

$$\begin{aligned}&max\;f(x_1 ,x_2 ,\ldots ,x_N )=\sum \limits _{j=1}^N {p_j x_j} \nonumber \\&s.t.\;\sum \limits _{j=1}^N {w_j x_j \le C} ,\;x_j \in \{0,1\},\;j\in \{1,2,\ldots ,N\} \end{aligned}$$
(11)

where the binary decision variable \(x_{j}\) takes the value 0 or 1, representing the rejection or selection of the jth item, respectively. Without loss of generality, 0-1 KPs assume that all profits and weights are positive and all weights are smaller than C. As 0-1 KPs are constrained problems, infeasible solutions, whose total weight exceeds the limit C, may be generated during the search process. Thus, the penalty function method of Eq. (12) is adopted to deal with infeasible solutions,

$$\begin{aligned} Max\;F(\text {x})= & {} f(x)-\lambda \times \max (0,c) \nonumber \\ c= & {} \sum \limits _{j=1}^N {w_j x_j -C} \end{aligned}$$
(12)

where the penalty coefficient \(\lambda \) is a large constant, so that the fitness of infeasible solutions is inferior to that of feasible solutions, which leads the algorithm to escape from the infeasible area and search in the feasible region.
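A direct implementation of Eq. (12) can look like the following; the function name and the illustrative value of \(\lambda \) are our own:

```python
import numpy as np

def penalized_fitness(x, profits, weights, C, lam=1e6):
    """Fitness of a 0-1 KP solution with the penalty of Eq. (12):
    f(x) minus lambda times the amount by which the total weight
    exceeds the capacity C (zero for feasible solutions)."""
    x = np.asarray(x)
    profit = float(np.dot(profits, x))
    violation = max(0.0, float(np.dot(weights, x)) - C)  # c in Eq. (12)
    return profit - lam * violation

profits = np.array([60, 100, 120])
weights = np.array([10, 20, 30])
```

For a feasible solution the penalty term vanishes and the fitness reduces to the plain profit; any capacity violation makes the fitness strongly negative.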

A set of 0-1 KPs was generated according to [35, 36] for the performance evaluation. The numbers of items were set to 50, 100, 250, 500, 800, 1000, 1200, 1500, 2000 and 2500, and three cases of each scale were generated to obtain comprehensive and accurate results. The weight \(w_{j}\) and the profit \(p_{j}\) were produced randomly from 5 to 20 and from 50 to 100, respectively. The weight capacity C was correspondingly set to 600, 1200, 3000, 6000, 10,000, 12,000, 15,000, 18,000, 25,000, and 30,000. For low-dimensional instances, in which the number of decision variables is less than 1000, the population size and maximum generation were set to 100 and 5000, respectively. For high-dimensional problems, in which the number of items is no less than 1000, the population size and maximum generation were set to 300 and 10,000, respectively. The experimental results are listed in Tables 11 and 12, and the summary results of the ranking and W-test are given in Tables 13 and 14.

Tables 11 and 12 show that DHLO finds the best known results on all the 0-1 KPs, while HLO, BDE, S_bAFSA, ABHS, BGSA, BBA, BABC, BVDPSO, and MBPSO find 19, 6, 6, 20, 5, 5, 3, 9 and 22 best known solutions out of 30 instances, respectively. Specifically, DHLO has search ability equal to that of HLO and ABHS on small-scale problems, since all of them can find the best-known values on the 50.1, 50.2, 50.3, 100.1, 100.2, 100.3 and 250.3 cases with a 100 % success rate. However, DHLO shows an advantage over the other meta-heuristics as the dimension of the problems increases. Table 12 illustrates that only DHLO can reach all the best fitness values when the number of items in the 0-1 KPs is more than 1000.

Table 11 The results of low-dimensional 0-1 knapsack problems
Table 12 The results of high-dimensional 0-1 knapsack problems

The ranking results in Table 13 show that the performance of all 10 algorithms, sorted in descending order, is DHLO, HLO, MBPSO, ABHS, BVDPSO, BDE, S_bAFSA, BGSA, BBA, and BABC, and the W-test results in Table 14 show that DHLO is significantly better than MBPSO, HLO, ABHS, BVDPSO, BDE, S_bAFSA, BGSA, BBA, and BABC on 18, 18, 21, 23, 27, 27, 27, 27, and 27 out of 30 instances, respectively, while it is worse than them on none.

4.4 Multidimensional knapsack problems

The multidimensional knapsack problem (MKP) is a multi-constrained problem. The objective of the MKP is still to find an optimal subset with the maximum total profit, but subject to multiple constraints instead of the single constraint of the basic 0-1 knapsack problem, which can be formulated as Eq. (13):

$$\begin{aligned}&max\;f(x_1 ,x_2 ,\ldots ,x_N )=\sum \limits _{j=1}^N {p_j x_j} \nonumber \\&s.t.\left\{ {\begin{array}{ll} \sum \limits _{j=1}^N {r_{ij} x_j \le c_i} , &{} \,i\in \{1,2,\ldots ,M\} \\ x_j \in \{0,1\}, &{} \,j\in \{1,2,\ldots ,N\} \\ \end{array}} \right. \end{aligned}$$
(13)

where N is the number of items, M is the number of constraints, \(p_{j}\) is the profit of the jth item, \(c_{i}\) is the capacity of the ith knapsack, and \(r_{ij}\) is the weight of the jth item in the ith knapsack with capacity constraint \(c_{i}\).

Table 13 The rankings of all the algorithms on the 0-1 knapsack problems
Table 14 The summary of the W-test results between DHLO and the other meta-heuristics on 0-1 knapsack problems

The MKP is well known to be much more difficult than the basic single-constrained 0-1 knapsack problem; thus various powerful local search or repair strategies have been developed and introduced into meta-heuristics to fix infeasible solutions and improve results. However, the real performance of meta-heuristics would be concealed by these additional heuristic operators, and therefore the penalty function strategy is still adopted for the MKPs. Previous work [37] indicates that the penalty function method called pCOR achieves the best results on MKPs, and thus pCOR is adopted in this paper; it can be described as Eqs. (14) and (15),

$$\begin{aligned} pCOR(x)= & {} \frac{p_{\max } +1}{r_{\min } } \times \max \{CV(x,i)\} \end{aligned}$$
(14)
$$\begin{aligned} CV(x,i)= & {} \max \left( 0,\sum \limits _{j=1}^N {r_{ij} x_j} -c_i \right) \end{aligned}$$
(15)

where pCOR(x) is the penalty coefficient used in the penalty function for infeasible solutions, \(p_{max}\) is the maximum profit coefficient, \(r_{min}\) is the minimum resource consumption, and CV(x,i) is the amount of constraint violation for constraint i.
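Under our reading that the pCOR quantity of Eq. (14) is subtracted from the objective, the penalized fitness can be sketched as follows; restricting \(r_{min}\) to the strictly positive entries of the weight matrix is our own assumption:

```python
import numpy as np

def pcor_fitness(x, profits, r, c):
    """Penalized MKP fitness based on Eqs. (14)-(15).

    r is the (M, N) resource-consumption matrix and c the (M,) capacity
    vector. CV(x, i) is the violation of constraint i; the quantity
    (p_max + 1) / r_min * max_i CV(x, i) is subtracted from the profit."""
    x = np.asarray(x)
    cv = np.maximum(0.0, r @ x - c)                     # CV(x, i), Eq. (15)
    p_max = float(profits.max())
    r_min = float(r[r > 0].min())
    penalty = (p_max + 1.0) / r_min * float(cv.max())   # pCOR(x), Eq. (14)
    return float(profits @ x) - penalty

profits = np.array([10.0, 5.0])
r = np.array([[3.0, 2.0], [1.0, 4.0]])
c = np.array([4.0, 4.0])
```

The factor \((p_{max}+1)/r_{min}\) scales the largest constraint violation so that even a small infeasibility outweighs the profit that could be gained from it.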

For a comprehensive comparison, six problem sets from the OR-Library, i.e. Pet, Sento, HP, 5-100, 10-100, and gk, in which the number of items ranges from 6 to 2500, are adopted to test the performance of DHLO as well as the other meta-heuristics. For the problems in which the number of items is less than 1000, the population size and the maximum generation of all the algorithms are set to 100 and 5000, respectively. Otherwise, the cases are regarded as high-dimensional problems and the population size and the maximum generation of the meta-heuristics are increased to 300 and 10,000, respectively. The numerical results are given in Tables 15, 16, and 17, and the ranking and W-test results on all the instances are summarized in Tables 18 and 19, respectively.

Table 15 Results of the Pet and Sento problem sets
Table 16 Results of the 5.100 and 10.100 problem sets
Table 17 Results of the gk problem set
Table 18 The rankings of all the algorithms on multidimensional knapsack problems

The results in Tables 15, 16, and 17 indicate that MKPs are much more complicated than the basic 0-1 KPs: most algorithms with the penalty function method can only find the best solutions of the first six instances of the simple problem set Pet, in which the number of items is no more than 39. On the complicated problem sets 5.100, 10.100, and gk, only DHLO successfully finds an optimal solution, on case 5.100.05. Specifically, DHLO, HLO, BDE, S_bAFSA, ABHS, BGSA, BBA, BABC, BVDPSO, and MBPSO find 12, 8, 11, 9, 6, 5, 4, 5, 10, and 6 best known solutions out of 42 instances, respectively, and DHLO achieves better fitness values on all the instances. Table 18 shows that the performance ranking of all the algorithms, sorted in descending order, is DHLO, HLO, S_bAFSA, BVDPSO, BDE, MBPSO, BBA, ABHS, BABC, and BGSA, and the W-test results in Table 19 indicate that DHLO also has an advantage over the other algorithms on MKPs, as it is superior to HLO, BDE, S_bAFSA, ABHS, BGSA, BBA, BABC, BVDPSO, and MBPSO on 29, 31, 31, 38, 40, 37, 40, 32, and 39 out of 42 instances, respectively.

Table 19 The summary of the W-test results between DHLO and the other meta-heuristics on multidimensional knapsack problems
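The pairwise significance comparisons could be reproduced along the following lines. This sketch assumes "W-test" denotes the two-sided Wilcoxon signed-rank test, as is common when comparing meta-heuristics over repeated runs; the run values below are hypothetical, not taken from the tables.

```python
from itertools import product

def wilcoxon_exact(a, b):
    """Exact two-sided Wilcoxon signed-rank test for small paired samples."""
    d = [x - y for x, y in zip(a, b) if x != y]   # drop zero differences
    abs_sorted = sorted(abs(v) for v in d)
    def rank(v):                                  # average rank of |v|
        idxs = [i + 1 for i, w in enumerate(abs_sorted) if w == abs(v)]
        return sum(idxs) / len(idxs)
    ranks = [rank(v) for v in d]
    w_plus = sum(r for v, r in zip(d, ranks) if v > 0)
    mean = sum(ranks) / 2
    # enumerate all 2^n sign assignments for the exact null distribution
    count = sum(1 for signs in product([0, 1], repeat=len(d))
                if abs(sum(r for s, r in zip(signs, ranks) if s) - mean)
                >= abs(w_plus - mean) - 1e-12)
    return w_plus, count / 2 ** len(d)

# Hypothetical best fitness values over 10 independent runs on one instance
dhlo_runs  = [24381, 24380, 24381, 24379, 24381, 24381, 24378, 24381, 24380, 24381]
other_runs = [24378, 24375, 24374, 24370, 24370, 24368, 24363, 24364, 24361, 24360]

w_plus, p_value = wilcoxon_exact(dhlo_runs, other_runs)
print(w_plus, p_value)   # one algorithm wins every run, so p_value is far below 0.05
```

With a p-value below the usual 0.05 threshold, the instance would be counted as a win for the better-performing algorithm in the summary table.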

In summary, based on the results of the benchmark functions and knapsack problems, it is fair to claim that the presented DHLO has better optimization performance, in terms of search accuracy and scalability, than HLO, BDE, S_bAFSA, ABHS, BGSA, BBA, BABC, BVDPSO, and MBPSO. In addition, the results on the CEC05 benchmark functions and knapsack problems hint that the performance of the algorithms is sensitive to the problem type. For example, BDE and S_bAFSA perform better on high-dimensional numerical function problems than the two PSO variants, i.e. MBPSO and BVDPSO, while these two binary PSO algorithms both surpass BDE and S_bAFSA on 0-1 KPs. As for MBPSO and BVDPSO, MBPSO is superior to BVDPSO on 0-1 KPs but worse than BVDPSO on MKPs. PSO, DE, AFSA, and the other algorithms were originally developed to tackle continuous or discrete problems, and therefore their operators need to be re-defined and modified for binary problems. However, these re-definitions or modifications are not always easy or natural, and the varied strategies change the search ability of the algorithms and lead to different strengths and weaknesses, which causes the diverse performance of MBPSO and BVDPSO on 0-1 KPs and MKPs. Compared with the other meta-heuristics such as PSO, DE, and AFSA, HLO is an inherently binary-coded algorithm, and the results of the benchmark functions and knapsack problems show that HLO has more robust and steadier performance on binary problems. Therefore, it is reasonable that the presented DHLO gains an advantage over the other algorithms: it inherits the excellent characteristics of HLO on binary problems, while the developed dynamic adjusting strategy as well as the re-initialization of the IKDs adaptively balance the exploitation and exploration abilities and efficiently help the algorithm escape from local optima.

5 Concluding remarks

Human Learning Optimization is a novel binary-coded meta-heuristic based on a simplified model of human learning. By mimicking the random learning, individual learning, and social learning of human beings, HLO develops three learning operators, i.e. the random learning operator, the individual learning operator, and the social learning operator, to search for the optimal solution efficiently. However, all the individuals in the standard HLO share the same control parameters of the learning operators, that is, all the individuals possess the same learning ability, which is not true in a real human population. Inspired by the facts that human IQ scores follow a Gaussian distribution and increase with the development of technology, this paper presents an improved HLO algorithm, named Diverse Human Learning Optimization (DHLO), in which a Gaussian-distributed learning operator and a dynamic adjusting strategy are introduced. By generating a set of control parameters of the learning operators that follow a Gaussian distribution, the robustness of the algorithm is strengthened. Besides, in cooperation with the dynamic updating operation, DHLO can adjust toward better parameter values, and hence the global search ability of the algorithm is enhanced. The proposed DHLO is applied to the CEC05 benchmark functions and knapsack problems to evaluate its performance against the standard HLO and eight other meta-heuristics, i.e. BDE, S_bAFSA, ABHS, BGSA, BBA, BABC, BVDPSO, and MBPSO. The comparison results demonstrate that DHLO is superior to the other nine algorithms in terms of search accuracy and scalability.

As mentioned above, DHLO, like HLO, is based on a simplified human learning model, while real human learning is an extremely complicated process. During the last decades, many achievements in cognitive science and learning theories have been reported. Therefore, one direction of our future work is to introduce these achievements on human learning into HLO to further refine the algorithm. Another important direction is to extend the applications of HLO, both for better understanding its characteristics and for further improving its performance.