Keywords

1 Introduction

Our model continues the works by Hinton and Nowlan and Mayley [1, 2], who simulated some features of interaction between learning and evolution. We use also the quasispecies model [3, 4] and our estimations of the evolutionary rate and the efficiency of evolutionary algorithms [5].

2 Description of the Model

We consider the evolving population of modeled organisms. Each organism has the genotype and the phenotype. We assume that the genotype and the phenotype of the organism have the same form, namely, they are chains; symbols of both chains are equal to 0 or 1. The length of these chains is equal to N. For example, we can assume that the genotype encodes a modeled DNA chain, symbols of which are equal to 0 or 1, and the phenotype is determined by the neural network of the organism, the synaptic weights of the neural network are equal to 0 or 1 too. These weights are adjusted by means of learning during the organism life.

The evolving population consists of n organisms, genotypes of organisms are SGk, k = 1,…,n. The organism genotype SGk is a chain of symbols, SGki, i = 1,…,N. N, n >> 1, 2N >> n. The values N and n do not change during evolution. Symbols SGki are equal to 0 or 1. The evolutionary process is a sequence of generations. The new generation is obtained from the old one by means of selection and mutations. Genotypes of organisms of the initial generation are random. Organisms inherit the genotypes from their parents, these genotypes do not change during the organism life and are transmitted (with small mutations) to their descendants. Mutations are random changes of symbols SGki. The evolutionary process is similar to that of in the quasispecies model [3, 4].

Phenotypes of organisms SPk are chains of symbols SPki, k = 1,…,n, i = 1,…,N; SPki = 0 or 1. The organism receives the genotype SGk at its birth. The initial phenotype of the organism at its birth is equal to the organism genotype: SPk(t = 1) = SGk. The lifetime of any organism is equal to T. The time is discrete: t = 1,…,T. T is the duration of the generation. The phenotype SPk is modified during the organism life by means of learning.

It is assumed that there is the certain optimal chain SM, which is searched for in processes of evolution and learning. Symbols SMi of this chain SM are also equal to 0 or 1; the length of the chain SM is N. For a concrete computer simulation, the chain SM is fixed; symbols of this chain are chosen randomly.

Learning is performed by means of the following method of trial and error. In every time moment t, each symbol of the phenotype SPk of any organism is randomly changed to 0 or 1, and if this new symbol SPki coincides with the corresponding symbol SMi of the optimal chain SM, then this symbol is fixed in the phenotype SPk, otherwise, the old symbol of the phenotype SPk is restored. So, during learning, the phenotype SPk moves towards the optimal chain SM.

At the end of the generation, the selection of organisms in accordance with their fitness takes place. The fitness of the k-th organism is determined by the final phenotype SPk in the time moment t = T. We denote this chain SFk, i.e. we set SFk = SPk(t = T). The fitness of the k-th organism is determined by the Hamming distance ρ = ρ(SFk,SM) between the chains SFk and SM:

$$ f_{k} = { \exp }\left[ {{-}\beta \rho \left( {{\mathbf{S}}_{{{\mathbf{F}}k}} ,{\mathbf{S}}_{{\mathbf{M}}} } \right)} \right] + \varepsilon , $$
(1)

where β is the positive parameter, which characterizes the intensity of selection, 0 < ε << 1. The role of the value ε in (1) can be considered as the influence of random factors of the environment on the fitness of organisms.

The selection of organisms into a new generation is made by means of the well-known method of fitness proportionate selection (or roulette wheel selection). The probability of the selection of a certain organism into the next generation is proportional to its fitness. The choice of an organism into the next generation takes place n times, so the number of organisms in the population at all generations is equal to n.

Thus, organisms are selected at the end of a generation in accordance with their final phenotypes SFk = SPk(t = T), i.e. in accordance with the final result of learning, whereas genotypes SGk (modified by small mutations) are transmitted from parents to descendants.

Additionally, similar to Mayley [2], we take into account the learning load, namely, we assume that the learning process has a certain burden for the organism and the fitness of the organism can be reduced under the influence of the load. For this purpose, we consider the modified fitness of organisms:

$$ f_{mk} = { \exp }\left( { - \alpha d} \right){\{ }{ \exp }\left[ {{-}\beta \rho \left( {{\mathbf{S}}_{{{\mathbf{F}}k}} ,{\mathbf{S}}_{{\mathbf{M}}} } \right)} \right] + \varepsilon \} , $$
(2)

where α is the positive parameter, which takes into account the learning load, d = ρ(SGk,SFk) is the Hamming distance between the initial SPk(t = 1) = SGk and the final SPk(t = T) = SFk phenotypes of the organism, i.e. the value d characterizes the intensity of the whole learning process of the organism during its life.

3 Results of Computer Simulation

Two modes of operation of the model are considered: (1) the regime of the evolution combined with learning, as described above, (2) the regime of “pure evolution”, that is the evolution without learning. The parameters of the model at the simulation are chosen in such manner that the evolutionary search is effective (the experience of the work [5] at this choice was used): N = 100, β = 1, Pm = N −1 = 0.01, n = N = 100, T = 2, ε = 10−6, α = 1 (Pm is the probability of change of any symbol SGki at mutations).

The results of the simulation are averaged over 1000 or 10000 calculations. This averaging ensures the good accuracy of the simulation; typical errors are smaller than 1–2%.

3.1 Comparison of Regimes of Pure Evolution and Evolution Combined with Learning

Figure 1 shows the dependence of the average (for the population) Hamming distance ρ = ρ(SGk,SM) between the genotypes SGk of organisms in the population and the optimal chain SM on the generation number G. The curve 1 characterizes the regime of evolution combined with learning; the curve 2 characterizes the regime of pure evolution. The fitness of organisms is determined by the expression (1). We can see that the pure evolution without learning (the curve 2) does not optimize organisms Sk; whereas evolution combined with learning (the curve 1) obviously ensures the movement towards the optimal chain SM.

Fig. 1.
figure 1

The dependence of the average Hamming distance <ρ> = <ρ(SGk,SM)> between genotypes SGk and the optimal chain SM on the generation number G. The curve 1 characterizes the regime of evolution combined with learning; the curve 2 characterizes the regime of pure evolution.

To understand, why the pure evolution does not ensure a decrease of the value ρ, let us estimate the value of the fitness (1) in the initial population. The Hamming distance ρ = ρ(SGk,SM) for initial genotypes is of the order of N/2 = 50, therefore, exp(−ρ) ~ 10−22 and exp(−ρ) << ε. This means that all organisms of the population have approximately the same value of the fitness fk ≈ ε. Consequently, the evolutionary optimization of genotypes does not occur in the case of the pure evolution. Thus, the movement towards SM occurs only in the presence of learning; this movement leads to the decrease of the value ρ.

Consider the effect of the acceleration of the evolutionary process by learning (the curve 1 in Fig. 1). Analysis of the results of simulations shows that the gradual decrease of the values ρ = ρ(SGk,SM) occurs as follows. First, the learning shifts the distribution of organisms n(ρ) on the value ρ towards smaller ρ, so the values ρ = ρ(SFk,SM) become small enough, such that exp[−ρ(SFk,SM)] is of the order of ε. Consequently, the fitnesses of organisms in the population in accordance with (1) become essentially different; so, organisms with small values ρ(SFk,SM) are selected into the population of the next generation. It is intuitively clear that the genotypes of SGk of selected organisms should be rather close to the final phenotypes SFk (obtained as a result of the learning) of these organisms. Thus, the result of the selection is choosing of organisms, which genotypes are also moving to the optimal chain SM. Therefore, values ρ = ρ(SGk,SM) in the new population decrease.

The described mechanism of the genetic assimilation is characterized by Fig. 2, which shows the distributions of the number of organisms n(ρ) for given ρ in the population for different moments of the first generation. The curve 1 shows the distribution n(ρ) for ρ = ρ(SGk,SM) for the initial genotypes of organisms at the beginning of the generation. The curve 2 shows the distribution n(ρ) for ρ = ρ(SFk,SM) for organisms after the learning, but before the selection. The curve 3 shows the distribution n(ρ) for ρ = ρ(SFk,SM) for organisms, selected in accordance with the fitness (1). The curve 4 shows the distribution n(ρ) for ρ = ρ(SGk,SM) for the genotypes of selected organisms at the end of the generation. The genotypes of selected organisms SGk are sufficiently close to the final phenotypes of learned and selected organisms SFk, therefore the distribution n(ρ) for genotypes SGk (the curve 4) moves towards the distribution for final phenotypes SFk (the curve 3). A similar displacement of the distribution n(ρ) towards smaller values ρ takes place in the next generations.

Fig. 2.
figure 2

The distributions n(ρ) in the first generation of evolution for different moments of the generation. The curve 1 is the distribution n(ρ) for ρ = ρ(SGk,SM) for the initial genotypes before the learning. The curve 2 is the distribution n(ρ) for ρ = ρ(SFk,SM) for organisms after the learning, but before the selection. The curve 3 is the distribution n(ρ) for ρ = ρ(SFk,SM) for selected organisms. The curve 4 is the distribution n(ρ) for ρ = ρ(SGk,SM) for the genotypes of selected organisms at the end of the generation.

It should be underlined that the decrease of values ρ at the learning should be sufficiently large in order to ensure the small role of the parameter ε and the significant difference of the fitnesses (1) of organisms after the learning, and therefore, the effective selection of organisms with small values ρ(SFk,SM). In order to guarantee the effective operation this mechanism, the learning should be enough strong.

3.2 Hiding Effect

Thus, the strong learning can accelerate the evolutionary search. However, the strong learning can also prevent a finding of the optimal genotype. The curve 1 in Fig. 1 shows that at large G the decrease of values <ρ> = <ρ(SGk,SM)> is limited: the final value <ρ> remains quite large, the asymptotic value <ρ> is approximately equal to 6.2. This fact is due to the following. At large G (G ~ 1000), the strong learning leads to the finding of the optimal phenotype SPopt = SM independently on the genotype SGk. Therefore, at the final stages of the evolutionary process, the genotypes SGk do not move towards the optimum SM. So, the hiding effect [2] is observed.

3.3 Influence of the Learning Load on the Modeled Processes

We also analyzed the influence of the learning load on the modeled processes. For this case, the fitness of organisms is determined by the expression (2). The simulation is performed for the mentioned parameters (N = n = 100, β = 1, Pm = 0.01, T = 2, ε = 10−6, α = 1). The simulation results show that the learning load leads to the considerable acceleration of the evolutionary search for the optimal chain SM. This acceleration is due to the fact that the learning load results in the more strong selection of organisms that have small distance ρ(SGk,SFk) between the initial SPk(t = 1) = SGk and the final SPk(t = T) = SFk phenotypes, than for the case of the fitness (1). This form of the selection in accordance with the expression (2) leads to the additional minimization of changes of phenotypes SPk during the learning process. The simulation demonstrates that the optimal genotype SGopt = SM in the considered case is found.

The on-line paper [6] describes results of the simulation in more details.

4 Conclusion

The analysis of the simulation shows that (a) the genetic assimilation, (b) the hiding effect, and (c) the significant acceleration of the genetic assimilation and the evolutionary process under the influence of the leaning load are observed under the following assumptions:

  1. (1)

    Each organism of the evolving population has a genotype and a phenotype.

  2. (2)

    The genotype and the phenotype are chains of symbols; the both chains have the same form.

  3. (3).

    Genotypes of organisms are transmitted from parents to descendants with small mutations. The genotype of the organism is not changed during its life.

  4. (4)

    The initial phenotype of the organism at its birth is equal to the organism genotype.

  5. (5)

    There is a certain optimal chain, which is searched for by means of learning and evolution. The optimal chain has the same form as the genotype and the phenotype.

  6. (6)

    The phenotype is essentially adjusted by means of learning during the organism lifetime. During learning, the phenotype moves towards the optimal chain.

  7. (7)

    The selection of organisms into a new generation occurs in accordance with final phenotypes of organisms.