Next: The Baldwin Effect and
Up: The Use of Genetic
Previous: Hardware and Software Packages
The first experiment was designed to gauge the effects of
different mutation rates upon the emergence of the Baldwin effect.
3-dimensional input vectors were used to train the SOM.
Data on the effects of each mutation rate was collected from 20
trials of 1,000 generations each. Mutation rates (pm) of
0.001 to 0.01 in increments of 0.001 were used. Then the average
learning iterations per
generation (ALI) was plotted against mutation rate. The value of ALI
plotted was the average of the 20 trials for each value of pm.
In all the runs the
crossover probability (pc) was kept constant at 0.2. The results
are plotted in Figure 2.
Figure 2:
ALI vs. Mutation Probability
 |
We observe in this figure
that average
fitness decreased as the mutation rate increased in a roughly linear
fashion.
Fitting the results to a linear equation using the least square method
yielded the following equation:
A subsequent run of the simulation showed that the equation's
predictions were correct to within 8% of the actual data points.
Other simulations using input vectors of higher dimensionality
produced similar linear equations whose predicted values are
consistently within 10% of the actual data. The equations produced
are given below:
4-dimensional:
5-dimensional:
6-dimensional:
The rates of deterioration of fitness (the slope terms) were
consistently within 2% of each other -- even when the search space
was expanded from 3 to 6 dimensions. In each case,
fitness was found to be a linearly
decreasing function of mutation rate.
The effect of higher crossover rates on the Baldwin
effect is similar to that of increasing the mutation rates. Again
using the least squares approximation algorithm, we arrive at the
following linear fit for the 3-dimensional vector case
(pm = 0.001):
for
.
As in the case of mutation, increasing the probability of crossover
decreases the ability of the network to learn.
In fact, the
individuals with the least number of learning iterations were part
of an asexual population (pc = 0). For 400 trials of 1000
generations each, by far the highest performing population operated
with pc = 0 and pm = 0.001. In other words, most of the
optimization performance of the algorithm seemed to be coming from
straight selection with low mutation rates.
Doing similar simulations with higher dimensional input vectors led to
similar results. The linear equations produced by least square
analysis are as follows:
4-dimensional:
5-dimensional:
6-dimensional:
As before, the behavior of the learning process was very similar in all
the cases without regard to the dimensionality of the input vectors.
We performed one further simulation of this system with a pc = 0.2
and a pm = 0.0. The ALI obtained was 388. This value clearly does not
fall on the line pictured in Figure 2. This value of ALI lies between
those obtained using a pm of 0.003 and 0.004. It is also 50 points
higher than the ALI obtained at pc = 0.2 and a pm = 0.001.
From this we believe it can be concluded that although the Baldwin effect
can operate effectively in an asexual environment its efficiency is
decreased by the absence of a small quantity of mutative pressure.
Next: The Baldwin Effect and
Up: The Use of Genetic
Previous: Hardware and Software Packages
Aaron Konstam
1999-10-04