Q500, Intro to Cog Sci, Lecture Notes

Lecture Notes, Week 10

R. Port, Q500, Intro to Cognitive Science - Sean McLennan subbing
Nov. 1/99

Emergence

An Example in Detail: Termites

Without centralized control and detailed knowledge of architecture and physics, how can termites build their nests? (Figures from Kugler & Turvey - pgs 69-89)

Begin with random deposists (Fig. 1)
Pheremones from deposits diffuse - termites have a preference for depositing where concentrations are greatest (Fig. 2)
Preferred deposit sites develop - equidistant because of diffusion patterns (Fig. 3)
Pillars begin to form because top of pile is only active source of diffusion (Fig. 4)
Saddle Point attractors bias deposits to one side of pillars resulting in arches (Fig. 5) (Fig. 6) (Fig. 7)
In large numbers, same phenomenon leads to dome construction (Fig. 8)
Repeat cyclically (Fig. 9)

Begining with randomness (flight paths, deposit sites), based on simple rules (deposit in location of highest pheremone concentration), and basic physics (diffusion of pheremones), a complex structure emerges

"Emergence"

For the most part, "Emergence" is a blanket term (somewhat of a buzzword) that encompasses an enormous range of phenomena. It has also been called "self-organization".

The fact that self-organization / emergence seems to contradict the second law of thermodynamics (entropy) which states that over time there is a tendency for systems to lose energy and complexity, has long been recognized. First was Spencer, a contemporary of Darwin, discussion life and evolution as a contradiction to entropy. Others (Doyne Farmer, Chris Langton) have gone so far as to claim that emergence is a weak-force of the universe on par with entropy. Chris Langton uses his "lambda" function (seen in our CA lab) as a definition of where emergence counteracts entropy.

Examples of Emergent Phenomena:

evolution
life
traffic jams
development
cognition
chess
planetary orbits
butterfly wing patterns
weather
analog computers
the Internet
the economy
social interaction
ant hills

Emergent phenomena literally pervade every aspect of our daily lives

Principles of Emergence (such as they are)

There are a few things that seem common to all emergent phenomena:

"much from little"
agents (atoms, cells, people, cars, termites)
rules (physics, behaviour patterns)
no centralized control
persistence of higher level entities / patterns / structures despite changing components (waves, traffic jams, individual identity, human bodies)
hierarchical organization: emergent structures / patterns / entities can act as agents in higher level phenomena (atoms -> molecules -> cells -> people -> society)

It also seems possible to define two different type of emergent phenomena, direct vs. indirect. Direct emergence is more causative; for example a thermometer. That it can act as a measure of temperature is an emergent phenomenon, but the two factors, temperature and volume of mercury/alcohol, are coupled in a causative relationship. Other examples: traffic jams, sunflowers following the sun, planetary orbits, analog computers. Indirect emergence is mediated by learned behaviour (evolutionarily or otherwise). So pheremones do not move mud and cannot create a termite mound themselves.

Studying Emergence

Emergence is a very difficult thing to study because of its pervasiveness. Some believe that it is impossible to study, and believe that there are no fundamental, underlying rules that govern it. Of course, others disagree.

The primary method of study to this point has been through:

Reduction: reduce the phenomena to their agents and rules. Emphasis is placed on the rules and interactions, which is a departure from the way reduction usually works in science. For example, sound waves: sound waves can be reduced to atomic components. However with sound waves it is true that the "whole is equal to the sum of its parts." This is not true of emergent phenomena.
Modeling: more so than in other areas there is a strong movement to model emergent phenomena. We will be looking at one modeling environment called Starlogo in the lab.

References

Kugler, Peter and Michael Turvey. 1987. Information, Natural Law, and the Self-assembly of Rhythmic Movement. Hillsdale, NJ: Lawrence Erbaum Associates.

Holland, John. 1995. Hidden Order. Reading, MA: Helix Books

Holland, John. 1998. Emergence. Reading, MA: Helix Books

Levy, Steven. 1992. Artificial Life.New York: Vintage Books

Genetic Algorithms: Computing with Emergence

History:

50's and 60's - Evolutionary systems studied with an eye towards optimization of engineering problems.
60's - Field of "Evolution Strategies" (which developed mostly independently of GAs) founded
60's - Genetic Algorithms invented by John Holland. Originally a method of studying adaptation as it occurs in nature.
60's and 70's - Holland and students develop ideas at University of Michigan.

Terminology

Chromosomes: in biology a string of DNA; in GAs a string of 1's and 0's.
Genes: in biology a functional block of DNA; in GAs a position in the string.
Alleles: in biology the possible "setting" of a Gene trait; in GAs possible values of a position (1 / 0).
Crossover: in both biology and GAs recombining genes from two parents.
Mutation: in both biology and GAs random changing of an allele.
Search Space: in both biology and GAs a hypothetical, multidimensional space in which all possible enumerations of variables (solutions to problems) are represented by points in the space. Usually good solutions are characterized by a minima (low point) or maxima (high point) in the space.

Elements of GAs

A population of chromosomes
Selection according to fitness (defined by a "fitness function")
Crossover to produce new offspring
?Random mutation (not required)

Basic Process

Create a random population (20-100 is typical)
Calculate the fitness of each chromosome based on the fitness function
Select the fittest (sometimes using a random, but biassed funtion)
Perform crossover - choose a point in the chromosome and swap one side creating two new chromosomes
Perform mutation (optional)
Repeat (from #2) as needed (usually 20-50 generations)

An Online Example / Tutorial

The following site has an excellent visual tutorial of GAs.

http://cs.felk.cvut.cz/~xobitko/ga/

Applications

Optimization (Engineering problems, Travelling salesman, Prisoner's Dilemma)
Automatic Programming (Daniel Hillis' sorting algorithm for n=16 - also example of coevolution)
Machine Learning (Robokoneko)
Economics (modeling)
Immune Systems (modeling)
Ecology (modeling - especially host-parasite)
Population Genetics (studying factors that make certain genes viable)
Evolution and Learning (individual learning and species evolution)
Social Systems (modeling)
Artificial Life

Schema Theorem: Implicit Category Information

Schemata are essentially similarity templates that describe a set of chromosomes that share values in certain positions. To describe a schema we must add the wildcard “*” to our string notation. Thus, the schema *0 describes a subset of 2 chromosomes: {10, 00}; 1** describes the subset {100, 101, 110, 111}, etc. Of course, schemata with no *’s describe sets of 1 element — i.e. the notion of schemata subsumes individual chromosomes.

The total number of possible schemata given a chromosome length of l is 3^l since there are three possibilities at each position: 1, 0, or *. The chromosomes, whose values are set, are instantiations of 2^l schemata since each position may take its actual value or the wildcard. To see this, let’s examine an example of a short chromosome length, l = 3, for which there are 3^3 = 27 possible schemata. The chromosome 101 is an instantiation of 2^3 = 8 of those 27 schemata: {***, 1**, *0*, **1, 10*, 1*1, *01, 101}.

The important insight is that a single chromosome, in fact, also represents a great number more schemat a — i.e. categories — and thus in some sense, by judging the fitness of that individual, the fitness of each category is also judged. By the same token, the fitness of an individual chromosome is also in a sense a function of the fitnesses of each schema it represents. The fitness of a schema is defined as the average of the fitnesses of all instantiations of that schema in the populat ion. Although this figure is never explicitly calculated in a GA it is implicitly calculated because individual chromosomes are members of a population. It can therefore be seen that in the process of selection, not only are relatively fit individuals selected and mated, but also relatively fit schemata.

Schemata, themselves, can be thought of as instantiations of other schemata. For example, *1**1* is as much an instantiation of ***1* and *1*** as the chromosome 010010. Thus, ***1* and *1*** can be regarded as “building blocks” of *1**1*. Let’s say that *1**1* is a solution to a given problem and any chromosome instantiating that schema would be evaluated as 100% fit. Chromosomes that are instantiations of *1*** and ***1* (but not *1**1*) and thus contain building blocks of the target schema would be evaluated with relatively high fitnesses, 50% say, increasing the likelihood they will be mated together. This in turn increases the likelihood that their building blocks will be combined to evolve the target solution.

Taken in a slightly different light, schemata can be thought of as hypotheses. The string 101 makes 8 implicit “hypotheses” about the solution to the problem; that a 1 in the first position is important to the solution; that a zero in the second position is important to the solutions; that a 1 in the first position and a zero in the second position is important to the solution; etc. The merit of those hypotheses is rated by the fitness function and by comparing the similarities between the chromosomes that have the highest ratings, the hypotheses with the greatest merit eventually emerge.

Implicit parallelism — this power to judge many categories by judging a single member — is the primary power of the GA. One chromosome implicitly represents a number of schemata, and a single evaluation of that chromosome implicitly evaluates all the associated schemata. In the process of crossover and mutation, relatively short schemata are not disrupted and are allowed to propagate through the population from generation to generation, guiding the search through solution space.

The following summarizes the major points of Schema Threorem:

a. A single representation (ex. a chromosome) implicitly contains a huge amount of information about the categories to which it belongs.
b. Processes that act on those representations (ex. selection) can implicitly make use of all that information in parallel.
c. Valuable information (ex. a solution) can emerge from the repetition of the same process.

References

Holland, John. (1975). Adaption in Natural and Artificial Systems. Ann Arbor, MI: University of Michigan Press.

Holland, John. 1995. Hidden Order. Reading, MA: Helix Books

Mitchell, Melanie. (1996). An Introduction to Genetic Algorithms. Cambridge, MA: MIT Press.