Walter O. Krawec

 

ENN - Videos

Video 1: Click to play

Description:
This robot uses four binary proximity sensors and learns to follow a wall. Note that there is only a penalty for crashing into a wall and there is no reward for following it. Also each of the four sensors report a binary 1 or 0 if there is a wall or not (there is no distance calculation done by the sensor). Despite this however the ENN evolved a behavior allowing it to hug the wall.

 

Video 2: Click to play

Description:
Robot with two short range IR proximity sensors and one long range proximity sensor attached to a motor. ENN learns to use the long range IR to "scan" for obstacles.

 

Video 3: Click to play

Description:
Two evolving robots interacting with each other. Each has 3 binary obstacle sensors and 4 binary robot proximity detectors (which determine if another robot is nearby in front, back, left, or right).

Robots each begin with a random ENN (evolving neural network) which slowly evolves (in real time) as the robot interacts with its environment. Robots were not directly rewarded for interacting with each other yet developed behaviors causing them to group together.

 

Video 4: Click to play

Description:
Here the white robot has three IR proximity sensors wheras the blue robot has no IR sensors. The blue robot is however able to detect if a robot is in front, behind, to the left, or right. After some time, the white robot learns to use its IR sensors to avoid walls while the blue robot learns to follow the other.

 

Video 5: Click to play

Description:
Demonstrating how a robot can be taught certain tasks. Here we teach a robot to follow a line. We teach the robot simply by "clapping" to guide the robot as to which way to turn. While line following is a simple task, this teaching technique can be applied to many other learning goals. Note: the green circles represent the line detectors of the robot.

 

Video 6: Click to play

Description:
Showing a robot that evolved a method of scanning left and right at certain intervals for a wall. This sweeping behavior occasionally presents itself to varying degrees (in particular, some robots scan more than the one recorded here)

 

Video 7: Click to play

Description:
Showing how a robot can be evolved in stages. The robot begins with only two IR sensors covering the left side. Once these have been learned, we add (in real time without having to reset the robot) two new sensors and the robot learns to incorporate these into its strategy.

 

Video 8: Click to play

Description:
Three robots with three IR sensors. These robots also have the ability to produce two signals: call or reject (green/red circles resp.). When a robot detects a call signal (green), a robot turns towards it. When a robot detects a reject signal (red), a robot moves away from it (or if it is directly in front, the robot pauses for a short time).

The first 30 seconds were recorded near the start of the simulation where most movement is still random. By the 30 second mark however, we see two robots have learned to "follow" each other (the two robots starting at the left of the screen).

The robots accomplish this by having one robot (the leader) emit a reject call to keep the follower at a distance. The follower however emits a call signal to keep the leader nearby. An ENN has control over where and when to emit a call/reject signal. Receiving such a signal however is controlled by a robot's instincts.

The green pellets are "food" sources and provide a positive reinforcement.

 

Video 9: Click to play

Description:
Showing two robots - one using EL learning, the other U Learning. The robot using EL for STM has learned to follow a wall at the start of the video (which begins several minutes after the simulation began). Towards the late middle of the video, the robot changes its strategy and begins simply avoiding walls. At the end, the robot again seems to be using the following strategy. The U learning robot learned to avoid walls completely.

 

Video 10: Click to play

Description:
Showing a graph of the activity of the first 8 hidden neurons in an ENN as it evolves. At the start of the video there are only two hidden neurons (why the other graphs are flat-lined). Towards the end new neurons are created and their activity is graphed.

In our system, each hidden neuron uses a sigmoid activation function with range (0,1). Inputs to the network (from the robot's sensors) are binary 0 or 1. We find it interesting that an ENN develops a spiking pattern in most of its neurons.

Note that delays/pauses in the video are mostly due to a robot's reflex running which pauses an ENN's operation. Also note that this only shows the first 8 neurons in a list - some of them may be disabled and later re-enabled as the ENN develops. Typically robots of this configuration (3 input IR sensors, 2 output motors) evolve to use 10-30 neurons.