... (TM)1
http://www.irobot.com/consumer/roomba_technology.cfm
... things2
Physical objects. Ssssss. They hurts us, my precious.
... lead3
Only.
... component.4
The world simulator might, for example, be used in a MondoSoft game system offering.
... measure.5
The enterprising student will be able to show that minimizing trial length is equivalent to maximizing average reward received for goal-seeking agents who receive one unit of reward at the GOAL STATE and zero elsewhere.
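One way to see the claim is sketched below. It rests on two assumptions not stated in the footnote: that "average reward" means reward per time step, and that a new trial begins as soon as the GOAL STATE is reached. The symbols $N$ (number of trials), $T_i$ (length of trial $i$), $\bar{T}$ (mean trial length) and $\bar{r}$ (average reward per step) are introduced here for illustration only.

```latex
\[
\bar{r}
  \;=\; \frac{\text{total reward}}{\text{total steps}}
  \;=\; \frac{N}{\sum_{i=1}^{N} T_i}
  \;=\; \frac{1}{\bar{T}},
\qquad
\bar{T} \;=\; \frac{1}{N}\sum_{i=1}^{N} T_i .
\]
```

Each trial contributes exactly one unit of reward, so the total reward over $N$ trials is $N$. Because $1/\bar{T}$ is strictly decreasing in $\bar{T}$ for $\bar{T} > 0$, under these assumptions any change that lowers the mean trial length raises the average reward per step, and vice versa.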
... average.6
Because the START and GOAL STATEs may be randomly selected, it only makes sense to talk about the average performance of the AGENT: some TRIALs will inevitably be shorter or longer simply because the START and GOAL happen to be closer together or farther apart. Over many TRIALs these fluctuations average out, and the AGENT should be able to reach its goal more quickly on average.
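The averaging argument can be illustrated with a small simulation. This is only a sketch under assumptions that are not part of the original text: a square grid of hypothetical size 10, an agent that always takes a shortest path (so trial length is the Manhattan distance), and START and GOAL drawn independently and uniformly. The names `trial_length` and `average_trial_length` are made up for this example.

```python
import random


def trial_length(rng, size):
    """Length of one trial for a shortest-path agent on a size x size grid.

    START and GOAL are drawn independently and uniformly (an assumption),
    so the length is just the Manhattan distance between them.
    """
    sx, sy = rng.randrange(size), rng.randrange(size)
    gx, gy = rng.randrange(size), rng.randrange(size)
    return abs(sx - gx) + abs(sy - gy)


def average_trial_length(num_trials, size=10, seed=0):
    """Mean trial length over num_trials randomly generated trials."""
    rng = random.Random(seed)
    lengths = [trial_length(rng, size) for _ in range(num_trials)]
    return sum(lengths) / len(lengths)


if __name__ == "__main__":
    rng = random.Random(1)
    # Individual trials fluctuate purely because of where START and GOAL land.
    print("ten single trials:", [trial_length(rng, 10) for _ in range(10)])
    # Averages over more and more trials settle toward a stable value.
    for n in (10, 100, 10_000):
        print(n, "trials, mean length:", average_trial_length(n))
```

For this toy setup the expected Manhattan distance is 2(n^2 - 1)/(3n), which is 6.6 for n = 10, so the printed means should drift toward that value as the number of trials grows, even though single trials range anywhere from 0 to 18 steps.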
... non-trivial7
Non-trivial here means "reasonably mixed and vaguely like a real outdoor environment". Cases like "one solid band of Grass followed by one solid band of Mud followed by..." are ruled out. Essentially, if you look at the map of the environment and think "that could never possibly happen under natural circumstances", then it's probably a trivial environment.