First we will provide an environment according to the description as given in 4.4. This is a scilab programm called 'environment.sce' and it contains several functions. One is called [GRID]=gridgen(Y,X)
. This function generates from the two numbers Y and X a two-dimensional grid according to the structure of a wood1-world. In the theory such a function is not necessary because we simply assume some environment as 'given'.
Realizing an environment in the computer induces another problem. If we have more than one agent active in the same environment using a sequential processing we have to define some 'protocol' how we 'simulate' simultaneous actions. The most simple approach is to make before any new action cycle a random ordering and then process every agent one after the other.
To realize this we have to assume that the simulated environment embraces at least two components: (i) a two-dimensional grid
for the 'space' of the environment and (ii) a list of registered agents
with the basic information agent identifier, position, direction, new action, and reward
. Based on these informations it is possible to compute the content of visual input as well as the possible next moves. Thus we will have
![]() |
![]() |
![]() |
(4.51) |
![]() |
![]() |
![]() |
(4.52) |
![]() |
![]() |
![]() |
(4.53) |
![]() |
![]() |
![]() |
(4.54) |
![]() |
![]() |
![]() |
(4.55) |
![]() |
![]() |
![]() |
(4.56) |
![]() |
![]() |
![]() |
(4.57) |
![]() |
![]() |
![]() |
(4.58) |
![]() |
![]() |
![]() |
(4.59) |
![]() |
![]() |
![]() |
(4.60) |
Where the objects
represent the possible objects which can placed in one position of the grid.
Gerd Doeben-Henisch 2012-03-31