Compute the true utility function and the best linear
approximation in
-
A
${10}\times {10}$ world with a single$+1$ terminal state at (10,10). -
As in (a), but add a
$-1$ terminal state at (10,1). -
As in (b), but add obstacles in 10 randomly selected squares.
-
As in (b), but place a wall stretching from (5,2) to (5,9).
-
As in (a), but with the terminal state at (5,5).
The actions are deterministic moves in the four directions. In each
case, compare the results using three-dimensional plots. For each
environment, propose additional features (besides