Here are a couple of sample (estimated) value functions, both using radial basis functions for state representation.
The first one is for an agent in a 1D world with -1 reward everywhere except on the left and right side:
This second one is for an agent in a 2D world with -1 reward everywhere except at the 4 corners:
No comments:
Post a Comment