Policy evaluation

cosmos 15th July 2017 at 7:13pm
Reinforcement learning

The task of calculating a Value function for a Policy, in a Reinforcement learning problem.

Model-based

Using dynamic programming: iterative solution of Bellman expectation equation (as in Policy iteration)

Model-free

See Model-free reinforcement learning