Value function approximation: Cosmos — All that is, or was, or ever will be

Value function approximation

cosmos 21st July 2017 at 2:49pm

Fit Value functions with a function approximator (often a parametric one), as in Supervised learning. This reduces the memory and time requirements to learn them, and solve the Reinforcement learning problem.

Types of function approximators

Incremental methods

Batch methods

Experience replay