On-policy learning: Cosmos — All that is, or was, or ever will be

On-policy learning

cosmos 15th July 2017 at 8:25pm

Model-free reinforcement learning methods for which the sampling policy is the same as the policy which we are optimizing/evaluating.