Real-time dynamic programming: Cosmos — All that is, or was, or ever will be

Real-time dynamic programming

cosmos 18th July 2017 at 2:40pm

RTDP

Asynchronous dynamic programming which uses On-policy trajectory sampling for choosing the state which are going to be backed up.

RTDP algorithms like (learning real-time A*) are applied to Stochastic optimal path problems