Experience replay
cosmos
1st October 2019 at 2:08am
Q-learning
Value function approximation
http://incompleteideas.net/book/bookdraft2017nov5.pdf#page=382