State-action pair

cosmos 15th July 2017 at 8:35pm
Markov decision process

A node in a Markov decision process representing "taking action A just after visiting state S", usually written as the pair (S, A). See Sutton and Barto, Reinforcement Learning: An Introduction.
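
To make the idea concrete, here is a minimal sketch of state-action pairs as nodes sitting between states. The two-state MDP, its action names, probabilities and rewards are all made up for illustration, and `StateActionPair`, `transitions`, `expected_reward` and `actions_from` are hypothetical names, not anything from the book.

```python
from typing import NamedTuple


class StateActionPair(NamedTuple):
    """The node 'action taken just after visiting state'."""
    state: str
    action: str


# Hypothetical toy MDP (assumed values, for illustration only).
# Each state-action pair maps to its outgoing edges:
# a list of (probability, next_state, reward).
transitions = {
    StateActionPair("s0", "stay"): [(1.0, "s0", 0.0)],
    StateActionPair("s0", "go"): [(0.8, "s1", 1.0), (0.2, "s0", 0.0)],
    StateActionPair("s1", "stay"): [(1.0, "s1", 0.0)],
}


def expected_reward(pair: StateActionPair) -> float:
    """Expected immediate reward of a state-action node."""
    return sum(p * r for p, _next_state, r in transitions[pair])


def actions_from(state: str) -> list[str]:
    """Actions whose state-action nodes hang off the given state."""
    return sorted(pair.action for pair in transitions if pair.state == state)


if __name__ == "__main__":
    print(actions_from("s0"))
    print(expected_reward(StateActionPair("s0", "go")))
```

In this picture a state branches into its state-action pairs (the agent's choice), and each pair branches into possible next states (the environment's response), which is why quantities such as action values are indexed by the pair rather than by the state alone.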