TD model: Cosmos — All that is, or was, or ever will be

TD model

cosmos 28th July 2017 at 8:00pm

A model of Learning (in particular Classical conditioning)

See sec 14.2.4 of Sutton-Barto

It corresponds to Temporal difference learning with Linear approximation of the value function (which in this context need not represent value; here it represents the intensity of the response).
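
A minimal sketch of that correspondence, under assumptions not taken from this note: the prediction is a dot product of a weight vector with a stimulus feature vector, the eligibility trace is the usual accumulating one, and each time step of the conditioned stimulus gets its own one-hot feature. The names `td_lambda_step`, `one_hot` and `run_trials`, and all parameter values, are hypothetical and chosen only for illustration.

```python
def td_lambda_step(w, z, x, x_next, reward, alpha=0.1, gamma=0.9, lam=0.0):
    """One TD(lambda) update for a linear prediction v(x) = w . x."""
    v = sum(wi * xi for wi, xi in zip(w, x))
    v_next = sum(wi * xi for wi, xi in zip(w, x_next))
    # TD error: US intensity plus discounted next prediction, minus current prediction
    delta = reward + gamma * v_next - v
    # Accumulating eligibility trace over the stimulus features
    z = [gamma * lam * zi + xi for zi, xi in zip(z, x)]
    # Move the associative strengths along the trace, scaled by the TD error
    w = [wi + alpha * delta * zi for wi, zi in zip(w, z)]
    return w, z


def one_hot(i, n):
    """Feature vector with a single active component (assumed representation)."""
    return [1.0 if j == i else 0.0 for j in range(n)]


def run_trials(n_trials=500, n_steps=4, alpha=0.1, gamma=0.9, lam=0.0):
    """Repeat a trial in which the CS lasts n_steps and the US follows its last step."""
    w = [0.0] * n_steps
    for _ in range(n_trials):
        z = [0.0] * n_steps  # traces are reset at the start of each trial
        for t in range(n_steps):
            last = t == n_steps - 1
            x = one_hot(t, n_steps)
            x_next = [0.0] * n_steps if last else one_hot(t + 1, n_steps)
            reward = 1.0 if last else 0.0  # US arrives after the final CS step
            w, z = td_lambda_step(w, z, x, x_next, reward, alpha, gamma, lam)
    return w


if __name__ == "__main__":
    print(run_trials())
```

In this toy setting the learned weights play the role of response intensity: they should grow across the CS, ending near the US magnitude just before the US arrives and being discounted by `gamma` for each step further back.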