Markov reward process

cosmos 17th May 2017 at 10:07pm

A Markov decision process with a fixed policy