Heuristic search

cosmos 17th July 2017 at 3:28pm
Decision-time planning

Classical decision-time algorithm in Artificial intelligence. For each state encountered, a large tree of possible continuations is considered. The approximate value function is applied to the leaf nodes and then backed up toward the current state at the root.

See sec 8.9 in Sutton-Barto