Deep Learning with Pytorch in a Nutshell

⌘Ctrlk

Tabular solution method

Dynamic Programming

Need all possible transitions probability

Monte Carlo

Temporal difference (TD)

PreviousProof of Bellman equation

Last updated 7 years ago