Tabular solution method

Dynamic Programming

  • Need all possible transitions probability

Monte Carlo

Temporal difference (TD)

Last updated