Probabilistic - Total
•
Modelled as MDPs. (also called fully
observable MDPs)
•
Output : Policy (State
Action)
•
Bellman Equation
V*(s)=min
aεA(s)
[c(a)+Σ
s’εS
V*(s’)P(s’|s,a)]