Disjunctive - Total
• Output: Policy (State -> Action)
• Bellman Equation
$$V^*(s) = \min_{a \in A(s)} \left[ c(a) + \max_{s' \in F(a,s)} V^*(s') \right]$$
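The min-max recursion can be sketched as a small value iteration in Python. This is an illustrative sketch, not the slide's own algorithm. It assumes several things the slide does not state: a set of goal states with value 0, strictly positive action costs, and a non-empty successor set F(a, s) for every applicable action. The names `minimax_value_iteration`, `goals`, `outcomes`, and the toy problem are all hypothetical.

```python
import math

def minimax_value_iteration(states, goals, actions, cost, outcomes, max_iters=1000):
    """Iterate V(s) = min_a [c(a) + max_{s' in F(a,s)} V(s')] until nothing changes.

    actions[s]       -> iterable of actions applicable in s, i.e. A(s)
    cost[a]          -> cost c(a) of action a (assumed > 0)
    outcomes[(a, s)] -> non-empty set of possible successors, i.e. F(a, s)
    """
    # Assumption: goal states cost nothing; every other state starts at infinity.
    V = {s: (0.0 if s in goals else math.inf) for s in states}

    def q(a, s):
        # Worst-case cost of taking a in s: c(a) plus the worst successor value.
        return cost[a] + max(V[t] for t in outcomes[(a, s)])

    for _ in range(max_iters):
        changed = False
        for s in states:
            if s in goals:
                continue
            best = min((q(a, s) for a in actions.get(s, ())), default=math.inf)
            if best < V[s]:
                V[s] = best
                changed = True
        if not changed:
            break

    # Output: a policy (State -> Action), defined where a finite worst case exists.
    policy = {
        s: min(actions[s], key=lambda a: q(a, s))
        for s in states
        if s not in goals and not math.isinf(V[s])
    }
    return V, policy


# Hypothetical toy problem: "risky" is cheap but may land in the expensive state s2.
states = {"s0", "s1", "s2", "g"}
goals = {"g"}
actions = {"s0": ["risky", "safe"], "s1": ["finish"], "s2": ["slow"]}
cost = {"risky": 1, "safe": 3, "finish": 1, "slow": 5}
outcomes = {
    ("risky", "s0"): {"s1", "s2"},
    ("safe", "s0"): {"s1"},
    ("finish", "s1"): {"g"},
    ("slow", "s2"): {"g"},
}

V, policy = minimax_value_iteration(states, goals, actions, cost, outcomes)
print(V["s0"], policy["s0"])
```

In this toy problem the worst case of "risky" from s0 is 1 + max(1, 5) = 6, while "safe" costs 3 + 1 = 4, so the policy picks "safe" in s0. Starting non-goal states at infinity means a state only receives a finite value once some action has all of its outcomes valued, which matches the max over F(a, s).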