Deductive approach
Start from the final reward
model.
Regress the reward directly over
the abstract states, using a
situation calculus framework.
Extremely slow: requires a
theorem prover.
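
To make the regression step concrete, here is a minimal sketch of one backward step from a reward model. It is a propositional, STRIPS-style simplification and not the situation calculus itself: actions are assumed deterministic, an abstract state is a set of facts that must hold, and the toy domain (`open_door`, `pickup_key`, `close_door`) and the names `Action`, `regress`, `regress_reward` are hypothetical. In the full first-order setting, simplifying and comparing the regressed formulas is the part that needs a theorem prover.

```python
from typing import Dict, FrozenSet, NamedTuple, Optional, Tuple


class Action(NamedTuple):
    """Deterministic STRIPS-style action (an assumption of this sketch)."""
    name: str
    pre: FrozenSet[str]     # facts that must hold before the action
    add: FrozenSet[str]     # facts the action makes true
    delete: FrozenSet[str]  # facts the action makes false


def regress(cond: FrozenSet[str], action: Action) -> Optional[FrozenSet[str]]:
    """Return the abstract state from which `action` is guaranteed to reach `cond`.

    Returns None when the action deletes a fact that `cond` requires.
    """
    if cond & action.delete:
        return None
    # Facts the action adds no longer need to hold beforehand;
    # the action's own preconditions must hold instead.
    return (cond - action.add) | action.pre


def regress_reward(
    reward_model: Dict[FrozenSet[str], float],
    actions: Tuple[Action, ...],
    gamma: float = 0.9,
) -> Dict[Tuple[str, FrozenSet[str]], float]:
    """One backward step: map (action, abstract state) to discounted reward."""
    values = {}
    for cond, reward in reward_model.items():
        for action in actions:
            pre_state = regress(cond, action)
            if pre_state is not None:
                values[(action.name, pre_state)] = gamma * reward
    return values


# Hypothetical toy domain: reward 10 once the door is open.
open_door = Action("open_door", frozenset({"has_key", "at_door"}),
                   frozenset({"door_open"}), frozenset())
pickup_key = Action("pickup_key", frozenset({"at_key"}),
                    frozenset({"has_key"}), frozenset())
close_door = Action("close_door", frozenset({"door_open"}),
                    frozenset(), frozenset({"door_open"}))

reward_model = {frozenset({"door_open"}): 10.0}
one_step = regress_reward(reward_model, (open_door, pickup_key, close_door))
```

Each key of `one_step` pairs an action with the abstract state it was regressed to, so no ground state is ever enumerated. Repeating the step on the regressed states would extend the horizon, which is where the number of abstract states, and the cost of checking them against one another, grows.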