Recap LP solution
•
Minimise Σ
sεS
α(s)V*(s)
Under constraints
For every s, a
V*(s) ≥ R(s) +
γΣ
s’εS
Pr(s’|a,s)V*(s’)
•
α(s) > 0
•
Solution : infeasible as exponential
variables, exponential constraints.