Recap LP solution
Minimise ΣsεS α(s)V*(s)
Under constraints
For every s, a
V*(s) ≥ R(s) +
            γΣs’εS Pr(s’|a,s)V*(s’)
α(s) > 0
Solution : infeasible as exponential
variables, exponential constraints.