Modified Policy iteration
•
Rather than evaluating the actual value of
policy by solving system of equations,
approximate it by using value iteration with
fixed policy.