Policy Improvement Algorithm - Changing Policies
Old Policy:

New Policy:

After the second iteration of the policy improvement algorithm, the policy has not changed. That is, the algorithm states that "Expert repair" should be used when the computer is down.
Is the current solution optimal?
Yes, because the new policy is identical to the old policy. When two successive iterations of the policy improvement algorithm yield the same policy, that policy is necessarily optimal.
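The stopping rule described above can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration: the two-state model ("up"/"down"), the costs, the transition probabilities, the "novice repair" alternative, and the use of a discounted-cost criterion with factor GAMMA are hypothetical and are not the data or the exact formulation used in this demonstration. The only point it is meant to show is the loop: evaluate the current policy, improve it, and stop as soon as the improved policy equals the previous one.

```python
# Hypothetical two-state repair model. All numbers below are illustrative
# assumptions, not values taken from the demonstration.
STATES = ["up", "down"]
ACTIONS = {"up": ["operate"], "down": ["expert repair", "novice repair"]}

# COST[(state, action)]: immediate expected cost (assumed values).
COST = {
    ("up", "operate"): 0.0,
    ("down", "expert repair"): 60.0,
    ("down", "novice repair"): 50.0,
}

# P[(state, action)]: next-state probabilities (assumed values).
P = {
    ("up", "operate"): {"up": 0.9, "down": 0.1},
    ("down", "expert repair"): {"up": 0.9, "down": 0.1},
    ("down", "novice repair"): {"up": 0.6, "down": 0.4},
}

GAMMA = 0.9  # assumed discount factor


def q_value(state, action, values):
    """Expected discounted cost of taking `action` in `state`, then following `values`."""
    future = sum(p * values[nxt] for nxt, p in P[(state, action)].items())
    return COST[(state, action)] + GAMMA * future


def evaluate(policy, sweeps=2000):
    """Value determination: approximate the policy's cost by repeated sweeps."""
    values = {s: 0.0 for s in STATES}
    for _ in range(sweeps):
        values = {s: q_value(s, policy[s], values) for s in STATES}
    return values


def improve(policy, values):
    """Policy improvement: pick the cheapest action in each state."""
    new_policy = {}
    for s in STATES:
        best = min(ACTIONS[s], key=lambda a: q_value(s, a, values))
        # Keep the current action on (near-)ties so the policy cannot cycle.
        if q_value(s, policy[s], values) <= q_value(s, best, values) + 1e-9:
            best = policy[s]
        new_policy[s] = best
    return new_policy


def policy_iteration(policy):
    """Iterate until two successive policies are identical, then stop."""
    iterations = 0
    while True:
        iterations += 1
        values = evaluate(policy)
        new_policy = improve(policy, values)
        if new_policy == policy:  # unchanged policy, so it is optimal
            return policy, values, iterations
        policy = new_policy


if __name__ == "__main__":
    start = {"up": "operate", "down": "novice repair"}
    final_policy, final_values, n = policy_iteration(start)
    print(final_policy, n)
```

With these assumed numbers, starting from "novice repair" the first iteration switches the "down" state to "expert repair", and the second iteration leaves the policy unchanged, so the loop stops there.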
Optimal Policy:

This concludes the demonstration.