Policy Improvement Algorithm - Formulation Previous Next
The question is which mode of repair should be used when the computer is down.
This question is easily answered by calculating , the long-run average cost per unit time, for the two alternatives. Let us apply this exhaustive enumeration approach first before applying the policy improvement algorithm. Letting
State 0 = computer is up,
State 1 = computer is down,
these calculations are summarized below.
Operator Repair:


Expert Repair: