Policy Improvement Algorithm - Defining Data Previous Next
The cost data given earlier provides the  values shown below, where infinity is entered by entering a - for infeasible combinations of  and .
The transition probabilities given earlier yield the  values shown below. Note. if a good machine (state 0) is "repaired" by either Expert or Operator repair, then it will automatically be up an hour later.
Starting with  ("Operator repair"), we now execute the algorithm roughly as it is done with the interactive routine.