policyIteAve {MDP} | R Documentation |
Perform policy iteration (average criterion) on the MDP.
policyIteAve(mdp, iW, iDur)
mdp |
The MDP loaded using loadMDP. |
iW |
Weight index we want to optimize with respect to. |
iDur |
Index of duration such that discount rates can be calculated. |
The policy can afterwards be recieved using functions getPolicy
and getPolicyW
.
The optimal gain (g) calculated.
Lars Relund lars@relund.dk