Abstract
We present algorithms for finding optimal strategies for discounted, infinite-horizon, Determinsitc Markov Decision Processes (DMDPs). Our fastest algorithm has a worst-case running time of O(mn), improving the recent bound of O(mn2) obtained by Andersson and Vorbyov [2006]. We also present a randomized O(m1/2n2)-time algorithm for finding Discounted All-Pairs Shortest Paths (DAPSP), improving an O(mn2)-time algorithm that can be obtained using ideas of Papadimitriou and Tsitsiklis [1987].
Translated title of the contribution | Discounted deterministic Markov decision processes and discounted all-pairs shortest paths |
---|---|
Original language | English |
Journal | ACM Transactions on Algorithms |
Volume | 6 |
Issue number | 2 |
ISSN | 1549-6325 |
Publication status | Published - 1 Mar 2010 |
Externally published | Yes |