Abstract
We present algorithms for finding optimal strategies for discounted, infinite-horizon, Determinsitc Markov Decision Processes (DMDPs). Our fastest algorithm has a worst-case running time of O(mn), improving the recent bound of O(mn2) obtained by Andersson and Vorbyov [2006]. We also present a randomized O(m1/2n2)-time algorithm for finding Discounted All-Pairs Shortest Paths (DAPSP), improving an O(mn2)-time algorithm that can be obtained using ideas of Papadimitriou and Tsitsiklis [1987].
Bidragets oversatte titel | Discounted deterministic Markov decision processes and discounted all-pairs shortest paths |
---|---|
Originalsprog | Engelsk |
Tidsskrift | ACM Transactions on Algorithms |
Vol/bind | 6 |
Udgave nummer | 2 |
ISSN | 1549-6325 |
Status | Udgivet - 1 mar. 2010 |
Udgivet eksternt | Ja |