Abstract
We introduce the notion of LImited Memory Influence Diagram (LIMID) to describe multistage decision problems in which the traditional assumption of no forgetting is relaxed. This can be relevant in situations with multiple decision makers or when decisions must be prescribed under memory constraints, such as in partially observed Markov decision processes (POMDPs). We give an algorithm for improving any given strategy by local computation of single policy updates and investigate conditions for the resulting strategy to be optimal.
Original language | English |
---|---|
Journal | Management Science |
Volume | 47 |
Issue number | 9 |
Pages (from-to) | 1235-1251 |
Number of pages | 17 |
ISSN | 0025-1909 |
DOIs | |
Publication status | Published - 2001 |
Externally published | Yes |