Online Learning in Markov Decision Processes with Adversarially Chosen Transition Probability Distributions

Yasin Abbasi-Yadkori, Peter L. Bartlett, Varun Kanade, Yevgeny Seldin, Csaba Szepesvári

27 Citations (Scopus)

Fingerprint

Dive into the research topics of 'Online Learning in Markov Decision Processes with Adversarially Chosen Transition Probability Distributions'. Together they form a unique fingerprint.

Engineering and Materials Science