TUTCRIS - Tampereen teknillinen yliopisto

TUTCRIS

Periodic finite state controllers for efficient POMDP and DEC-POMDP planning

Tutkimustuotosvertaisarvioitu

Yksityiskohdat

AlkuperäiskieliEnglanti
OtsikkoAdvances in Neural Information Processing Systems 24: 25th Annual Conference on Neural Information Processing Systems 2011, NIPS 2011
TilaJulkaistu - 2011
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisussa
Tapahtuma25th Annual Conference on Neural Information Processing Systems 2011, NIPS 2011 - Granada, Espanja
Kesto: 12 joulukuuta 201114 joulukuuta 2011

Conference

Conference25th Annual Conference on Neural Information Processing Systems 2011, NIPS 2011
MaaEspanja
KaupunkiGranada
Ajanjakso12/12/1114/12/11

Tiivistelmä

Applications such as robot control and wireless communication require planning under uncertainty. Partially observable Markov decision processes (POMDPs) plan policies for single agents under uncertainty and their decentralized versions (DEC-POMDPs) find a policy for multiple agents. The policy in infinite-horizon POMDP and DEC-POMDP problems has been represented as finite state controllers (FSCs). We introduce a novel class of periodic FSCs, composed of layers connected only to the previous and next layer. Our periodic FSC method finds a deterministic finite-horizon policy and converts it to an initial periodic infinitehorizon policy. This policy is optimized by a new infinite-horizon algorithm to yield deterministic periodic policies, and by a new expectation maximization algorithm to yield stochastic periodic policies. Our method yields better results than earlier planningmethods and can compute larger solutions than with regular FSCs.

!!ASJC Scopus subject areas