Finding Approximate POMDP Solutions Through Belief Compression

Abstract
We introduce a new method for solving large-scale POMDPs by reducing the dimensionality of the belief space. We use Exponential family Principal Components Analysis (Collins, Dasgupta & Schapire, 2002) to represent sparse, high-dimensional belief spaces with small sets of learned features of the belief state, and then plan only in terms of these low-dimensional belief features. By planning in the low-dimensional space, we can find policies for POMDP models that are orders of magnitude larger than those that conventional techniques can handle. We demonstrate the algorithm on a synthetic problem and on mobile robot navigation tasks.
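To make the compression step concrete, below is a minimal sketch of an exponential-family PCA factorization applied to a matrix of sampled belief vectors. It assumes a Poisson-style link (reconstructed beliefs are exp(U @ V)) fit by plain gradient descent; the function name epca_compress, the optimizer, and all hyperparameters are illustrative choices, not the authors' implementation, which uses a more careful fitting procedure and a subsequent planning stage over the compressed coordinates.

```python
import numpy as np

def epca_compress(B, k, iters=2000, lr=1e-3, seed=0):
    """Rough sketch of exponential-family PCA for belief compression.

    B : (n_states, n_beliefs) matrix whose columns are sampled belief vectors
    k : number of low-dimensional belief features to learn
    Returns the learned basis U (n_states x k) and coordinates V (k x n_beliefs).
    """
    rng = np.random.default_rng(seed)
    n, m = B.shape
    U = 0.01 * rng.standard_normal((n, k))   # basis of belief features
    V = 0.01 * rng.standard_normal((k, m))   # low-dimensional coordinates per belief
    for _ in range(iters):
        R = np.exp(U @ V)                    # reconstructed (unnormalized) beliefs
        G = R - B                            # gradient of the Poisson-link loss
        dU, dV = G @ V.T, U.T @ G
        U -= lr * dU
        V -= lr * dV
    return U, V

# Usage sketch: compress a set of beliefs (here random, for illustration only),
# then plan in the k-dimensional space using V's columns as compact belief states.
rng = np.random.default_rng(1)
B = np.abs(rng.standard_normal((200, 500)))
B /= B.sum(axis=0, keepdims=True)            # each column is a valid belief
U, V = epca_compress(B, k=5)
B_hat = np.exp(U @ V)
B_hat /= B_hat.sum(axis=0, keepdims=True)    # renormalized reconstructions
```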
