Hierarchical POMDP Controller Optimization by Likelihood Maximization

06/13/2012
by   Marc Toussaint, et al.
0

Planning can often be simpli ed by decomposing the task into smaller tasks arranged hierarchically. Charlin et al. [4] recently showed that the hierarchy discovery problem can be framed as a non-convex optimization problem. However, the inherent computational di culty of solving such an optimization problem makes it hard to scale to realworld problems. In another line of research, Toussaint et al. [18] developed a method to solve planning problems by maximumlikelihood estimation. In this paper, we show how the hierarchy discovery problem in partially observable domains can be tackled using a similar maximum likelihood approach. Our technique rst transforms the problem into a dynamic Bayesian network through which a hierarchical structure can naturally be discovered while optimizing the policy. Experimental results demonstrate that this approach scales better than previous techniques based on non-convex optimization.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/07/2020

Learning Convex Optimization Models

A convex optimization model predicts an output from an input by solving ...
research
12/21/2017

Non-convex Optimization for Machine Learning

A vast majority of machine learning algorithms train their models and pe...
research
05/14/2018

FastLORS: Joint Modeling for eQTL Mapping in R

Yang et al. (2013) introduced LORS, a method that jointly models the exp...
research
05/18/2009

Colorization of Natural Images via L1 Optimization

Natural images in the colour space YUV have been observed to have a non-...
research
04/16/2018

Using Convex Optimization of Autocorrelation with Constrained Support and Windowing for Improved Phase Retrieval Accuracy

In imaging modalities recording diffraction data, the original image can...
research
11/05/2019

Computational Separations between Sampling and Optimization

Two commonly arising computational tasks in Bayesian learning are Optimi...
research
06/26/2023

GloptiNets: Scalable Non-Convex Optimization with Certificates

We present a novel approach to non-convex optimization with certificates...

Please sign up or login with your details

Forgot password? Click here to reset