Path Integral Control by Reproducing Kernel Hilbert Space Embedding

08/13/2012
by Konrad Rawlik, et al.

We present an embedding of stochastic optimal control problems of the so-called path integral form into reproducing kernel Hilbert spaces. Using consistent, sample-based estimates of the embedding leads to a model-free, non-parametric approach for computing an approximate solution to the control problem. This formulation admits a decomposition of the problem into an invariant component and a task-dependent component. Consequently, we make much more efficient use of the sample data than previous sample-based approaches in this domain, e.g., by allowing sample re-use across tasks. Numerical examples on test problems, which illustrate the sample efficiency, are provided.
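The split into an invariant and a task-dependent component can be illustrated with a generic sample-based conditional embedding estimate. The following is a minimal sketch under stated assumptions, not the authors' algorithm: the Gaussian kernel, the ridge regularizer `reg`, the one-dimensional toy dynamics, the exponentiated-cost quantity, and all function names are hypothetical choices made for illustration. The weights depend only on the sampled transitions, so they are computed once and reused for two different cost functions.

```python
import math
import random

def gaussian_kernel(a, b, bandwidth=0.5):
    # Assumed kernel choice; the bandwidth is an illustrative value.
    return math.exp(-((a - b) ** 2) / (2.0 * bandwidth ** 2))

def solve(A, b):
    """Solve A z = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    z = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(M[r][c] * z[c] for c in range(r + 1, n))
        z[r] = (M[r][n] - s) / M[r][r]
    return z

def embedding_weights(xs, x_query, reg=1e-3):
    """Invariant part: weights (K + reg * n * I)^{-1} k(x_query).

    Uses only the sampled start states xs, never the task cost.
    """
    n = len(xs)
    K = [[gaussian_kernel(xi, xj) + (reg * n if i == j else 0.0)
          for j, xj in enumerate(xs)]
         for i, xi in enumerate(xs)]
    k_query = [gaussian_kernel(xi, x_query) for xi in xs]
    return solve(K, k_query)

def expected_exp_neg_cost(weights, ys, cost):
    """Task-dependent part: weighted sum of exp(-cost(y_i)) over next states."""
    return sum(w * math.exp(-cost(y)) for w, y in zip(weights, ys))

# Toy transition samples (x_i, y_i) from assumed uncontrolled dynamics.
random.seed(0)
xs = [random.uniform(-2.0, 2.0) for _ in range(60)]
ys = [x + random.gauss(0.0, 0.3) for x in xs]

# Computed once from the samples ...
weights = embedding_weights(xs, x_query=0.0)
# ... and reused for two different tasks (cost functions).
value_a = expected_exp_neg_cost(weights, ys, cost=lambda y: (y - 1.0) ** 2)
value_b = expected_exp_neg_cost(weights, ys, cost=lambda y: (y + 1.0) ** 2)
print(value_a, value_b)
```

Changing the task here only changes the vector of exponentiated costs evaluated at the sampled next states; the linear solve, which dominates the work, is not repeated.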


research
09/09/2020

Consistency and Regression with Laplacian regularization in Reproducing Kernel Hilbert Space

This note explains a way to look at reproducing kernel Hilbert space for...
research
06/15/2022

A short note on compact embeddings of reproducing kernel Hilbert spaces in L^2 for infinite-variate function approximation

This note consists of two largely independent parts. In the first part w...
research
11/22/2018

Solving Chance Constrained Optimization under Non-Parametric Uncertainty Through Hilbert Space Embedding

In this paper, we present an efficient algorithm for solving a class of ...
research
05/13/2020

Adaptive Smoothing Path Integral Control

In Path Integral control problems a representation of an optimally contr...
research
09/14/2021

Koopman Linearization for Data-Driven Batch State Estimation of Control-Affine Systems

We present the Koopman State Estimator (KoopSE), a framework for model-f...
research
08/22/2023

Addressing Dynamic and Sparse Qualitative Data: A Hilbert Space Embedding of Categorical Variables

We propose a novel framework for incorporating qualitative data into qua...
research
09/24/2017

On the Optimality of Kernel-Embedding Based Goodness-of-Fit Tests

The reproducing kernel Hilbert space (RKHS) embedding of distributions o...
