Variational Inference MPC using Normalizing Flows and Out-of-Distribution Projection

05/10/2022
by Thomas Power, et al.

We propose a Model Predictive Control (MPC) method for collision-free navigation that uses amortized variational inference to approximate the distribution of optimal control sequences by training a normalizing flow conditioned on the start, goal, and environment. This representation allows us to learn a distribution that accounts for both the dynamics of the robot and complex obstacle geometries. We can then sample from this distribution to produce control sequences that are likely to be both goal-directed and collision-free as part of our proposed FlowMPPI sampling-based MPC method. However, when deploying this method, the robot may encounter an out-of-distribution (OOD) environment, i.e., one that is radically different from those used in training. In such cases, the learned flow cannot be trusted to produce low-cost control sequences. To generalize our method to OOD environments, we also present an approach that projects the representation of the environment as part of the MPC process. This projection changes the environment representation to be more in-distribution while also optimizing trajectory quality in the true environment. Our simulation results on a 2D double-integrator and a 3D 12DoF underactuated quadrotor suggest that FlowMPPI with projection outperforms state-of-the-art MPC baselines on both in-distribution and OOD environments, including OOD environments generated from real-world data.
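
To make the sampling-based MPC setting concrete, the following is a minimal sketch of a single generic MPPI update on a 2D double integrator. It is not the authors' FlowMPPI implementation: plain Gaussian perturbations stand in for samples from the learned, environment-conditioned normalizing flow, and the OOD projection step is omitted. All function names, the circular-obstacle model, and the parameters `sigma`, `lam`, and `dt` are assumptions made for illustration.

```python
import math
import random


def rollout(state, controls, dt=0.1):
    """Simulate a 2D double integrator.

    state = (x, y, vx, vy); each control is an acceleration (ax, ay).
    """
    x, y, vx, vy = state
    traj = []
    for ax, ay in controls:
        vx += ax * dt
        vy += ay * dt
        x += vx * dt
        y += vy * dt
        traj.append((x, y, vx, vy))
    return traj


def cost(traj, goal, obstacles, collision_penalty=1e3):
    """Sum of distances to the goal, plus a penalty per state inside an obstacle.

    Obstacles are hypothetical circles given as (cx, cy, radius).
    """
    total = 0.0
    for x, y, _, _ in traj:
        total += math.hypot(x - goal[0], y - goal[1])
        for ox, oy, r in obstacles:
            if math.hypot(x - ox, y - oy) < r:
                total += collision_penalty
    return total


def mppi_step(state, nominal, goal, obstacles, rng,
              n_samples=64, sigma=1.0, lam=1.0):
    """One MPPI update of the nominal control sequence.

    Assumption: Gaussian noise around the nominal sequence replaces the
    learned flow that a FlowMPPI-style method would sample from.
    """
    samples, costs = [], []
    for _ in range(n_samples):
        u = [(ax + rng.gauss(0.0, sigma), ay + rng.gauss(0.0, sigma))
             for ax, ay in nominal]
        samples.append(u)
        costs.append(cost(rollout(state, u), goal, obstacles))

    # Subtract the minimum cost before exponentiating for numerical stability.
    c_min = min(costs)
    weights = [math.exp(-(c - c_min) / lam) for c in costs]
    z = sum(weights)
    weights = [w / z for w in weights]

    # New nominal sequence is the weighted average of the sampled sequences.
    return [
        (sum(w * u[t][0] for w, u in zip(weights, samples)),
         sum(w * u[t][1] for w, u in zip(weights, samples)))
        for t in range(len(nominal))
    ]


if __name__ == "__main__":
    rng = random.Random(0)
    start = (0.0, 0.0, 0.0, 0.0)
    nominal = [(0.0, 0.0)] * 10
    updated = mppi_step(start, nominal, goal=(1.0, 1.0),
                        obstacles=[(0.5, 0.5, 0.2)], rng=rng)
    print(updated[0])
```

In a receding-horizon loop, the first control of the updated sequence would be applied and the remainder shifted forward as the next nominal sequence. Replacing the Gaussian sampler with a learned conditional distribution is where a flow-based approach such as the one described above would differ from this sketch.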

research
03/30/2022

Autonomous Navigation of AGVs in Unknown Cluttered Environments: log-MPPI Control Strategy

Sampling-based model predictive control (MPC) optimization methods, such...
research
03/23/2021

Dual Online Stein Variational Inference for Control and Dynamics

Model predictive control (MPC) schemes have a proven track record for de...
research
04/01/2021

Variational Inference MPC using Tsallis Divergence

In this paper, we provide a generalized framework for Variational Infere...
research
02/24/2022

A Collision-Free MPC for Whole-Body Dynamic Locomotion and Manipulation

In this paper, we present a real-time whole-body planner for collision-f...
research
07/08/2019

Variational Inference MPC for Bayesian Model-based Reinforcement Learning

In recent studies on model-based reinforcement learning (MBRL), incorpor...
research
02/25/2021

Where to go next: Learning a Subgoal Recommendation Policy for Navigation Among Pedestrians

Robotic navigation in environments shared with other robots or humans re...
research
07/21/2022

Learning Deep SDF Maps Online for Robot Navigation and Exploration

We propose an algorithm to (i) learn online a deep signed distance funct...
