An Anytime Algorithm for Task and Motion MDPs

by Siddharth Srivastava, et al.

Integrated task and motion planning has emerged as a challenging problem in sequential decision making, where a robot needs to compute high-level strategy and low-level motion plans for solving complex tasks. While high-level strategies require decision making over longer time-horizons and scales, their feasibility depends on low-level constraints based upon the geometries and continuous dynamics of the environment. The hybrid nature of this problem makes it difficult to scale; most existing approaches focus on deterministic, fully observable scenarios. We present a new approach where the high-level decision problem occurs in a stochastic setting and can be modeled as a Markov decision process. In contrast to prior efforts, we show that complete MDP policies, or contingent behaviors, can be computed effectively in an anytime fashion. Our algorithm continuously improves the quality of the solution and is guaranteed to be probabilistically complete. We evaluate the performance of our approach on a challenging, realistic test problem: autonomous aircraft inspection. Our results show that we can effectively compute consistent task and motion policies for the most likely execution-time outcomes using only a fraction of the computation required to develop the complete task and motion policy.




