Embodied Active Learning of Relational State Abstractions for Bilevel Planning

03/08/2023
by   Amber Li, et al.
0

State abstraction is an effective technique for planning in robotics environments with continuous states and actions, long task horizons, and sparse feedback. In object-oriented environments, predicates are a particularly useful form of state abstraction because of their compatibility with symbolic planners and their capacity for relational generalization. However, to plan with predicates, the agent must be able to interpret them in continuous environment states (i.e., ground the symbols). Manually programming predicate interpretations can be difficult, so we would instead like to learn them from data. We propose an embodied active learning paradigm where the agent learns predicate interpretations through online interaction with an expert. For example, after taking actions in a block stacking environment, the agent may ask the expert: "Is On(block1, block2) true?" From this experience, the agent learns to plan: it learns neural predicate interpretations, symbolic planning operators, and neural samplers that can be used for bilevel planning. During exploration, the agent plans to learn: it uses its current models to select actions towards generating informative expert queries. We learn predicate interpretations as ensembles of neural networks and use their entropy to measure the informativeness of potential queries. We evaluate this approach in three robotic environments and find that it consistently outperforms six baselines while exhibiting sample efficiency in two key metrics: number of environment interactions, and number of queries to the expert. Code: https://tinyurl.com/active-predicates

READ FULL TEXT

page 1

page 7

research
03/17/2022

Inventing Relational State and Action Abstractions for Effective and Efficient Bilevel Planning

Effective and efficient planning in continuous state and action spaces i...
research
06/21/2022

Learning Neuro-Symbolic Skills for Bilevel Planning

Decision-making is challenging in robotics environments with continuous ...
research
08/16/2022

Learning Operators with Ignore Effects for Bilevel Planning in Continuous Domains

Bilevel planning, in which a high-level search over an abstraction of an...
research
06/18/2019

RadGrad: Active learning with loss gradients

Solving sequential decision prediction problems, including those in imit...
research
08/15/2021

Deep Active Learning for Text Classification with Diverse Interpretations

Recently, Deep Neural Networks (DNNs) have made remarkable progress for ...
research
07/26/2020

CAMPs: Learning Context-Specific Abstractions for Efficient Planning in Factored MDPs

Meta-planning, or learning to guide planning from experience, is a promi...
research
06/01/2022

Active Inference for Robotic Manipulation

Robotic manipulation stands as a largely unsolved problem despite signif...

Please sign up or login with your details

Forgot password? Click here to reset