Learning General Policies from Small Examples Without Supervision

01/03/2021
by   Guillem Francès, et al.
0

Generalized planning is concerned with the computation of general policies that solve multiple instances of a planning domain all at once. It has been recently shown that these policies can be computed in two steps: first, a suitable abstraction in the form of a qualitative numerical planning problem (QNP) is learned from sample plans, then the general policies are obtained from the learned QNP using a planner. In this work, we introduce an alternative approach for computing more expressive general policies which does not require sample plans or a QNP planner. The new formulation is very simple and can be cast in terms that are more standard in machine learning: a large but finite pool of features is defined from the predicates in the planning examples using a general grammar, and a small subset of features is sought for separating "good" from "bad" state transitions, and goals from non-goals. The problems of finding such a "separating surface" while labeling the transitions as "good" or "bad" are jointly addressed as a single combinatorial optimization problem expressed as a Weighted Max-SAT problem. The advantage of looking for the simplest policy in the given feature space that solves the given examples, possibly non-optimally, is that many domains have no general, compact policies that are optimal. The approach yields general policies for a number of benchmark domains.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/17/2018

Learning Features and Abstract Actions for Computing Generalized Plans

Generalized planning is concerned with the computation of plans that sol...
research
09/21/2021

Learning General Optimal Policies with Graph Neural Networks: Expressive Power, Transparency, and Limits

It has been recently shown that general policies for many classical plan...
research
12/15/2020

General Policies, Serializations, and Planning Width

It has been observed that in many of the benchmark planning domains, ato...
research
05/12/2022

Learning Generalized Policies Without Supervision Using GNNs

We consider the problem of learning generalized policies for classical p...
research
01/30/2018

Features, Projections, and Representation Change for Generalized Planning

Generalized planning is concerned with the characterization and computat...
research
05/28/2019

Guarantees for Sound Abstractions for Generalized Planning (Extended Paper)

Generalized planning is about finding plans that solve collections of pl...
research
05/10/2021

Expressing and Exploiting the Common Subgoal Structure of Classical Planning Domains Using Sketches: Extended Version

Width-based planning methods exploit the use of conjunctive goals for de...

Please sign up or login with your details

Forgot password? Click here to reset