Data Generation for Neural Programming by Example

11/06/2019
by   Judith Clymo, et al.
0

Programming by example is the problem of synthesizing a program from a small set of input / output pairs. Recent works applying machine learning methods to this task show promise, but are typically reliant on generating synthetic examples for training. A particular challenge lies in generating meaningful sets of inputs and outputs, which well-characterize a given program and accurately demonstrate its behavior. Where examples used for testing are generated by the same method as training data then the performance of a model may be partly reliant on this similarity. In this paper we introduce a novel approach using an SMT solver to synthesize inputs which cover a diverse set of behaviors for a given program. We carry out a case study comparing this method to existing synthetic data generation procedures in the literature, and find that data generated using our approach improves both the discriminatory power of example sets and the ability of trained machine learning models to generalize to unfamiliar data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/23/2020

Creating Synthetic Datasets via Evolution for Neural Program Synthesis

Program synthesis is the task of automatically generating a program cons...
research
08/28/2020

Type-driven Neural Programming by Example

In this thesis we look into programming by example (PBE), which is about...
research
06/19/2020

Learning to Prove from Synthetic Theorems

A major challenge in applying machine learning to automated theorem prov...
research
01/20/2023

MTGP: Combining Metamorphic Testing and Genetic Programming

Genetic programming is an evolutionary approach known for its performanc...
research
11/09/2017

Learning to select examples for program synthesis

Program synthesis is a class of regression problems where one seeks a so...
research
09/25/2018

Scenic: Language-Based Scene Generation

Synthetic data has proved increasingly useful in both training and testi...
research
02/28/2023

Generating Accurate Virtual Examples For Lifelong Machine Learning

Lifelong machine learning (LML) is an area of machine learning research ...

Please sign up or login with your details

Forgot password? Click here to reset