Large Language Models Based Automatic Synthesis of Software Specifications

04/18/2023
by   Shantanu Mandal, et al.
0

Software configurations play a crucial role in determining the behavior of software systems. In order to ensure safe and error-free operation, it is necessary to identify the correct configuration, along with their valid bounds and rules, which are commonly referred to as software specifications. As software systems grow in complexity and scale, the number of configurations and associated specifications required to ensure the correct operation can become large and prohibitively difficult to manipulate manually. Due to the fast pace of software development, it is often the case that correct software specifications are not thoroughly checked or validated within the software itself. Rather, they are frequently discussed and documented in a variety of external sources, including software manuals, code comments, and online discussion forums. Therefore, it is hard for the system administrator to know the correct specifications of configurations due to the lack of clarity, organization, and a centralized unified source to look at. To address this challenge, we propose SpecSyn a framework that leverages a state-of-the-art large language model to automatically synthesize software specifications from natural language sources. Our approach formulates software specification synthesis as a sequence-to-sequence learning problem and investigates the extraction of specifications from large contextual texts. This is the first work that uses a large language model for end-to-end specification synthesis from natural language texts. Empirical results demonstrate that our system outperforms prior the state-of-the-art specification synthesis tool by 21 terms of F1 score and can find specifications from single as well as multiple sentences.

READ FULL TEXT
research
05/25/2023

Learning-Based Automatic Synthesis of Software Code and Configuration

Increasing demands in software industry and scarcity of software enginee...
research
06/06/2023

Impact of Large Language Models on Generating Software Specifications

Software specifications are essential for ensuring the reliability of so...
research
01/24/2023

Taming Large Bounds in Synthesis from Bounded-Liveness Specifications (Full Version)

Automatic synthesis from temporal logic specifications is an attractive ...
research
07/11/2023

What do LLMs need to Synthesize Correct Router Configurations?

We investigate whether Large Language Models (e.g., GPT-4) can synthesiz...
research
10/11/2021

Generalizing to New Domains by Mapping Natural Language to Lifted LTL

Recent work on using natural language to specify commands to robots has ...
research
11/17/2017

Towards operational natural language

The multiplicity of software projects' stakeholders and activities leads...
research
08/08/2023

Adapting Foundation Models for Information Synthesis of Wireless Communication Specifications

Existing approaches to understanding, developing and researching modern ...

Please sign up or login with your details

Forgot password? Click here to reset