DUETS: A Dataset of Reproducible Pairs ofJava Library-Clients

03/17/2021
by   Thomas Durieux, et al.
0

Software engineering researchers look for software artifacts to study their characteristics or to evaluate new techniques. In this paper, we introduce DUETS, a new dataset of software libraries and their clients. This dataset can be exploited to gain many different insights, such as API usage, usage inputs, or novel observations about the test suites of clients and libraries. DUETS is meant to support both static and dynamic analysis. This means that the libraries and the clients compile correctly, they are executable and their test suites pass. The dataset is composed of open-source projects that have more than five stars on GitHub. The final dataset contains 395 libraries and 2,874 clients. Additionally, we provide the raw data that we use to create this dataset, such as 34,560 pom.xml files or the complete file list from 34,560 projects. This dataset can be used to study how libraries are used by their clients or as a list of software projects that successfully build. The client's test suite can be used as an additional verification step for code transformation techniques that modify the libraries.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/09/2021

Analyzing the Impact of Pull Requests to Guide Library Evolution

"If we make this change to our code, how will it impact our clients?" It...
research
09/07/2022

So Much in So Little: Creating Lightweight Embeddings of Python Libraries

In software engineering, different approaches and machine learning model...
research
03/01/2021

Offshore Software Maintenance Outsourcing Predicting Clients Proposal using Supervised Learning

In software engineering, software maintenance is the process of correcti...
research
10/15/2021

Breaking Bad? Semantic Versioning and Impact of Breaking Changes in Maven Central

Just like any software, libraries evolve to incorporate new features, bu...
research
07/12/2019

Framework Code Samples: How Are They Maintained and Used by Developers?

Background: Modern software systems are commonly built on the top of fra...
research
08/26/2019

Analyzing 2.3 Million Maven Dependencies to Reveal an Essential Core in APIs

This paper addresses the following question: does a small, essential, co...
research
02/08/2022

SPIDER: Specification-based Integration Defect Revealer

Modern software design practice implies widespread use in the developmen...

Please sign up or login with your details

Forgot password? Click here to reset