The Software Heritage Filesystem (SwhFS): Integrating Source Code Archival with Development

02/12/2021
by   Thibault Allançon, et al.
0

We introduce the Software Heritage filesystem (SwhFS), a user-space filesystem that integrates large-scale open source software archival with development workflows. SwhFS provides a POSIX filesystem view of Software Heritage, the largest public archive of software source code and version control system (VCS) development history.Using SwhFS, developers can quickly "checkout" any of the 2 billion commits archived by Software Heritage, even after they disappear from their previous known location and without incurring the performance cost of repository cloning. SwhFS works across unrelated repositories and different VCS technologies. Other source code artifacts archived by Software Heritage-individual source code files and trees, releases, and branches-can also be accessed using common programming tools and custom scripts, as if they were locally available.A screencast of SwhFS is available online at dx.doi.org/10.5281/zenodo.4531411.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/16/2020

The Software Heritage Graph Dataset: Large-scale Analysis of Public Software Development History

Software Heritage is the largest existing public archive of software sou...
research
06/19/2019

Growth and Duplication of Public Source Code over Time: Provenance Tracking at Scale

We study the evolution of the largest known corpus of publicly available...
research
03/06/2020

Code Obfuscation for the C/C++ Language

Obfuscation is the action of making something unintelligible. In softwar...
research
03/20/2018

Public Git Archive: a Big Code dataset for all

The number of open source software projects has been growing exponential...
research
03/02/2021

Apples, Oranges Fruits – Understanding Similarity of Software Projects Through The Lens of Dissimilar Artifacts

The growing availability of open source projects has facilitated develop...
research
07/22/2022

Efficient Prior Publication Identification for Open Source Code

Free/Open Source Software (FOSS) enables large-scale reuse of preexistin...
research
08/16/2022

Identifying Source Code File Experts

In software development, the identification of source code file experts ...

Please sign up or login with your details

Forgot password? Click here to reset