Workflows Community Summit 2022: A Roadmap Revolution

by   Rafael Ferreira da Silva, et al.

Scientific workflows have become integral tools in broad scientific computing use cases. Science discovery is increasingly dependent on workflows to orchestrate large and complex scientific experiments that range from execution of a cloud-based data preprocessing pipeline to multi-facility instrument-to-edge-to-HPC computational workflows. Given the changing landscape of scientific computing and the evolving needs of emerging scientific applications, it is paramount that the development of novel scientific workflows and system functionalities seek to increase the efficiency, resilience, and pervasiveness of existing systems and applications. Specifically, the proliferation of machine learning/artificial intelligence (ML/AI) workflows, need for processing large scale datasets produced by instruments at the edge, intensification of near real-time data processing, support for long-term experiment campaigns, and emergence of quantum computing as an adjunct to HPC, have significantly changed the functional and operational requirements of workflow systems. Workflow systems now need to, for example, support data streams from the edge-to-cloud-to-HPC enable the management of many small-sized files, allow data reduction while ensuring high accuracy, orchestrate distributed services (workflows, instruments, data movement, provenance, publication, etc.) across computing and user facilities, among others. Further, to accelerate science, it is also necessary that these systems implement specifications/standards and APIs for seamless (horizontal and vertical) integration between systems and applications, as well as enabling the publication of workflows and their associated products according to the FAIR principles. This document reports on discussions and findings from the 2022 international edition of the Workflows Community Summit that took place on November 29 and 30, 2022.


page 1

page 5

page 30


Towards Lightweight Data Integration using Multi-workflow Provenance and Data Observability

Modern large-scale scientific discovery requires multidisciplinary colla...

Linking Scientific Instruments and HPC: Patterns, Technologies, Experiences

Powerful detectors at modern experimental facilities routinely collect d...

Supporting High-Performance and High-Throughput Computing for Experimental Science

The advent of experimental science facilities, instruments and observato...

Globus Automation Services: Research process automation across the space-time continuum

Research process automation–the reliable, efficient, and reproducible ex...

A Community Roadmap for Scientific Workflows Research and Development

The landscape of workflow systems for scientific applications is notorio...

Workflows Community Summit: Advancing the State-of-the-art of Scientific Workflows Management Systems Research and Development

Scientific workflows are a cornerstone of modern scientific computing, a...

Pseudonymization at Scale: OLCF's Summit Usage Data Case Study

The analysis of vast amounts of data and the processing of complex compu...

Please sign up or login with your details

Forgot password? Click here to reset