Data intensive physics analysis in Azure cloud

10/25/2021
by Igor Sfiligoi, et al.

The Compact Muon Solenoid (CMS) experiment at the Large Hadron Collider (LHC) is one of the largest data producers in the scientific world, with standard data products centrally produced and then used by often competing teams within the collaboration. This work focuses on how a local institution, the University of California San Diego (UCSD), partnered with the Open Science Grid (OSG) to augment its available computing with Azure cloud resources, accelerating time to results for multiple analyses pursued by a small group of collaborators. The OSG is a federated infrastructure that allows many independent resource providers to serve many independent user communities in a transparent manner. Historically, the resources have come from various research institutions, ranging from small universities to large HPC centers, based on either community needs or grant allocations, so adding commercial clouds as resource providers is a natural evolution. The OSG technology allows for easy integration of cloud resources, but the data-intensive nature of CMS compute jobs required the deployment of additional data caching infrastructure to ensure high efficiency.


