research ∙ 02/22/2018
Pattern-based Modeling of Multiresilience Solutions for High-Performance Computing
Resiliency is the ability of large-scale high-performance computing (HPC...
research ∙ 01/14/2018
Shrink or Substitute: Handling Process Failures in HPC Systems using In-situ Recovery
Efficient utilization of today's high-performance computing (HPC) system...
research ∙ 10/25/2017
A Pattern Language for High-Performance Computing Resilience
High-performance computing systems (HPC) provide powerful capabilities f...
research ∙ 08/23/2017
Big Data Meets HPC Log Analytics: Scalable Approach to Understanding Systems at Extreme Scale
Today's high-performance computing (HPC) systems are heavily instrumente...
research ∙ 08/23/2017