research
∙
12/15/2022
A Study on the Intersection of GPU Utilization and CNN Inference
There has been significant progress in developing neural network archite...
research
∙
04/19/2021
Arithmetic-Intensity-Guided Fault Tolerance for Neural Network Inference on GPUs
Neural networks (NNs) are increasingly employed in domains that require ...
research
∙
04/05/2021
ECRM: Efficient Fault Tolerance for Recommendation Model Training via Erasure Coding
Deep-learning-based recommendation models (DLRMs) are widely deployed to...
research
∙
05/02/2019
Parity Models: A General Framework for Coding-Based Resilience in ML Inference
Machine learning models are becoming the primary workhorses for many app...
research
∙
06/04/2018