research
∙
07/07/2023
DEFT: Exploiting Gradient Norm Difference between Model Layers for Scalable Gradient Sparsification
Gradient sparsification is a widely adopted solution for reducing the ex...
research
∙
09/18/2022