research ∙ 10/28/2022
Flatter, faster: scaling momentum for optimal speedup of SGD
Commonly used optimization algorithms often show a trade-off between goo...
research ∙ 12/15/2019