This paper proposes an efficient optimizer called AdaPlus, which integrat...
In this paper, we propose a general deep learning training framework XGr...
In this paper, we introduce weight prediction into the AdamW optimizer t...
Covert transmission is studied for an intelligent reflecting surface (IR...
We propose XPipe, an efficient asynchronous pipeline model parallelism a...
In this paper, we revisit the convergence of the Heavy-ball method, and ...
Support vector machines (SVMs) with sparsity-inducing nonconvex penaltie...