We study the data-scaling of transfer learning from foundation models in...
Running faster will only get you so far – it is generally advisable to f...
We show that the error of magnitude-pruned networks follows a scaling la...
The AlphaZero algorithm for the learning of strategy games via self-play...
The dependency of the generalization error of neural networks on model a...