research
∙
01/21/2021
Distilling Large Language Models into Tiny and Effective Students using pQRNN
Large pre-trained multilingual models like mBERT, XLM-R achieve state of...
research
∙
04/30/2012