Adapters have emerged as a modular and parameter-efficient approach to
(...
Massively multilingual Transformers (MMTs), such as mBERT and XLM-R, are...
Fine-tuning all parameters of a pre-trained model has become the mainstr...
To avoid the "meaning conflation deficiency" of word embeddings, a numbe...