Now It Sounds Like You: Learning Personalized Vocabulary On Device

05/05/2023
by   Sid Wang, et al.
0

In recent years, Federated Learning (FL) has shown significant advancements in its ability to perform various natural language processing (NLP) tasks. This work focuses on applying personalized FL for on-device language modeling. Due to limitations of memory and latency, these models cannot support the complexity of sub-word tokenization or beam search decoding, resulting in the decision to deploy a closed-vocabulary language model. However, closed-vocabulary models are unable to handle out-of-vocabulary (OOV) words belonging to specific users. To address this issue, We propose a novel technique called "OOV expansion" that improves OOV coverage and increases model accuracy while minimizing the impact on memory and latency. This method introduces a personalized "OOV adapter" that effectively transfers knowledge from a central model and learns word embedding for personalized vocabulary. OOV expansion significantly outperforms standard FL personalization methods on a set of common FL benchmarks.

READ FULL TEXT
research
06/06/2022

Pretrained Models for Multilingual Federated Learning

Since the advent of Federated Learning (FL), research has applied these ...
research
07/28/2021

New Metrics to Evaluate the Performance and Fairness of Personalized Federated Learning

In Federated Learning (FL), the clients learn a single global model (Fed...
research
01/27/2022

Achieving Personalized Federated Learning with Sparse Local Models

Federated learning (FL) is vulnerable to heterogeneously distributed dat...
research
01/28/2022

A Secure and Efficient Federated Learning Framework for NLP

In this work, we consider the problem of designing secure and efficient ...
research
02/19/2021

Personalized Federated Learning: A Unified Framework and Universal Optimization Techniques

We study the optimization aspects of personalized Federated Learning (FL...
research
02/16/2021

Federated Evaluation and Tuning for On-Device Personalization: System Design Applications

We describe the design of our federated task processing system. Original...
research
04/24/2023

Semantic Tokenizer for Enhanced Natural Language Processing

Traditionally, NLP performance improvement has been focused on improving...

Please sign up or login with your details

Forgot password? Click here to reset