Tangent Model Composition for Ensembling and Continual Fine-tuning

07/16/2023
by Tian Yu Liu, et al.

Tangent Model Composition (TMC) is a method to combine component models independently fine-tuned around a pre-trained point. Component models are tangent vectors to the pre-trained model that can be added, scaled, or subtracted to support incremental learning, ensembling, or unlearning. Component models are composed at inference time via scalar combination, reducing the cost of ensembling to that of a single model. TMC improves accuracy by 4.2% compared to ensembling non-linearly fine-tuned models, at a 2.5x to 10x reduction of inference cost, with the reduction growing linearly with the number of component models. Each component model can be forgotten at zero cost, with no residual effect on the resulting inference. When used for continual fine-tuning, TMC is not constrained by sequential bias and can be executed in parallel on federated data. TMC outperforms recently published continual fine-tuning methods almost uniformly on each setting (task-incremental, class-incremental, and data-incremental) on a total of 13 experiments across 3 benchmark datasets, despite not using any replay buffer. TMC is designed for composing models that are local to a pre-trained embedding, but could be extended to more general settings.
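
The composition idea can be illustrated with a minimal sketch. It is not the authors' implementation: the toy model `f`, the helper names `tangent_model` and `compose`, and all numeric values are assumptions made for illustration. It assumes each component is a first-order (tangent) expansion of the model around the pre-trained weights `w0`, so that the output is linear in the weight offset `delta`. Under that assumption, averaging the outputs of several components gives the same result as one forward pass through a single model whose offset is the average of the components' offsets, and forgetting a component amounts to leaving its offset out of the combination.

```python
import math

def f(x, w):
    """Toy nonlinear model (hypothetical stand-in for a network): tanh(w . x)."""
    return math.tanh(sum(wi * xi for wi, xi in zip(w, x)))

def grad_w(x, w):
    """Gradient of f with respect to the weights, evaluated at w."""
    s = 1.0 - f(x, w) ** 2  # derivative of tanh at w . x
    return [s * xi for xi in x]

def tangent_model(x, w0, delta):
    """First-order Taylor expansion of f around the pre-trained point w0."""
    g = grad_w(x, w0)
    return f(x, w0) + sum(gi * di for gi, di in zip(g, delta))

def compose(deltas, coeffs):
    """Scalar combination of tangent vectors: sum_i coeffs[i] * deltas[i]."""
    dim = len(deltas[0])
    return [sum(c * d[j] for c, d in zip(coeffs, deltas)) for j in range(dim)]

# Illustrative values only: a pre-trained point and three fine-tuned offsets.
w0 = [0.5, -0.3]
deltas = [[0.10, 0.02], [-0.04, 0.08], [0.03, -0.05]]
x = [1.0, 2.0]

# Output-space ensemble: one forward pass per component.
ensemble = sum(tangent_model(x, w0, d) for d in deltas) / len(deltas)

# Weight-space composition: a single forward pass on the averaged offset.
uniform = [1.0 / len(deltas)] * len(deltas)
composed = tangent_model(x, w0, compose(deltas, uniform))

# Forgetting the second component: drop its offset and re-average the rest.
remaining = [deltas[0], deltas[2]]
forgotten = tangent_model(x, w0, compose(remaining, [0.5, 0.5]))
remaining_ensemble = sum(tangent_model(x, w0, d) for d in remaining) / 2

print(abs(ensemble - composed) < 1e-12)
print(abs(forgotten - remaining_ensemble) < 1e-12)
```

The equality between `ensemble` and `composed` relies on the coefficients summing to one and on the model being linear in `delta`; it would not hold in general for the original nonlinear `f` evaluated at `w0 + delta`.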
