Coresets via Bilevel Optimization for Continual Learning and Streaming

by   Zalán Borsos, et al.

Coresets are small data summaries that are sufficient for model training. They can be maintained online, enabling efficient handling of large data streams under resource constraints. However, existing constructions are limited to simple models such as k-means and logistic regression. In this work, we propose a novel coreset construction via cardinality-constrained bilevel optimization. We show how our framework can efficiently generate coresets for deep neural networks, and demonstrate its empirical benefits in continual learning and in streaming settings.


page 2

page 7

page 20


Data Summarization via Bilevel Optimization

The increasing availability of massive data sets poses a series of chall...

Online Continual Learning on Class Incremental Blurry Task Configuration with Anytime Inference

Despite rapid advances in continual learning, a large body of research i...

Adaptive Explainable Continual Learning Framework for Regression Problems with Focus on Power Forecasts

Compared with traditional deep learning techniques, continual learning e...

SIESTA: Efficient Online Continual Learning with Sleep

In supervised continual learning, a deep neural network (DNN) is updated...

Continual learning autoencoder training for a particle-in-cell simulation via streaming

The upcoming exascale era will provide a new generation of physics simul...

Explainable Lifelong Stream Learning Based on "Glocal" Pairwise Fusion

Real-time on-device continual learning applications are used on mobile p...

One Pass ImageNet

We present the One Pass ImageNet (OPIN) problem, which aims to study the...

Code Repositories


Coresets via Bilevel Optimization

view repo

Please sign up or login with your details

Forgot password? Click here to reset