Data-driven Algorithm Design

11/14/2020
by Maria-Florina Balcan, et al.

Data-driven algorithm design is an important aspect of modern data science and algorithm design. Rather than using off-the-shelf algorithms that only have worst-case performance guarantees, practitioners often optimize over large families of parametrized algorithms, tuning the parameters on a training set of problem instances from their domain to find a configuration with high expected performance on future instances. However, most of this work comes with no performance guarantees. The challenge is that for many important combinatorial problems, including partitioning, subset selection, and alignment problems, a small tweak to the parameters can cause a cascade of changes in the algorithm's behavior, so the algorithm's performance is a discontinuous function of its parameters. In this chapter, we survey recent work that helps put data-driven combinatorial algorithm design on firm foundations. We provide strong computational and statistical performance guarantees for both the batch and online scenarios, in which a collection of typical problem instances from the given application is presented all at once or in an online fashion, respectively.
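
To make the setup in the abstract concrete, here is a minimal sketch in Python. It is an illustration under stated assumptions, not the chapter's method: the greedy knapsack family ranked by value / size**rho, the uniform instance distribution, and the names greedy_knapsack, sample_instance, and tune_rho are all hypothetical choices made for this example. It shows two things: performance jumping when the parameter crosses a threshold, and tuning the parameter by averaging performance over a training set of instances.

```python
import random

def greedy_knapsack(values, sizes, capacity, rho):
    """Pack items greedily in decreasing order of value / size**rho.

    Returns the total value packed. rho is the tunable parameter.
    """
    order = sorted(range(len(values)),
                   key=lambda i: values[i] / sizes[i] ** rho,
                   reverse=True)
    total_value, remaining = 0.0, capacity
    for i in order:
        if sizes[i] <= remaining:
            remaining -= sizes[i]
            total_value += values[i]
    return total_value

# A tiny instance where performance is discontinuous in rho: the item
# ordering flips near rho = 0.56, and the packed value drops from 10 to 6.
print(greedy_knapsack([10, 6], [5, 2], 5, rho=0.5))  # 10.0
print(greedy_knapsack([10, 6], [5, 2], 5, rho=0.6))  # 6.0

def sample_instance(rng, n_items=10):
    """Draw one random instance (an assumed stand-in for 'the domain')."""
    values = [rng.uniform(1, 10) for _ in range(n_items)]
    sizes = [rng.uniform(1, 10) for _ in range(n_items)]
    capacity = 0.3 * sum(sizes)
    return values, sizes, capacity

def average_value(instances, rho):
    """Average packed value of the rho-greedy algorithm over the instances."""
    return sum(greedy_knapsack(v, s, c, rho) for v, s, c in instances) / len(instances)

def tune_rho(instances, candidates):
    """Pick the candidate rho with the best average training performance."""
    return max(candidates, key=lambda rho: average_value(instances, rho))

rng = random.Random(0)
training = [sample_instance(rng) for _ in range(50)]
candidates = [k / 10 for k in range(31)]  # rho in {0.0, 0.1, ..., 3.0}
best_rho = tune_rho(training, candidates)
print("tuned rho:", best_rho)
```

The grid search over candidates is only a convenience for this sketch. Because performance is piecewise constant in rho, a grid can miss the best interval, which is one reason guarantees for this kind of tuning are nontrivial.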

research
09/24/2022

The Online Knapsack Problem with Departures

The online knapsack problem is a classic online resource allocation prob...
research
04/07/2022

Faster algorithms for learning to link, align sequences, and price two-part tariffs

Data-driven algorithm configuration is a promising, learning-based appro...
research
05/26/2019

Learning to Optimize Computational Resources: Frugal Training with Generalization Guarantees

Algorithms typically come with tunable parameters that have a considerab...
research
08/08/2019

How much data is sufficient to learn high-performing algorithms?

Algorithms for scientific analysis typically have tunable parameters tha...
research
12/24/2020

Generalization in portfolio-based algorithm selection

Portfolio-based algorithm selection has seen tremendous practical succes...
research
06/21/2020

Refined bounds for algorithm configuration: The knife-edge of dual class approximability

Automating algorithm configuration is growing increasingly necessary as ...
research
03/18/2021

Data driven algorithms for limited labeled data learning

We consider a novel data driven approach for designing learning algorith...
