Distributed Submodular Maximization

11/03/2014
by   Baharan Mirzasoleiman, et al.
0

Many large-scale machine learning problems--clustering, non-parametric learning, kernel machines, etc.--require selecting a small yet representative subset from a large dataset. Such problems can often be reduced to maximizing a submodular set function subject to various constraints. Classical approaches to submodular optimization require centralized access to the full dataset, which is impractical for truly large-scale problems. In this paper, we consider the problem of submodular function maximization in a distributed fashion. We develop a simple, two-stage protocol GreeDi, that is easily implemented using MapReduce style computations. We theoretically analyze our approach, and show that under certain natural conditions, performance close to the centralized approach can be achieved. We begin with monotone submodular maximization subject to a cardinality constraint, and then extend this approach to obtain approximation guarantees for (not necessarily monotone) submodular maximization subject to more general constraints including matroid or knapsack constraints. In our extensive experiments, we demonstrate the effectiveness of our approach on several applications, including sparse Gaussian process inference and exemplar based clustering on tens of millions of examples using Hadoop.

READ FULL TEXT

page 5

page 24

research
08/21/2023

Practical Parallel Algorithms for Non-Monotone Submodular Maximization

Submodular maximization has found extensive applications in various doma...
research
05/31/2016

Horizontally Scalable Submodular Maximization

A variety of large-scale machine learning problems can be cast as instan...
research
08/06/2018

Beyond 1/2-Approximation for Submodular Maximization on Massive Data Streams

Many tasks in machine learning and data mining, such as data diversifica...
research
08/29/2018

Submodular Maximization with Packing Constraints in Parallel

We consider the problem of maximizing the multilinear extension of a sub...
research
05/24/2023

Fairness in Streaming Submodular Maximization over a Matroid Constraint

Streaming submodular maximization is a natural model for the task of sel...
research
07/20/2018

Submodular Maximization with Optimal Approximation, Adaptivity and Query Complexity

As a generalization of many classic problems in combinatorial optimizati...
research
02/10/2020

Regularized Submodular Maximization at Scale

In this paper, we propose scalable methods for maximizing a regularized ...

Please sign up or login with your details

Forgot password? Click here to reset