Stochastic Optimization of Sorting Networks via Continuous Relaxations

03/21/2019
by Aditya Grover, et al.

Sorting input objects is an important step in many machine learning pipelines. However, the sorting operator is non-differentiable with respect to its inputs, which prohibits end-to-end gradient-based optimization. In this work, we propose NeuralSort, a general-purpose continuous relaxation of the output of the sorting operator from permutation matrices to the set of unimodal row-stochastic matrices, where every row sums to one and has a distinct arg max. This relaxation permits straight-through optimization of any computational graph that involves a sorting operation. Further, we use this relaxation to enable gradient-based stochastic optimization over the combinatorially large space of permutations by deriving a reparameterized gradient estimator for the Plackett-Luce family of distributions over permutations. We demonstrate the usefulness of our framework on three tasks that require learning semantic orderings of high-dimensional objects, including a fully differentiable, parameterized extension of the k-nearest neighbors algorithm.
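
The relaxation described above can be illustrated with a minimal NumPy sketch. The abstract does not state the formula, so the expression used here (row i is a softmax over (n + 1 - 2i) s - A_s 1, divided by a temperature, with A_s the matrix of pairwise absolute differences) is recalled from the full paper and should be treated as an assumption. The function names `neuralsort` and `relaxed_plackett_luce_sample` and the parameter `tau` are illustrative, not the authors' API. The second function relies on the standard Gumbel perturbation view of Plackett-Luce sampling: sorting log-scores plus independent Gumbel noise in descending order yields a Plackett-Luce sample.

```python
import numpy as np


def neuralsort(s, tau=1.0):
    """Relaxed permutation matrix that sorts `s` in descending order.

    Assumed form (recalled from the paper, not given in the abstract):
    row i (1-indexed) is softmax(((n + 1 - 2i) * s - A_s @ 1) / tau),
    where A_s[j, k] = |s_j - s_k|.
    """
    s = np.asarray(s, dtype=float)
    n = s.shape[0]
    # Row sums of the pairwise absolute-difference matrix, shape (n,).
    abs_diff_sums = np.abs(s[:, None] - s[None, :]).sum(axis=1)
    # Coefficients n + 1 - 2i for i = 1..n, shape (n,).
    coeffs = n + 1 - 2 * np.arange(1, n + 1)
    logits = (coeffs[:, None] * s[None, :] - abs_diff_sums[None, :]) / tau
    # Numerically stable row-wise softmax.
    logits = logits - logits.max(axis=1, keepdims=True)
    weights = np.exp(logits)
    return weights / weights.sum(axis=1, keepdims=True)


def relaxed_plackett_luce_sample(log_scores, tau=1.0, rng=None):
    """Relaxed sample from a Plackett-Luce distribution.

    Perturbs the log-scores with Gumbel(0, 1) noise, then applies the
    relaxed sort. The noise does not depend on the parameters, which is
    what makes the sample reparameterized.
    """
    rng = np.random.default_rng() if rng is None else rng
    log_scores = np.asarray(log_scores, dtype=float)
    gumbel_noise = rng.gumbel(size=log_scores.shape)
    return neuralsort(log_scores + gumbel_noise, tau=tau)


P_hat = neuralsort([2.0, 5.0, 1.0], tau=1.0)
# Each row sums to one; the row-wise arg max recovers the descending
# order of the input: [1, 0, 2].
print(np.argmax(P_hat, axis=1))
```

Lower values of `tau` push each row toward a one-hot vector, so the output approaches the exact permutation matrix; higher values give smoother rows and, in practice, less sharply peaked gradients.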

research
03/17/2022

Monotonic Differentiable Sorting Networks

Differentiable sorting algorithms allow training with sorting and rankin...
research
06/29/2020

SoftSort: A Continuous Relaxation for the argsort Operator

While sorting is an important procedure in computer science, the argsort...
research
10/26/2017

Reparameterizing the Birkhoff Polytope for Variational Permutation Inference

Many matching, tracking, sorting, and ranking problems require probabili...
research
02/20/2020

Learning with Differentiable Perturbed Optimizers

Machine learning pipelines often rely on optimization procedures to make...
research
12/12/2020

PiRank: Learning To Rank via Differentiable Sorting

A key challenge with machine learning approaches for ranking is the gap ...
research
03/14/2022

Permutation Invariant Representations with Applications to Graph Deep Learning

This paper presents primarily two Euclidean embeddings of the quotient s...
research
05/26/2023

Differentiable Random Partition Models

Partitioning a set of elements into an unknown number of mutually exclus...
