Exploring dual information in distance metric learning for clustering

05/26/2021
by Rodrigo Randel, et al.

Distance metric learning algorithms aim to measure similarities and distances between data points appropriately. In the context of clustering, metric learning is typically applied with the assistance of side information provided by experts, most commonly expressed as must-link and cannot-link constraints. In this setting, distance metric learning algorithms move pairs of data points involved in must-link constraints closer together, while moving pairs of points involved in cannot-link constraints away from each other. For these algorithms to be effective, the distance metric must match the experts' knowledge, beliefs, and expectations, and the transformations made to satisfy the side information should preserve the geometrical properties of the dataset. It is also useful to filter the constraints provided by the experts, keeping only the most useful ones and rejecting those that can harm the clustering process. To address these issues, we propose to exploit the dual information associated with the pairwise constraints of the semi-supervised clustering problem. Experiments show that distance metric learning algorithms benefit from integrating this dual information.
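
The effect of must-link and cannot-link constraints on a learned metric can be illustrated with a minimal sketch. This is not the method proposed in the paper and it does not use dual information. It is a toy diagonal-metric learner under assumed names (`sq_dist`, `learn_diagonal_metric`, and the parameters `steps` and `lr` are all hypothetical) that shrinks distances between must-link pairs and grows distances between cannot-link pairs by projected gradient steps.

```python
def sq_dist(x, y, w):
    """Squared distance under a diagonal metric with non-negative weights w."""
    return sum(wi * (xi - yi) ** 2 for wi, xi, yi in zip(w, x, y))


def learn_diagonal_metric(points, must_link, cannot_link, steps=100, lr=0.05):
    """Toy learner (illustrative assumption, not the paper's algorithm).

    Minimizes (sum of must-link squared distances) minus
    (sum of cannot-link squared distances) over per-feature weights.
    """
    dim = len(points[0])
    w = [1.0] * dim
    for _ in range(steps):
        grad = [0.0] * dim
        for i, j in must_link:
            for k in range(dim):
                grad[k] += (points[i][k] - points[j][k]) ** 2
        for i, j in cannot_link:
            for k in range(dim):
                grad[k] -= (points[i][k] - points[j][k]) ** 2
        # Gradient step, then projection onto non-negative weights.
        w = [max(0.0, wk - lr * gk) for wk, gk in zip(w, grad)]
        # Rescale so the weights sum to dim, which keeps the metric bounded.
        total = sum(w)
        if total == 0.0:
            return [1.0] * dim
        w = [wk * dim / total for wk in w]
    return w


# Four points on a unit square; the expert groups them by the first feature.
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0), (1.0, 1.0)]
must_link = [(0, 1), (2, 3)]    # pairs that should share a cluster
cannot_link = [(0, 2), (1, 3)]  # pairs that should be separated

weights = learn_diagonal_metric(points, must_link, cannot_link)
ml = sq_dist(points[0], points[1], weights)  # a must-link pair
cl = sq_dist(points[0], points[2], weights)  # a cannot-link pair
print(weights, ml, cl)
```

In this toy setup the learned weights emphasize the first feature, so the must-link pair ends up closer than the cannot-link pair. Practical methods usually learn a full Mahalanobis matrix and add regularization so that the geometry of the dataset is preserved.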

research
03/11/2016

Nonstationary Distance Metric Learning

Recent work in distance metric learning has focused on learning transfor...
research
12/09/2019

Expert-guided Regularization via Distance Metric Learning

High-dimensional prediction is a challenging problem setting for traditi...
research
01/07/2017

Similarity Function Tracking using Pairwise Comparisons

Recent work in distance metric learning has focused on learning transfor...
research
07/01/2019

Learning to Link

Clustering is an important part of many modern data analysis pipelines, ...
research
03/30/2020

Secure Metric Learning via Differential Pairwise Privacy

Distance Metric Learning (DML) has drawn much attention over the last tw...
research
07/13/2013

Learning an Integrated Distance Metric for Comparing Structure of Complex Networks

Graph comparison plays a major role in many network applications. We oft...
research
04/05/2021

Semi-Supervised Clustering with Inaccurate Pairwise Annotations

Pairwise relational information is a useful way of providing partial sup...
