embComp: Visual Interactive Comparison of Vector Embeddings

11/05/2019
by   Florian Heimerl, et al.
0

This work introduces embComp, a novel approach for comparing two embeddings that capture the similarity between objects, such as word and document embeddings. We survey scenarios where comparing these embedding spaces is useful. From those scenarios, we derive common tasks, introduce visual analysis methods that support these tasks, and combine them into a comprehensive system. One of embComp's central features are overview visualizations that are based on metrics for measuring differences in local structure around objects. Summarizing these local metrics over the embeddings provides global overviews of similarities and differences. These global views enable a user to identify sets of interesting objects whose relationships in the embeddings can be compared. Detail views allow comparison of the local structure around selected objects and relating this local information to the global views. Integrating and connecting all of these components, supports a range of analysis workflows that help understand similarities and differences between embedding spaces. We assess our approach by applying it in several use cases, including understanding corpora differences via word vector embeddings, and understanding algorithmic differences in generating embeddings.

READ FULL TEXT

page 1

page 5

page 6

page 8

page 10

page 11

research
12/10/2019

Embedding Comparator: Visualizing Differences in Global Structure and Local Neighborhoods via Small Multiples

Embeddings – mappings from high-dimensional discrete input to lower-dime...
research
05/16/2020

RPD: A Distance Function Between Word Embeddings

It is well-understood that different algorithms, training processes, and...
research
05/28/2019

Parallax: Visualizing and Understanding the Semantics of Embedding Spaces via Algebraic Formulae

Embeddings are a fundamental component of many modern machine learning a...
research
04/16/2020

Boxer: Interactive Comparison of Classifier Results

Machine learning practitioners often compare the results of different cl...
research
08/17/2022

Visual Comparison of Language Model Adaptation

Neural language models are widely used; however, their model parameters ...
research
05/10/2023

An adaptable JSON Diff Framework

In this paper, we present an implementation of JSON-diff framework JYCM,...
research
09/01/2018

Why is unsupervised alignment of English embeddings from different algorithms so hard?

This paper presents a challenge to the community: Generative adversarial...

Please sign up or login with your details

Forgot password? Click here to reset