Semantic Concept Spaces: Guided Topic Model Refinement using Word-Embedding Projections

by   Mennatallah El-Assady, et al.

We present a framework that allows users to incorporate the semantics of their domain knowledge for topic model refinement while remaining model-agnostic. Our approach enables users to (1) understand the semantic space of the model, (2) identify regions of potential conflicts and problems, and (3) readjust the semantic relation of concepts based on their understanding, directly influencing the topic modeling. These tasks are supported by an interactive visual analytics workspace that uses word-embedding projections to define concept regions which can then be refined. The user-refined concepts are independent of a particular document collection and can be transferred to related corpora. All user interactions within the concept space directly affect the semantic relations of the underlying vector space model, which, in turn, change the topic modeling. In addition to direct manipulation, our system guides the users' decision-making process through recommended interactions that point out potential improvements. This targeted refinement aims at minimizing the feedback required for an efficient human-in-the-loop process. We confirm the improvements achieved through our approach in two user studies that show topic model quality improvements through our visual knowledge externalization and learning process.


page 1

page 2

page 3

page 4


Human-in-the-loop Extraction of Interpretable Concepts in Deep Learning Models

The interpretation of deep neural networks (DNNs) has become a key topic...

A User-Centered, Interactive, Human-in-the-Loop Topic Modelling System

Human-in-the-loop topic modelling incorporates users' knowledge into the...

Representing Mixtures of Word Embeddings with Mixtures of Topic Embeddings

A topic model is often formulated as a generative model that explains ho...

Learning Concept Hierarchies through Probabilistic Topic Modeling

With the advent of semantic web, various tools and techniques have been ...

User Ex Machina : Simulation as a Design Probe in Human-in-the-Loop Text Analytics

Topic models are widely used analysis techniques for clustering document...

Exploratory topic modeling with distributional semantics

As we continue to collect and store textual data in a multitude of domai...

Why Didn't You Listen to Me? Comparing User Control of Human-in-the-Loop Topic Models

To address the lack of comparative evaluation of Human-in-the-Loop Topic...

Please sign up or login with your details

Forgot password? Click here to reset