Topic Similarity Networks: Visual Analytics for Large Document Sets

09/26/2014
by   Arun S. Maiya, et al.
0

We investigate ways in which to improve the interpretability of LDA topic models by better analyzing and visualizing their outputs. We focus on examining what we refer to as topic similarity networks: graphs in which nodes represent latent topics in text collections and links represent similarity among topics. We describe efficient and effective approaches to both building and labeling such networks. Visualizations of topic models based on these networks are shown to be a powerful means of exploring, characterizing, and summarizing large collections of unstructured text documents. They help to "tease out" non-obvious connections among different sets of documents and provide insights into how topics form larger themes. We demonstrate the efficacy and practicality of these approaches through two case studies: 1) NSF grants for basic research spanning a 14 year period and 2) the entire English portion of Wikipedia.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/27/2011

TopicViz: Semantic Navigation of Document Collections

When people explore and manage information, they think in terms of topic...
research
03/15/2012

Timeline: A Dynamic Hierarchical Dirichlet Process Model for Recovering Birth/Death and Evolution of Topics in Text Stream

Topic models have proven to be a useful tool for discovering latent stru...
research
01/30/2018

Creative Exploration Using Topic Based Bisociative Networks

Bisociative knowledge discovery is an approach that combines elements fr...
research
06/30/2021

Multilayer Networks for Text Analysis with Multiple Data Types

We are interested in the widespread problem of clustering documents and ...
research
08/19/2015

Fast, Flexible Models for Discovering Topic Correlation across Weakly-Related Collections

Weak topic correlation across document collections with different number...
research
02/03/2014

A high-reproducibility and high-accuracy method for automated topic classification

Much of human knowledge sits in large databases of unstructured text. Le...
research
03/29/2018

Computer-Assisted Text Analysis for Social Science: Topic Models and Beyond

Topic models are a family of statistical-based algorithms to summarize, ...

Please sign up or login with your details

Forgot password? Click here to reset