Visualizing Topic Uncertainty in Topic Modelling

02/13/2023
by   Peter Winker, et al.
0

Word clouds became a standard tool for presenting results of natural language processing methods such as topic modelling. They exhibit most important words, where word size is often chosen proportional to the relevance of words within a topic. In the latent Dirichlet allocation (LDA) model, word clouds are graphical presentations of a vector of weights for words within a topic. These vectors are the result of a statistical procedure based on a specific corpus. Therefore, they are subject to uncertainty coming from different sources as sample selection, random components in the optimization algorithm, or parameter settings. A novel approach for presenting word clouds including information on such types of uncertainty is introduced and illustrated with an application of the LDA model to conference abstracts.

READ FULL TEXT

page 10

page 11

research
11/10/2014

Modeling Word Relatedness in Latent Dirichlet Allocation

Standard LDA model suffers the problem that the topic assignment of each...
research
05/15/2014

Topic words analysis based on LDA model

Social network analysis (SNA), which is a research field describing and ...
research
08/05/2015

Topic Stability over Noisy Sources

Topic modelling techniques such as LDA have recently been applied to spe...
research
06/17/2016

SMS Spam Filtering using Probabilistic Topic Modelling and Stacked Denoising Autoencoder

In This paper we present a novel approach to spam filtering and demonstr...
research
01/15/2020

VSEC-LDA: Boosting Topic Modeling with Embedded Vocabulary Selection

Topic modeling has found wide application in many problems where latent ...
research
10/14/2022

Word Clouds in the Wild

Word clouds are frequently used to analyze and communicate text data in ...
research
08/13/2016

Analysis of Morphology in Topic Modeling

Topic models make strong assumptions about their data. In particular, di...

Please sign up or login with your details

Forgot password? Click here to reset