On Large-Scale Dynamic Topic Modeling with Nonnegative CP Tensor Decomposition

by   Miju Ahn, et al.

There is currently an unprecedented demand for large-scale temporal data analysis due to the explosive growth of data. Dynamic topic modeling has been widely used in social and data sciences with the goal of learning latent topics that emerge, evolve, and fade over time. Previous work on dynamic topic modeling primarily employ the method of nonnegative matrix factorization (NMF), where slices of the data tensor are each factorized into the product of lower-dimensional nonnegative matrices. With this approach, however, information contained in the temporal dimension of the data is often neglected or underutilized. To overcome this issue, we propose instead adopting the method of nonnegative CANDECOMP/PARAPAC (CP) tensor decomposition (NNCPD), where the data tensor is directly decomposed into a minimal sum of outer products of nonnegative vectors, thereby preserving the temporal information. The viability of NNCPD is demonstrated through application to both synthetic and real data, where significantly improved results are obtained compared to those of typical NMF-based methods. The advantages of NNCPD over such approaches are studied and discussed. To the best of our knowledge, this is the first time that NNCPD has been utilized for the purpose of dynamic topic modeling, and our findings will be transformative for both applications and further developments.


page 8

page 9

page 10

page 11

page 12

page 13

page 15

page 17


On Nonnegative Matrix and Tensor Decompositions for COVID-19 Twitter Dynamics

We analyze Twitter data relating to the COVID-19 pandemic using dynamic ...

A Generalized Hierarchical Nonnegative Tensor Decomposition

Nonnegative matrix factorization (NMF) has found many applications inclu...

Multi-scale Hybridized Topic Modeling: A Pipeline for Analyzing Unstructured Text Datasets via Topic Modeling

We propose a multi-scale hybridized topic modeling method to find hidden...

Online nonnegative tensor factorization and CP-dictionary learning for Markovian data

Nonnegative Matrix Factorization (NMF) algorithms are fundamental tools ...

Topic-aware chatbot using Recurrent Neural Networks and Nonnegative Matrix Factorization

We propose a novel model for a topic-aware chatbot by combining the trad...

Near-Convex Archetypal Analysis

Nonnegative matrix factorization (NMF) is a widely used linear dimension...

Unmixing dynamic PET images with variable specific binding kinetics

To analyze dynamic positron emission tomography (PET) images, various ge...

Please sign up or login with your details

Forgot password? Click here to reset