Neural Topic Modeling with Continual Lifelong Learning

06/19/2020
by   Pankaj Gupta, et al.
0

Lifelong learning has recently attracted attention in building machine learning systems that continually accumulate and transfer knowledge to help future learning. Unsupervised topic modeling has been popularly used to discover topics from document collections. However, the application of topic modeling is challenging due to data sparsity, e.g., in a small collection of (short) documents and thus, generate incoherent topics and sub-optimal document representations. To address the problem, we propose a lifelong learning framework for neural topic modeling that can continuously process streams of document collections, accumulate topics and guide future topic modeling tasks by knowledge transfer from several sources to better deal with the sparse data. In the lifelong process, we particularly investigate jointly: (1) sharing generative homologies (latent topics) over lifetime to transfer prior knowledge, and (2) minimizing catastrophic forgetting to retain the past learning via novel selective data augmentation, co-training and topic regularization approaches. Given a stream of document collections, we apply the proposed Lifelong Neural Topic Modeling (LNTM) framework in modeling three sparse document collections as future tasks and demonstrate improved performance quantified by perplexity, topic coherence and information retrieval task.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/14/2019

Multi-view and Multi-source Transfers in Neural Topic Modeling

Though word embeddings and topics are complementary representations, sev...
research
09/29/2019

Lifelong Neural Topic Learning in Contextualized Autoregressive Topic Models of Language via Informative Transfers

Topic models such as LDA, DocNADE, iDocNADEe have been popular in docume...
research
12/05/2022

Federated Neural Topic Models

Over the last years, topic modeling has emerged as a powerful technique ...
research
10/05/2020

Improving Neural Topic Models using Knowledge Distillation

Topic models are often used to identify human-interpretable topics to he...
research
04/17/2021

Multi-source Neural Topic Modeling in Multi-view Embedding Spaces

Though word embeddings and topics are complementary representations, sev...
research
05/18/2022

Topic Segmentation of Research Article Collections

Collections of research article data harvested from the web have become ...
research
03/24/2021

Topic Modeling Genre: An Exploration of French Classical and Enlightenment Drama

The concept of literary genre is a highly complex one: not only are diff...

Please sign up or login with your details

Forgot password? Click here to reset