Lifelong Neural Topic Learning in Contextualized Autoregressive Topic Models of Language via Informative Transfers

09/29/2019
by   Yatin Chaudhary, et al.
0

Topic models such as LDA, DocNADE, iDocNADEe have been popular in document analysis. However, the traditional topic models have several limitations including: (1) Bag-of-words (BoW) assumption, where they ignore word ordering, (2) Data sparsity, where the application of topic models is challenging due to limited word co-occurrences, leading to incoherent topics and (3) No Continuous Learning framework for topic learning in lifelong fashion, exploiting historical knowledge (or latent topics) and minimizing catastrophic forgetting. This thesis focuses on addressing the above challenges within neural topic modeling framework. We propose: (1) Contextualized topic model that combines a topic and a language model and introduces linguistic structures (such as word ordering, syntactic and semantic features, etc.) in topic modeling, (2) A novel lifelong learning mechanism into neural topic modeling framework to demonstrate continuous learning in sequential document collections and minimizing catastrophic forgetting. Additionally, we perform a selective data augmentation to alleviate the need for complete historical corpora during data hallucination or replay.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/19/2020

Neural Topic Modeling with Continual Lifelong Learning

Lifelong learning has recently attracted attention in building machine l...
research
02/25/2010

Syntactic Topic Models

The syntactic topic model (STM) is a Bayesian nonparametric model of lan...
research
12/28/2017

Topic Compositional Neural Language Model

We propose a Topic Compositional Neural Language Model (TCNLM), a novel ...
research
11/20/2020

Topic modelling discourse dynamics in historical newspapers

This paper addresses methodological issues in diachronic data analysis f...
research
02/14/2012

Multidimensional counting grids: Inferring word order from disordered bags of words

Models of bags of words typically assume topic mixing so that the words ...
research
04/07/2017

Conceptualization Topic Modeling

Recently, topic modeling has been widely used to discover the abstract t...
research
02/12/2015

Ordering-sensitive and Semantic-aware Topic Modeling

Topic modeling of textual corpora is an important and challenging proble...

Please sign up or login with your details

Forgot password? Click here to reset