A Survey on Text Classification: From Shallow to Deep Learning

08/02/2020
by   Qian Li, et al.
0

Text classification is the most fundamental and essential task in natural language processing. The last decade has seen a surge of research in this area due to the unprecedented success of deep learning. Numerous methods, datasets, and evaluation metrics have been proposed in the literature, raising the need for a comprehensive and updated survey. This paper fills the gap by reviewing the state of the art approaches from 1961 to 2020, focusing on models from shallow to deep learning. We create a taxonomy for text classification according to the text involved and the models used for feature extraction and classification. We then discuss each of these categories in detail, dealing with both the technical developments and benchmark datasets that support tests of predictions. A comprehensive comparison between different techniques, as well as identifying the pros and cons of various evaluation metrics are also provided in this survey. Finally, we conclude by summarizing key implications, future research directions, and the challenges facing the research area.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/05/2021

Deep Learning Schema-based Event Extraction: Literature Review and Current Trends

Schema-based event extraction is a critical technique to apprehend the e...
research
04/23/2023

Graph Neural Networks for Text Classification: A Survey

Text Classification is the most essential and fundamental problem in Nat...
research
05/23/2023

Out-of-Distribution Generalization in Text Classification: Past, Present, and Future

Machine learning (ML) systems in natural language processing (NLP) face ...
research
04/06/2020

Deep Learning Based Text Classification: A Comprehensive Review

Deep learning based models have surpassed classical machine learning bas...
research
07/20/2021

Data Hiding with Deep Learning: A Survey Unifying Digital Watermarking and Steganography

Data hiding is the process of embedding information into a noise-toleran...
research
06/01/2018

Video Description: A Survey of Methods, Datasets and Evaluation Metrics

Automatic video description is useful for assisting the visually impaire...
research
02/18/2019

Classifying textual data: shallow, deep and ensemble methods

This paper focuses on a comparative evaluation of the most common and mo...

Please sign up or login with your details

Forgot password? Click here to reset