Ermes: Emoji-Powered Representation Learning for Cross-Lingual Sentiment Classification

06/07/2018
by   Zhenpeng Chen, et al.
0

Most existing sentiment analysis approaches heavily rely on a large amount of labeled data that usually involve time-consuming and error-prone manual annotations. The distribution of this labeled data is significantly imbalanced among languages, e.g., more English texts are labeled than texts in other languages, which presents a major challenge to cross-lingual sentiment analysis. There have been several cross-lingual representation learning techniques that transfer the knowledge learned from a language with abundant labeled examples to another language with much fewer labels. Their performance, however, is usually limited due to the imperfect quality of machine translation and the scarce signal that bridges two languages. In this paper, we employ emojis, a ubiquitous and emotional language, as a new bridge for sentiment analysis across languages. Specifically, we propose a semi-supervised representation learning approach through the task of emoji prediction to learn cross-lingual representations of text that can capture both semantic and sentiment information. The learned representations are then utilized to facilitate cross-lingual sentiment classification. We demonstrate the effectiveness and efficiency of our approach on a representative Amazon review data set that covers three languages and three domains.

READ FULL TEXT
research
07/06/2017

Cross-Lingual Sentiment Analysis Without (Good) Translation

Current approaches to cross-lingual sentiment analysis try to leverage t...
research
03/22/2018

MultiBooked: A Corpus of Basque and Catalan Hotel Reviews Annotated for Aspect-level Sentiment Classification

While sentiment analysis has become an established field in the NLP comm...
research
09/18/2019

Text Length Adaptation in Sentiment Classification

Can a text classifier generalize well for datasets where the text length...
research
06/06/2016

Adversarial Deep Averaging Networks for Cross-Lingual Sentiment Classification

In recent years deep neural networks have achieved great success in sent...
research
07/04/2019

SEntiMoji: An Emoji-Powered Learning Approach for Sentiment Analysis in Software Engineering

Sentiment analysis has various application scenarios in software enginee...
research
03/07/2017

Leveraging Large Amounts of Weakly Supervised Data for Multi-Language Sentiment Classification

This paper presents a novel approach for multi-lingual sentiment classif...
research
10/04/2019

Contrastive Language Adaptation for Cross-Lingual Stance Detection

We study cross-lingual stance detection, which aims to leverage labeled ...

Please sign up or login with your details

Forgot password? Click here to reset