Behind the Scenes: An Exploration of Trigger Biases Problem in Few-Shot Event Classification

08/29/2021
by Peiyi Wang, et al.

Few-Shot Event Classification (FSEC) aims to develop event prediction models that generalize to new event types from a limited amount of annotated data. Existing FSEC studies have achieved high accuracy on different benchmarks. However, we find they suffer from trigger biases, which signify the statistical homogeneity between some trigger words and target event types and which we summarize as trigger overlapping and trigger separability. These biases can cause a context-bypassing problem: correct classifications can be obtained by looking only at the trigger words while ignoring the entire context. As a result, existing models can be weak at generalizing to unseen data in real scenarios. To further uncover the trigger biases and assess the generalization ability of the models, we propose two new sampling methods, Trigger-Uniform Sampling (TUS) and COnfusion Sampling (COS), for constructing meta tasks during evaluation. In addition, to cope with the context-bypassing problem in FSEC models, we introduce adversarial training and trigger reconstruction techniques. Experiments show that these techniques not only improve performance but also enhance the generalization ability of the models.
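
The abstract names Trigger-Uniform Sampling but does not spell out the procedure, so the following is only a minimal sketch of one plausible reading: when building an N-way K-shot meta task, pick a trigger word uniformly first and then an instance of that trigger, instead of picking instances uniformly. The function name `sample_episode`, the `(event_type, trigger, sentence)` tuple format, and the `trigger_uniform` flag are all hypothetical and are not taken from the paper.

```python
import random
from collections import defaultdict


def sample_episode(data, n_way, k_shot, trigger_uniform=False, rng=None):
    """Build one N-way K-shot support set.

    data: list of (event_type, trigger, sentence) tuples (assumed format).
    Returns a dict mapping each sampled event type to k_shot (trigger, sentence) pairs.
    """
    rng = rng or random.Random()

    # Group instances by event type.
    by_type = defaultdict(list)
    for event_type, trigger, sentence in data:
        by_type[event_type].append((trigger, sentence))

    # Only event types with at least k_shot instances can fill a class.
    eligible = sorted(t for t, items in by_type.items() if len(items) >= k_shot)
    classes = rng.sample(eligible, n_way)

    episode = {}
    for event_type in classes:
        items = by_type[event_type]
        if not trigger_uniform:
            # Instance-uniform baseline: frequent triggers dominate the draw.
            episode[event_type] = rng.sample(items, k_shot)
            continue

        # Assumed trigger-uniform variant: group instances by trigger word,
        # then repeatedly pick a trigger uniformly and take one of its
        # remaining instances (sampling without replacement).
        by_trigger = defaultdict(list)
        for trigger, sentence in items:
            by_trigger[trigger].append((trigger, sentence))
        pools = {t: rng.sample(v, len(v)) for t, v in by_trigger.items()}

        chosen = []
        while len(chosen) < k_shot:
            trigger = rng.choice(sorted(pools))
            chosen.append(pools[trigger].pop())
            if not pools[trigger]:
                del pools[trigger]
        episode[event_type] = chosen
    return episode
```

Under this reading, a class whose instances are dominated by one frequent trigger would still yield support sets containing its rare triggers, which makes it harder for a model to score well by memorizing trigger words alone. The paper's actual TUS and COS definitions may differ from this sketch.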
