The Causal News Corpus: Annotating Causal Relations in Event Sentences from News

04/25/2022
by   Fiona Anting Tan, et al.
28

Despite the importance of understanding causality, corpora addressing causal relations are limited. There is a discrepancy between existing annotation guidelines of event causality and conventional causality corpora that focus more on linguistics. Many guidelines restrict themselves to include only explicit relations or clause-based arguments. Therefore, we propose an annotation schema for event causality that addresses these concerns. We annotated 3,559 event sentences from protest event news with labels on whether it contains causal relations or not. Our corpus is known as the Causal News Corpus (CNC). A neural network built upon a state-of-the-art pre-trained language model performed well with 81.20 5-folds cross-validation. CNC is transferable across two external corpora: CausalTimeBank (CTB) and Penn Discourse Treebank (PDTB). Leveraging each of these external datasets for training, we achieved up to approximately 64 the CNC test set without additional fine-tuning. CNC also served as an effective training and pre-training dataset for the two external corpora. Lastly, we demonstrate the difficulty of our task to the layman in a crowd-sourced annotation exercise. Our annotated corpus is publicly available, providing a valuable resource for causal text mining researchers.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/29/2020

Towards Causality Extraction from Requirements

System behavior is often based on causal relations between certain event...
research
08/28/2021

HeadlineCause: A Dataset of News Headlines for Detecting Casualties

Detecting implicit causal relations in texts is a task that requires bot...
research
09/08/2022

IDIAPers @ Causal News Corpus 2022: Efficient Causal Relation Identification Through a Prompt-based Few-shot Approach

In this paper, we describe our participation in the subtask 1 of CASE-20...
research
07/19/2023

DAPrompt: Deterministic Assumption Prompt Learning for Event Causality Identification

Event Causality Identification (ECI) aims at determining whether there i...
research
09/08/2022

IDIAPers @ Causal News Corpus 2022: Extracting Cause-Effect-Signal Triplets via Pre-trained Autoregressive Language Model

In this paper, we describe our shared task submissions for Subtask 2 in ...
research
12/07/2017

A Corpus of Deep Argumentative Structures as an Explanation to Argumentative Relations

In this paper, we compose a new task for deep argumentative structure an...
research
11/22/2022

Event Causality Identification with Causal News Corpus – Shared Task 3, CASE 2022

The Event Causality Identification Shared Task of CASE 2022 involved two...

Please sign up or login with your details

Forgot password? Click here to reset