Do Children Texts Hold The Key To Commonsense Knowledge?

10/10/2022
by Julien Romero, et al.

Compiling comprehensive repositories of commonsense knowledge is a long-standing problem in AI. Many concerns revolve around reporting bias, i.e., the fact that frequency in text sources is not a good proxy for relevance or truth. This paper explores whether children's texts hold the key to commonsense knowledge compilation, based on the hypothesis that such content makes fewer assumptions about the reader's knowledge and therefore spells out commonsense more explicitly. An analysis of several corpora shows that children's texts indeed contain many more, and more typical, commonsense assertions. Moreover, experiments show that this advantage can be leveraged in popular language-model-based commonsense knowledge extraction settings, where task-unspecific fine-tuning on small amounts of children's texts (childBERT) already yields significant improvements. This provides a refreshing perspective, different from the common trend of deriving progress from ever larger models and corpora.
