WebQAmGaze: A Multilingual Webcam Eye-Tracking-While-Reading Dataset

by   Tiago Ribeiro, et al.
Københavns Uni

We create WebQAmGaze, a multilingual low-cost eye-tracking-while-reading dataset, designed to support the development of fair and transparent NLP models. WebQAmGaze includes webcam eye-tracking data from 332 participants naturally reading English, Spanish, and German texts. Each participant performs two reading tasks composed of five texts, a normal reading and an information-seeking task. After preprocessing the data, we find that fixations on relevant spans seem to indicate correctness when answering the comprehension questions. Additionally, we perform a comparative analysis of the data collected to high-quality eye-tracking data. The results show a moderate correlation between the features obtained with the webcam-ET compared to those of a commercial ET device. We believe this data can advance webcam-based reading studies and open a way to cheaper and more accessible data collection. WebQAmGaze is useful to learn about the cognitive processes behind question answering (QA) and to apply these insights to computational models of language understanding.


page 1

page 2

page 3

page 4


The Copenhagen Corpus of Eye Tracking Recordings from Natural Reading of Danish Texts

Eye movement recordings from reading are one of the richest signals of h...

TorontoCL at CMCL 2021 Shared Task: RoBERTa with Multi-Stage Fine-Tuning for Eye-Tracking Prediction

Eye movement data during reading is a useful source of information for u...

PeyeDF: an Eye-Tracking Application for Reading and Self-Indexing Research

PeyeDF is a Portable Document Format (PDF) reader with eye tracking supp...

Zero Shot Crosslingual Eye-Tracking Data Prediction using Multilingual Transformer Models

Eye tracking data during reading is a useful source of information to un...

A Novel Slip-Kalman Filter to Track the Progression of Reading Through Eye-Gaze Measurements

In this paper, we propose an approach to track the progression of eye-ga...

CARE: Collaborative AI-Assisted Reading Environment

Recent years have seen impressive progress in AI-assisted writing, yet t...

Every word counts: A multilingual analysis of individual human alignment with model attention

Human fixation patterns have been shown to correlate strongly with Trans...

Please sign up or login with your details

Forgot password? Click here to reset