Toward American Sign Language Processing in the Real World: Data, Tasks, and Methods

08/23/2023
by   Bowen Shi, et al.
0

Sign language, which conveys meaning through gestures, is the chief means of communication among deaf people. Recognizing sign language in natural settings presents significant challenges due to factors such as lighting, background clutter, and variations in signer characteristics. In this thesis, I study automatic sign language processing in the wild, using signing videos collected from the Internet. This thesis contributes new datasets, tasks, and methods. Most chapters of this thesis address tasks related to fingerspelling, an important component of sign language and yet has not been studied widely by prior work. I present three new large-scale ASL datasets in the wild: ChicagoFSWild, ChicagoFSWild+, and OpenASL. Using ChicagoFSWild and ChicagoFSWild+, I address fingerspelling recognition, which consists of transcribing fingerspelling sequences into text. I propose an end-to-end approach based on iterative attention that allows recognition from a raw video without explicit hand detection. I further show that using a Conformer-based network jointly modeling handshape and mouthing can bring performance close to that of humans. Next, I propose two tasks for building real-world fingerspelling-based applications: fingerspelling detection and search. For fingerspelling detection, I introduce a suite of evaluation metrics and a new detection model via multi-task training. To address the problem of searching for fingerspelled keywords in raw sign language videos, we propose a novel method that jointly localizes and matches fingerspelling segments to text. Finally, I will describe a benchmark for large-vocabulary open-domain sign language translation based on OpenASL. To address the challenges of sign language translation in realistic settings, we propose a set of techniques including sign search as a pretext task for pre-training and fusion of mouthing and handshape features.

READ FULL TEXT

page 29

page 30

page 33

page 35

page 37

page 39

research
03/24/2022

Searching for fingerspelled content in American Sign Language

Natural language processing for sign language video - including tasks li...
research
04/03/2021

Fingerspelling Detection in American Sign Language

Fingerspelling, in which words are signed letter by letter, is an import...
research
03/11/2022

WLASL-LEX: a Dataset for Recognising Phonological Properties in American Sign Language

Signed Language Processing (SLP) concerns the automated processing of si...
research
10/26/2018

American Sign Language fingerspelling recognition in the wild

We address the problem of American Sign Language fingerspelling recognit...
research
03/19/2023

On the Importance of Signer Overlap for Sign Language Detection

Sign language detection, identifying if someone is signing or not, is be...
research
08/28/2019

Fingerspelling recognition in the wild with iterative visual attention

Sign language recognition is a challenging gesture sequence recognition ...
research
05/17/2021

A Fine-Grained Visual Attention Approach for Fingerspelling Recognition in the Wild

Fingerspelling in sign language has been the means of communicating tech...

Please sign up or login with your details

Forgot password? Click here to reset