Hierarchical I3D for Sign Spotting

10/03/2022
by Ryan Wong, et al.

Most vision-based sign language research to date has focused on Isolated Sign Language Recognition (ISLR), where the objective is to predict a single sign class given a short video clip. Although ISLR has seen significant progress, its real-life applications are limited. In this paper, we focus instead on the challenging task of Sign Spotting, where the goal is to simultaneously identify and localise signs in continuous, co-articulated sign videos. To address the limitations of current ISLR-based models, we propose a hierarchical sign spotting approach that learns coarse-to-fine spatio-temporal sign features, taking advantage of representations at various temporal levels to provide more precise sign localisation. Specifically, we develop the Hierarchical Sign I3D model (HS-I3D), which consists of a hierarchical network head attached to the existing spatio-temporal I3D model to exploit features at different layers of the network. We evaluate HS-I3D on the MSSL track of the ChaLearn 2022 Sign Spotting Challenge and achieve a state-of-the-art F1 score of 0.607, making it the top-1 winning solution of the competition.
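To make the task concrete, here is a minimal sketch of how a sign spotting prediction could be scored with F1, where a prediction counts as correct only if both the sign class and the temporal location match. The abstract does not describe the challenge's exact evaluation protocol, so everything here is an assumption for illustration: the `(sign_class, start_frame, end_frame)` tuple format, the greedy one-to-one matching, the single IoU threshold of 0.5, and the names `temporal_iou` and `spotting_f1`.

```python
from typing import List, Tuple

# Hypothetical annotation format: (sign_class, start_frame, end_frame),
# with end_frame exclusive. Not taken from the paper or the challenge.
Spot = Tuple[int, int, int]


def temporal_iou(a: Spot, b: Spot) -> float:
    """Intersection-over-union of two temporal intervals (classes ignored)."""
    inter = max(0, min(a[2], b[2]) - max(a[1], b[1]))
    union = (a[2] - a[1]) + (b[2] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def spotting_f1(preds: List[Spot], truths: List[Spot],
                iou_threshold: float = 0.5) -> float:
    """F1 where a prediction is a true positive if it has the same class as
    an unmatched ground-truth sign and overlaps it with IoU >= threshold.
    Greedy matching in prediction order (an assumed, simplified rule)."""
    matched = set()  # indices of ground-truth signs already claimed
    tp = 0
    for p in preds:
        best_iou, best_idx = 0.0, None
        for i, t in enumerate(truths):
            if i in matched or t[0] != p[0]:
                continue  # already matched, or wrong sign class
            iou = temporal_iou(p, t)
            if iou > best_iou:
                best_iou, best_idx = iou, i
        if best_idx is not None and best_iou >= iou_threshold:
            matched.add(best_idx)
            tp += 1
    fp = len(preds) - tp   # predictions with no valid match
    fn = len(truths) - tp  # ground-truth signs that were missed
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


truths = [(3, 10, 30), (7, 40, 60)]
preds = [(3, 12, 28),   # right class, IoU 0.8: true positive
         (7, 55, 80),   # right class, IoU 0.125: poorly localised
         (5, 0, 5)]     # class not present in the ground truth
print(spotting_f1(preds, truths))  # 1 TP, 2 FP, 1 FN -> F1 = 0.4
```

The second prediction shows why localisation matters in this setting: the sign class is right, but the interval overlaps the true sign too little to count, so it is penalised both as a false positive and as a miss.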


research
10/24/2021

Using Motion History Images with 3D Convolutional Networks in Isolated Sign Language Recognition

Sign language recognition using computational models is a challenging pr...
research
09/01/2022

Topic Detection in Continuous Sign Language Videos

Significant progress has been made recently on challenging tasks in auto...
research
11/14/2021

Sign Language Translation with Hierarchical Spatio-Temporal Graph Neural Network

Sign language translation (SLT), which generates text in a spoken langua...
research
10/12/2020

TSPNet: Hierarchical Feature Learning via Temporal Semantic Pyramid for Sign Language Translation

Sign language translation (SLT) aims to interpret sign video sequences i...
research
11/25/2020

Sign language segmentation with temporal convolutional networks

The objective of this work is to determine the location of temporal boun...
research
10/01/2021

Phonology Recognition in American Sign Language

Inspired by recent developments in natural language processing, we propo...
research
01/12/2021

Context Matters: Self-Attention for Sign Language Recognition

This paper proposes an attentional network for the task of Continuous Si...
