Gloss Alignment Using Word Embeddings

08/08/2023
by   Harry Walsh, et al.
0

Capturing and annotating Sign language datasets is a time consuming and costly process. Current datasets are orders of magnitude too small to successfully train unconstrained slt models. As a result, research has turned to TV broadcast content as a source of large-scale training data, consisting of both the sign language interpreter and the associated audio subtitle. However, lack of sign language annotation limits the usability of this data and has led to the development of automatic annotation techniques such as sign spotting. These spottings are aligned to the video rather than the subtitle, which often results in a misalignment between the subtitle and spotted signs. In this paper we propose a method for aligning spottings with their corresponding subtitles using large spoken language models. Using a single modality means our method is computationally inexpensive and can be utilized in conjunction with existing alignment techniques. We quantitatively demonstrate the effectiveness of our method on the mdgs and bobsl datasets, recovering up to a 33.22 BLEU-1 score in word alignment.

READ FULL TEXT

page 2

page 3

research
08/04/2022

Automatic dense annotation of large-vocabulary sign language videos

Recently, sign language researchers have turned to sign language interpr...
research
05/06/2021

Aligning Subtitles in Sign Language Videos

The goal of this work is to temporally align asynchronous subtitles in s...
research
08/18/2023

Learnt Contrastive Concept Embeddings for Sign Recognition

In natural language processing (NLP) of spoken languages, word embedding...
research
04/13/2023

Sign Language Translation from Instructional Videos

The advances in automatic sign language translation (SLT) to spoken lang...
research
01/07/2022

Sign Language Video Retrieval with Free-Form Textual Queries

Systems that can efficiently search collections of sign language videos ...
research
05/05/2021

Content4All Open Research Sign Language Translation Datasets

Computational sign language research lacks the large-scale datasets that...
research
10/11/2022

Analyzing Text Representations under Tight Annotation Budgets: Measuring Structural Alignment

Annotating large collections of textual data can be time consuming and e...

Please sign up or login with your details

Forgot password? Click here to reset