Simultaneous Translation for Unsegmented Input: A Sliding Window Approach

10/18/2022
by   Sukanta Sen, et al.
15

In the cascaded approach to spoken language translation (SLT), the ASR output is typically punctuated and segmented into sentences before being passed to MT, since the latter is typically trained on written text. However, erroneous segmentation, due to poor sentence-final punctuation by the ASR system, leads to degradation in translation quality, especially in the simultaneous (online) setting where the input is continuously updated. To reduce the influence of automatic segmentation, we present a sliding window approach to translate raw ASR outputs (online or offline) without needing to rely on an automatic segmenter. We train translation models using parallel windows (instead of parallel sentences) extracted from the original training data. At test time, we translate at the window level and join the translated windows using a simple approach to generate the final translation. Experiments on English-to-German and English-to-Czech show that our approach improves 1.3–2.0 BLEU points over the usual ASR-segmenter pipeline, and the fixed-length window considerably reduces flicker compared to a baseline retranslation-based online SLT system.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/25/2020

Jointly Trained Transformers models for Spoken Language Translation

Conventional spoken language translation (SLT) systems are pipeline base...
research
09/13/2015

The USFD Spoken Language Translation System for IWSLT 2014

The University of Sheffield (USFD) participated in the International Wor...
research
10/22/2020

A Technical Report: BUT Speech Translation Systems

The paper describes the BUT's speech translation systems. The systems ar...
research
05/30/2020

Dynamic Masking for Improved Stability in Spoken Language Translation

For spoken language translation (SLT) in live scenarios such as conferen...
research
06/11/2021

Direct Simultaneous Speech-to-Text Translation Assisted by Synchronized Streaming ASR

Simultaneous speech-to-text translation is widely useful in many scenari...
research
07/24/2021

The USYD-JD Speech Translation System for IWSLT 2021

This paper describes the University of Sydney JD's joint submission o...
research
05/12/2022

AppTek's Submission to the IWSLT 2022 Isometric Spoken Language Translation Task

To participate in the Isometric Spoken Language Translation Task of the ...

Please sign up or login with your details

Forgot password? Click here to reset