A Cascade Sequence-to-Sequence Model for Chinese Mandarin Lip Reading

08/14/2019
by Ya Zhao, et al.

Lip reading aims to decode text from the movement of a speaker's mouth. In recent years, lip reading methods have made great progress for English, at both the word level and the sentence level. Unlike English, however, Chinese Mandarin is a tone-based language that relies on pitch to distinguish lexical or grammatical meaning, which significantly increases the ambiguity of the lip reading task. In this paper, we propose a Cascade Sequence-to-Sequence Model for Chinese Mandarin (CSSMCM) lip reading, which explicitly models tones when predicting sentences. Tones are modeled based on visual information and syntactic structure, and are then used, together with visual information and syntactic structure, to predict sentences. To evaluate CSSMCM, a dataset called CMLR (Chinese Mandarin Lip Reading) is collected and released, consisting of over 100,000 natural sentences from the China Network Television website. When trained on the CMLR dataset, the proposed CSSMCM surpasses the performance of state-of-the-art lip reading frameworks, which confirms the effectiveness of explicitly modeling tones for Chinese Mandarin lip reading.
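The tonal ambiguity described above can be illustrated with a toy lookup. This is only a sketch and not the CSSMCM model, which is a neural sequence-to-sequence architecture: the table `TONE_TABLE` and the function `candidates` are hypothetical names introduced here for illustration. It shows how a single toneless syllable matches several characters, and how knowing the tone narrows the choice.

```python
# Toy illustration of tonal ambiguity in Mandarin; NOT the CSSMCM model.
# Hypothetical lookup table: (toneless syllable, tone number) -> one common character.
TONE_TABLE = {
    ("ma", 1): "妈",  # first tone, "mother"
    ("ma", 2): "麻",  # second tone, "hemp"
    ("ma", 3): "马",  # third tone, "horse"
    ("ma", 4): "骂",  # fourth tone, "to scold"
}


def candidates(syllable, tone=None):
    """Return the characters matching a syllable, optionally filtered by tone."""
    return [
        char
        for (syl, t), char in TONE_TABLE.items()
        if syl == syllable and (tone is None or t == tone)
    ]


# Without tone information, every entry for the syllable remains possible.
print(candidates("ma"))
# With the tone known, the candidate set shrinks.
print(candidates("ma", 3))
```

Real Mandarin has many homophones even within a single tone, so tone alone does not resolve every ambiguity; the sketch only shows why ignoring tone makes the problem harder.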


