Acoustic correlates of the syllabic rhythm of speech: Modulation spectrum or local features of the temporal envelope

01/14/2023
by Yuran Zhang, et al.

The syllable is a perceptually salient unit in speech. Since both the syllable and its acoustic correlate, i.e., the speech envelope, have a preferred range of rhythmicity between 4 and 8 Hz, it is hypothesized that theta-band neural oscillations play a major role in extracting syllables based on the envelope. A literature survey, however, reveals inconsistent evidence about the relationship between the speech envelope and syllables, and the current study revisits this question by analyzing large speech corpora. It is shown that the center frequency of the speech envelope, characterized by the modulation spectrum, reliably correlates with the rate of syllables only when the analysis is pooled over minutes of speech recordings. In contrast, in the time domain, a component of the speech envelope is reliably phase-locked to syllable onsets. Based on a speaker-independent model, the timing of syllable onsets explains about 24% of the variance of the speech envelope. These results indicate that local features in the speech envelope, instead of the modulation spectrum, are a more reliable acoustic correlate of syllables.
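
To make the term "modulation spectrum" concrete, the following is a minimal sketch, not the authors' pipeline. It assumes a crude rectify-and-smooth envelope (a stand-in for whatever envelope extraction the study uses) and a direct DFT of the mean-removed envelope. The function names, the 20 ms smoothing window, and the synthetic test signal (a 200 Hz carrier amplitude-modulated at 5 Hz) are all illustrative assumptions.

```python
import math

def envelope(signal, fs, win_s=0.02):
    """Crude temporal envelope: full-wave rectify, then causal moving average."""
    n = max(1, int(win_s * fs))          # smoothing window in samples (assumed 20 ms)
    rect = [abs(x) for x in signal]
    out, acc = [], 0.0
    for i, x in enumerate(rect):
        acc += x
        if i >= n:
            acc -= rect[i - n]           # drop the sample leaving the window
        out.append(acc / min(i + 1, n))
    return out

def modulation_spectrum(env, fs, freqs):
    """Power of the mean-removed envelope at each modulation frequency (direct DFT)."""
    mean = sum(env) / len(env)
    centered = [e - mean for e in env]
    power = []
    for f in freqs:
        re = sum(c * math.cos(2 * math.pi * f * i / fs) for i, c in enumerate(centered))
        im = sum(c * math.sin(2 * math.pi * f * i / fs) for i, c in enumerate(centered))
        power.append((re * re + im * im) / len(env) ** 2)
    return power

def peak_frequency(freqs, power):
    """Modulation frequency with the largest power."""
    return freqs[max(range(len(power)), key=power.__getitem__)]

# Synthetic stand-in for speech: 200 Hz carrier, amplitude-modulated at 5 Hz.
fs, dur, rate, carrier = 1000, 2.0, 5.0, 200.0
t = [i / fs for i in range(int(fs * dur))]
signal = [(1 + math.cos(2 * math.pi * rate * x)) * math.sin(2 * math.pi * carrier * x)
          for x in t]

env = envelope(signal, fs)
freqs = [0.5 * k for k in range(2, 33)]  # 1 to 16 Hz in 0.5 Hz steps
power = modulation_spectrum(env, fs, freqs)
peak = peak_frequency(freqs, power)
print("peak modulation frequency (Hz):", peak)
```

For this synthetic input the peak should fall at the 5 Hz modulation rate built into the signal. Real speech is far less regular, which is consistent with the abstract's point that the spectral peak tracks syllable rate reliably only when pooled over minutes of recordings.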


