Inferring Pitch from Coarse Spectral Features

04/10/2022
by   Danni Ma, et al.
0

Fundamental frequency (F0) has long been treated as the physical definition of "pitch" in phonetic analysis. But there have been many demonstrations that F0 is at best an approximation to pitch, both in production and in perception: pitch is not F0, and F0 is not pitch. Changes in the pitch involve many articulatory and acoustic covariates; pitch perception often deviates from what F0 analysis predicts; and in fact, quasi-periodic signals from a single voice source are often incompletely characterized by an attempt to define a single time-varying F0. In this paper, we find strong support for the existence of covariates for pitch in aspects of relatively coarse spectra, in which an overtone series is not available. Thus linear regression can predict the pitch of simple vocalizations, produced by an articulatory synthesizer or by human, from single frames of such coarse spectra. Across speakers, and in more complex vocalizations, our experiments indicate that the covariates are not quite so simple, though apparently still available for more sophisticated modeling. On this basis, we propose that the field needs a better way of thinking about speech pitch, just as celestial mechanics requires us to go beyond Newton's point mass approximations to heavenly bodies.

READ FULL TEXT

page 2

page 4

research
05/31/2020

Maximum Voiced Frequency Estimation: Exploiting Amplitude and Phase Spectra

Maximum Voiced Frequency (MVF) is used in various speech models as the s...
research
09/14/2020

A study of vowel nasalization using instantaneous spectra

Nasalization of vowels is a phenomenon where oral and nasal tracts parti...
research
09/13/2023

Reorganization of the auditory-perceptual space across the human vocal range

We analyzed the auditory-perceptual space across a substantial portion o...
research
09/29/2021

Adaptive Bayesian Sum of Trees Model for Covariate Dependent Spectral Analysis

This article introduces a flexible and adaptive nonparametric method for...
research
10/07/2021

Sonorant spectra and coarticulation distinguish speakers with different dialects

The aim of this study is to determine the effect of language varieties o...
research
02/23/2018

Do WaveNets Dream of Acoustic Waves?

Various sources have reported the WaveNet deep learning architecture bei...

Please sign up or login with your details

Forgot password? Click here to reset