VOP Detection for Read and Conversation Speech using CWT Coefficients and Phone Boundaries

08/23/2019
by   Kumud Tripathi, et al.

In this paper, we propose a novel approach for accurate detection of vowel onset points (VOPs). A VOP is the instant at which a vowel begins in the speech signal. Precise identification of VOPs is important for speech applications such as speech segmentation and speech rate modification. Existing methods detect the majority of VOPs only within a 40 ms deviation, which may not be precise enough for these applications. To address this issue, we propose a two-stage approach. In the first stage, VOPs are detected using continuous wavelet transform (CWT) coefficients. In the second stage, the positions of the detected VOPs are corrected using phone boundaries, which are detected by the spectral transition measure method. Experiments are conducted on the TIMIT and Bengali speech corpora. The performance of the proposed approach is compared with that of two standard signal-processing-based methods. The evaluation results show that the proposed method performs better than the existing methods.
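
The second stage can be illustrated with a minimal sketch. The abstract does not state the exact correction rule, so the code below assumes a simple one: each candidate VOP is moved to the nearest phone boundary if that boundary lies within a tolerance, and is left unchanged otherwise. The function name `correct_vops`, the parameter `max_shift`, and its 40 ms default are hypothetical choices for illustration, not details taken from the paper.

```python
from bisect import bisect_left


def correct_vops(candidate_vops, phone_boundaries, max_shift=0.04):
    """Snap each candidate VOP (in seconds) to the nearest phone boundary.

    Assumption: a candidate is moved only if the nearest boundary is within
    max_shift seconds; otherwise it is kept as detected in stage one.
    """
    boundaries = sorted(phone_boundaries)
    corrected = []
    for vop in candidate_vops:
        i = bisect_left(boundaries, vop)
        # The nearest boundary is either just before or just after the candidate.
        neighbours = boundaries[max(i - 1, 0):i + 1]
        nearest = min(neighbours, key=lambda b: abs(b - vop), default=None)
        if nearest is not None and abs(nearest - vop) <= max_shift:
            corrected.append(nearest)
        else:
            corrected.append(vop)
    return corrected


# Hypothetical stage-one candidates and phone boundaries, in seconds.
stage_one_vops = [0.10, 0.50]
boundaries = [0.12, 0.30, 0.70]
print(correct_vops(stage_one_vops, boundaries))
```

In this example the first candidate is 20 ms from a boundary and is moved onto it, while the second has no boundary within the tolerance and stays where stage one placed it.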


research
04/29/2020

Robust Phonetic Segmentation Using Spectral Transition measure for Non-Standard Recording Environments

Phone level localization of mis-articulation is a key requirement for an...
research
06/15/2020

Catplayinginthesnow: Impact of Prior Segmentation on a Model of Visually Grounded Speech

We investigate the effect of introducing phone, syllable, or word bounda...
research
10/07/2015

Hierarchical Representation of Prosody for Statistical Speech Synthesis

Prominences and boundaries are the essential constituents of prosodic st...
research
03/29/2022

Automatic Detection of Speech Sound Disorder in Child Speech Using Posterior-based Speaker Representations

This paper presents a macroscopic approach to automatic detection of spe...
research
04/02/2021

Unsupervised Acoustic Unit Discovery by Leveraging a Language-Independent Subword Discriminative Feature Representation

This paper tackles automatically discovering phone-like acoustic units (...
research
04/14/2019

SpeechYOLO: Detection and Localization of Speech Objects

In this paper, we propose to apply object detection methods from the vis...
research
06/16/2019

Using Automatically Extracted Minimum Spans to Disentangle Coreference Evaluation from Boundary Detection

The common practice in coreference resolution is to identify and evaluat...
