Comprehending Real Numbers: Development of Bengali Real Number Speech Corpus

03/27/2018
by   Md Mahadi Hasan Nahid, et al.
0

Speech recognition has received a less attention in Bengali literature due to the lack of a comprehensive dataset. In this paper, we describe the development process of the first comprehensive Bengali speech dataset on real numbers. It comprehends all the possible words that may arise in uttering any Bengali real number. The corpus has ten speakers from the different regions of Bengali native people. It comprises of more than two thousands of speech samples in a total duration of closed to four hours. We also provide a deep analysis of our corpus, highlight some of the notable features of it, and finally evaluate the performances of two of the notable Bengali speech recognizers on it.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/12/2022

Huqariq: A Multilingual Speech Corpus of Native Languages of Peru for Speech Recognition

The Huqariq corpus is a multilingual collection of speech from native Pe...
research
07/30/2021

USC: An Open-Source Uzbek Speech Corpus and Initial Speech Recognition Experiments

We present a freely available speech corpus for the Uzbek language and r...
research
07/17/2023

ivrit.ai: A Comprehensive Dataset of Hebrew Speech for AI Research and Development

We introduce "ivrit.ai", a comprehensive Hebrew speech dataset, addressi...
research
01/20/2021

VOTE400(Voide Of The Elderly 400 Hours): A Speech Dataset to Study Voice Interface for Elderly-Care

This paper introduces a large-scale Korean speech dataset, called VOTE40...
research
07/16/2019

RadioTalk: a large-scale corpus of talk radio transcripts

We introduce RadioTalk, a corpus of speech recognition transcripts sampl...
research
09/22/2020

A Crowdsourced Open-Source Kazakh Speech Corpus and Initial Speech Recognition Baseline

We present an open-source speech corpus for the Kazakh language. The Kaz...
research
02/20/2021

The Use of Voice Source Features for Sung Speech Recognition

In this paper, we ask whether vocal source features (pitch, shimmer, jit...

Please sign up or login with your details

Forgot password? Click here to reset