RWCP-SSD-Onomatopoeia: Onomatopoeic Word Dataset for Environmental Sound Synthesis

07/09/2020
by Yuki Okamoto, et al.

Environmental sound synthesis is a technique for generating natural environmental sounds. Conventional approaches that condition synthesis on sound event labels cannot finely control properties of the synthesized sounds, such as pitch and timbre. We consider that onomatopoeic words can be used for environmental sound synthesis, since they are effective for describing the features of sounds. We believe that using onomatopoeic words will enable us to control the fine time-frequency structure of synthesized sounds. However, no dataset is available for environmental sound synthesis using onomatopoeic words. In this paper, we therefore present RWCP-SSD-Onomatopoeia, a dataset consisting of 155,568 onomatopoeic words paired with audio samples for environmental sound synthesis. We also collected self-reported confidence scores and others-reported acceptance scores for the onomatopoeic words, to help investigate the difficulty of transcribing a sound and selecting a suitable word for environmental sound synthesis.

research
02/11/2021

Onoma-to-wave: Environmental sound synthesis from onomatopoeic words

In this paper, we propose a new framework for environmental sound synthe...
research
04/29/2023

Environmental sound conversion from vocal imitations and sound event labels

One way of expressing an environmental sound is using vocal imitations, ...
research
05/28/2023

CAPTDURE: Captioned Sound Dataset of Single Sources

In conventional studies on environmental sound separation and synthesis ...
research
11/20/2018

Sound-Stream II: Towards Real-Time Gesture Controlled Articulatory Sound Synthesis

We present an interface involving four degrees-of-freedom (DOF) mechanic...
research
04/17/2022

Advances in Thunder Sound Synthesis

A recent comparative study evaluated all known thunder synthesis techniq...
research
01/01/2022

Generating Adversarial Samples For Training Wake-up Word Detection Systems Against Confusing Words

Wake-up word detection models are widely used in real life, but suffer f...
research
11/30/2022

Extreme Audio Time Stretching Using Neural Synthesis

A deep neural network solution for time-scale modification (TSM) focused...
