Synthesizing Diverse, High-Quality Audio Textures

06/20/2018
by   Joseph Antognini, et al.
0

Texture synthesis techniques based on matching the Gram matrix of feature activations in neural networks have achieved spectacular success in the image domain. In this paper we extend these techniques to the audio domain. We demonstrate that synthesizing diverse audio textures is challenging, and argue that this is because audio data is relatively low-dimensional. We therefore introduce two new terms to the original Grammian loss: an autocorrelation term that preserves rhythm, and a diversity term that encourages the optimization procedure to synthesize unique textures. We quantitatively study the impact of our design choices on the quality of the synthesized audio by introducing an audio analogue to the Inception loss which we term the VGGish loss. We show that there is a trade-off between the diversity and quality of the synthesized audio using this technique. We additionally perform a number of experiments to qualitatively study how these design choices impact the quality of the synthesized audio. Finally we describe the implications of these results for the problem of audio style transfer.

READ FULL TEXT
research
08/23/2022

Parameter Sensitivity of Deep-Feature based Evaluation Metrics for Audio Textures

Standard evaluation metrics such as the Inception score and Fréchet Audi...
research
01/29/2019

Applying Visual Domain Style Transfer and Texture Synthesis Techniques to Audio - Insights and Challenges

Style transfer is a technique for combining two images based on the acti...
research
05/09/2019

Sound texture synthesis using convolutional neural networks

The following article introduces a new parametric synthesis algorithm fo...
research
10/31/2017

Audio style transfer

"Style transfer" among images has recently emerged as a very active rese...
research
11/18/2019

Learning to Synthesize Fashion Textures

Existing unconditional generative models mainly focus on modeling genera...
research
11/13/2020

Benchmarking Domain Randomisation for Visual Sim-to-Real Transfer

Domain randomisation is a very popular method for visual sim-to-real tra...
research
06/17/2018

Cover Song Synthesis by Analogy

In this work, we pose and address the following "cover song analogies" p...

Please sign up or login with your details

Forgot password? Click here to reset