We introduce VampNet, a masked acoustic token modeling approach to music...
Despite phenomenal progress in recent years, state-of-the-art music
sepa...
Localizing visual sounds consists on locating the position of objects th...
We showcase an unsupervised method that repurposes deep models trained f...
We introduce the Free Universal Sound Separation (FUSS) dataset, a new c...
We propose a benchmark of state-of-the-art sound event detection systems...
Supervised deep learning methods for performing audio source separation ...
Clipping the gradient is a known approach to improving gradient descent,...
We present OtoWorld, an interactive environment in which agents must lea...
Performing sound event detection on real-world recordings often implies
...
Deep learning has rapidly become the state-of-the-art approach for music...
Deep generative systems that learn probabilistic models from a corpus of...
Audio source separation is the process of separating a mixture (e.g. a p...
Separating an audio scene such as a cocktail party into constituent,
mea...
We present a single deep learning architecture that can both separate an...
Music source separation performance has greatly improved in recent years...
Isolating individual instruments in a musical mixture has a myriad of
po...
Separating an audio scene into isolated sources is a fundamental problem...