Similarity measures for time series are important problems for time seri...
We proposed Audio Difference Captioning (ADC) as a new extension task of...
Self-supervised learning general-purpose audio representations have
demo...
Masked Autoencoders is a simple yet powerful self-supervised learning me...
Despite recent advances in image enhancement, it remains difficult for
e...
Existing image enhancement methods fall short of expectations because wi...
We propose a novel framework for target speech extraction based on seman...
The amount of audio data available on public websites is growing rapidly...
Many application studies rely on audio DNN models pre-trained on a
large...
Recent general-purpose audio representations show state-of-the-art
perfo...
Pre-trained models are essential as feature extractors in modern machine...
Deep time series metric learning is challenging due to the difficult
tra...
Inspired by the recent progress in self-supervised learning for computer...
The system we used for Task 6 (Automated Audio Captioning)of the Detecti...
This technical report describes the system participating to the Detectio...
This paper proposes the decision tree latent controller generative
adver...
Interpretability has become an important issue in the machine learning f...
A layered neural network is now one of the most common choices for the
p...
Layered neural networks have greatly improved the performance of various...
Recent studies in the field of human vision science suggest that the hum...