Kunio Kashino

research

∙ 09/13/2023

Deep Attentive Time Warping

Similarity measures for time series are important problems for time seri...

0 Shinnosuke Matsuo, et al. ∙

research

∙ 08/23/2023

Audio Difference Captioning Utilizing Similarity-Discrepancy Disentanglement

We proposed Audio Difference Captioning (ADC) as a new extension task of...

0 Daiki Takeuchi, et al. ∙

research

∙ 05/23/2023

Masked Modeling Duo for Speech: Specializing General-Purpose Audio Representation to Speech using Denoising Distillation

Self-supervised learning general-purpose audio representations have demo...

0 Daisuke Niizumi, et al. ∙

research

∙ 10/26/2022

Masked Modeling Duo: Learning Representations by Encouraging Both Networks to Model the Input

Masked Autoencoders is a simple yet powerful self-supervised learning me...

0 Daisuke Niizumi, et al. ∙

research

∙ 09/14/2022

Reflectance-Oriented Probabilistic Equalization for Image Enhancement

Despite recent advances in image enhancement, it remains difficult for e...

0 Xiaomeng Wu, et al. ∙

research

∙ 09/14/2022

Reflectance-Guided, Contrast-Accumulated Histogram Equalization

Existing image enhancement methods fall short of expectations because wi...

0 Xiaomeng Wu, et al. ∙

research

∙ 07/25/2022

ConceptBeam: Concept Driven Target Speech Extraction

We propose a novel framework for target speech extraction based on seman...

0 Yasunori Ohishi, et al. ∙

research

∙ 07/20/2022

Introducing Auxiliary Text Query-modifier to Content-based Audio Retrieval

The amount of audio data available on public websites is growing rapidly...

0 Daiki Takeuchi, et al. ∙

research

∙ 05/17/2022

Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model

Many application studies rely on audio DNN models pre-trained on a large...

0 Daisuke Niizumi, et al. ∙

research

∙ 04/26/2022

Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representation

Recent general-purpose audio representations show state-of-the-art perfo...

0 Daisuke Niizumi, et al. ∙

research

∙ 04/15/2022

BYOL for Audio: Exploring Pre-trained General-purpose Audio Representations

Pre-trained models are essential as feature extractors in modern machine...

0 Daisuke Niizumi, et al. ∙

research

∙ 03/28/2021

Attention to Warp: Deep Metric Learning for Multivariate Time Series

Deep time series metric learning is challenging due to the difficult tra...

0 Shinnosuke Matsuo, et al. ∙

research

∙ 03/11/2021

BYOL for Audio: Self-Supervised Learning for General-Purpose Audio Representation

Inspired by the recent progress in self-supervised learning for computer...

0 Daisuke Niizumi, et al. ∙

research

∙ 09/24/2020

Effects of Word-frequency based Pre- and Post- Processings for Audio Captioning

The system we used for Task 6 (Automated Audio Captioning)of the Detecti...

0 Daiki Takeuchi, et al. ∙

research

∙ 07/01/2020

The NTT DCASE2020 Challenge Task 6 system: Automated Audio Captioning with Keywords and Sentence Length Estimation

This technical report describes the system participating to the Detectio...

0 Yuma Koizumi, et al. ∙

research

∙ 05/27/2018

Generative Adversarial Image Synthesis with Decision Tree Latent Controller

This paper proposes the decision tree latent controller generative adver...

0 Takuhiro Kaneko, et al. ∙

research

∙ 05/18/2018

Knowledge Discovery from Layered Neural Networks based on Non-negative Task Decomposition

Interpretability has become an important issue in the machine learning f...

0 Chihiro Watanabe, et al. ∙

research

∙ 04/13/2018

Understanding Community Structure in Layered Neural Networks

A layered neural network is now one of the most common choices for the p...

0 Chihiro Watanabe, et al. ∙

research

∙ 03/01/2017

Modular Representation of Layered Neural Networks

Layered neural networks have greatly improved the performance of various...

0 Chihiro Watanabe, et al. ∙

research

∙ 04/01/2010

A stochastic model of human visual attention with a dynamic Bayesian network

Recent studies in the field of human vision science suggest that the hum...

0 Akisato Kimura, et al. ∙

Kunio Kashino

Featured Co-authors

Sign in with Google

Consider DeepAI Pro