Label Tree Embeddings for Acoustic Scene Classification

06/25/2016
by   Lars Hertel, et al.
0

We present in this paper an efficient approach for acoustic scene classification by exploring the structure of class labels. Given a set of class labels, a category taxonomy is automatically learned by collectively optimizing a clustering of the labels into multiple meta-classes in a tree structure. An acoustic scene instance is then embedded into a low-dimensional feature representation which consists of the likelihoods that it belongs to the meta-classes. We demonstrate state-of-the-art results on two different datasets for the acoustic scene classification task, including the DCASE 2013 and LITIS Rouen datasets.

READ FULL TEXT

page 1

page 2

page 3

research
09/05/2018

CNNs-based Acoustic Scene Classification using Multi-Spectrogram Fusion and Label Expansions

Spectrograms have been widely used in Convolutional Neural Networks base...
research
07/08/2016

CNN-LTE: a Class of 1-X Pooling Convolutional Neural Networks on Label Tree Embeddings for Audio Scene Recognition

We describe in this report our audio scene recognition system submitted ...
research
04/15/2021

Attentive Max Feature Map for Acoustic Scene Classification with Joint Learning considering the Abstraction of Classes

The attention mechanism has been widely adopted in acoustic scene classi...
research
04/23/2019

Acoustic scene classification using teacher-student learning with soft-labels

Acoustic scene classification identifies an input segment into one of th...
research
06/09/2023

Acoustic Scene Clustering Using Joint Optimization of Deep Embedding Learning and Clustering Iteration

Recent efforts have been made on acoustic scene classification in the au...
research
03/16/2022

Instance-level loss based multiple-instance learning for acoustic scene classification

In acoustic scene classification (ASC) task, an acoustic scene consists ...
research
09/26/2018

An extensible cluster-graph taxonomy for open set sound scene analysis

We present a new extensible and divisible taxonomy for open set sound sc...

Please sign up or login with your details

Forgot password? Click here to reset