Hierarchical Linear Dynamical System for Representing Notes from Recorded Audio

02/27/2022
by   Leila Kalantari, et al.
0

We seek to develop simultaneous segmentation and classification of notes from audio recordings in presence of outliers. The selected architecture for modeling time series is hierarchical linear dynamical system (HLDS). We propose a novel method for its parameter setting. HLDS can potentially be employed in two ways: 1) simultaneous segmentation and clustering for exploring data, i.e. finding unknown notes, 2) simultaneous segmentation and classification of audio recording for finding the notes of interest in the presence of outliers. We adapted HLDS for the second purpose since it is an easier task and still a challenging problem, e.g. in the field of bioacoustics. Each test clip has the same notes (but different instances) as of the training clip and also contain outlier notes. At test, it is automatically decided to which class of interest a note belongs to if any. Two applications of this work are to the fields of bioacoustics for detection of animal sounds in audio field recordings and also to musicology. Experiments have been conducted for segmentation and classification of both avian and musical notes from recorded audio.

READ FULL TEXT
research
08/17/2016

Lecture Notes on Spectral Graph Methods

These are lecture notes that are based on the lectures from a class I ta...
research
12/17/2021

MIDI-DDSP: Detailed Control of Musical Performance via Hierarchical Modeling

Musical expression requires control of both what notes are played, and h...
research
03/13/2018

Learning to Recognize Musical Genre from Audio

We here summarize our experience running a challenge with open data for ...
research
08/04/2023

Finding Tori: Self-supervised Learning for Analyzing Korean Folk Song

In this paper, we introduce a computational analysis of the field record...
research
08/29/2021

Uncertainty quantification for multiclass data description

In this manuscript, we propose a multiclass data description model based...
research
10/05/2020

High-resolution Piano Transcription with Pedals by Regressing Onsets and Offsets Times

Automatic music transcription (AMT) is the task of transcribing audio re...
research
04/09/2018

Polyphonic Pitch Tracking with Deep Layered Learning

This paper presents a polyphonic pitch tracking system able to extract b...

Please sign up or login with your details

Forgot password? Click here to reset