Multitask learning for frame-level instrument recognition

11/03/2018
by   Yun-Ning Hung, et al.
0

For many music analysis problems, we need to know the presence of instruments for each time frame in a multi-instrument musical piece. However, such a frame-level instrument recognition task remains difficult, mainly due to the lack of labeled datasets. To address this issue, we present in this paper a large-scale dataset that contains synthetic polyphonic music with frame-level pitch and instrument labels. Moreover, we propose a simple yet novel network architecture to jointly predict the pitch and instrument for each frame. With this multitask learning method, the pitch information can be leveraged to predict the instruments, and also the other way around. And, by using the so-called pianoroll representation of music as the main target output of the model, our model also predicts the instruments that play each individual note event. We validate the effectiveness of the proposed method for framelevel instrument recognition by comparing it with its singletask ablated versions and three state-of-the-art methods. We also demonstrate the result of the proposed method for multipitch streaming with real-world music. For reproducibility, we will share the code to crawl the data and to implement the proposed model at: https://github.com/biboamy/ instrument-streaming.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/25/2018

Frame-level Instrument Recognition by Timbre and Pitch

Instrument recognition is a fundamental task in music information retrie...
research
05/05/2018

Weakly-supervised Visual Instrument-playing Action Detection in Videos

Instrument playing is among the most common scenes in music-related vide...
research
11/08/2018

Learning Disentangled Representations for Timber and Pitch in Music Audio

Timbre and pitch are the two main perceptual properties of musical sound...
research
04/04/2023

A2D: Anywhere Anytime Drumming

The drum kit, which has only been around for around 100 years, is a popu...
research
07/01/2017

An Augmented Lagrangian Method for Piano Transcription using Equal Loudness Thresholding and LSTM-based Decoding

A central goal in automatic music transcription is to detect individual ...
research
07/09/2019

An Attention Mechanism for Musical Instrument Recognition

While the automatic recognition of musical instruments has seen signific...
research
01/24/2020

Learning Multi-instrument Classification with Partial Labels

Multi-instrument recognition is the task of predicting the presence or a...

Please sign up or login with your details

Forgot password? Click here to reset