A Perceptual Model of Musical Mix Clarity using Decomposition and Masking Thresholds

03/22/2021
by   Andrew Parker, et al.
0

Objective measurement of perceptually motivated music attributes has application in both target driven mixing and mastering methodologies and music information retrieval. This work proposes a perceptual model of mix clarity which decomposes a mixed input signal into transient, steady-state, and residual components. Masking thresholds are calculated for each component and their relative relationship is used to determine an overall masking score as the model's output. Three variants of the model were tested against subjective mix clarity scores gathered from a controlled listening test. The best performing variant achieved a Spearman's rank correlation of rho = 0.8382 (p<0.01). Furthermore, the model output was analysed using an independent dataset generated by progressively applying degradation effects to the test stimuli. Analysis of the model suggested a close relationship between the proposed model and the subjective mix clarity scores particularly when masking was measured using linearly spaced analysis bands. Moreover, the presence of noise-like residual signals was shown to have a negative effect on the perceived mix clarity.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/13/2023

An Improved Metric of Informational Masking for Perceptual Audio Quality Measurement

Perceptual audio quality measurement systems algorithmically analyze the...
research
06/14/2021

Tracing Back Music Emotion Predictions to Sound Sources and Intuitive Perceptual Qualities

Music emotion recognition is an important task in MIR (Music Information...
research
08/26/2023

A Comprehensive Survey for Evaluation Methodologies of AI-Generated Music

In recent years, AI-generated music has made significant progress, with ...
research
12/08/2022

A Data-driven Cognitive Salience Model for Objective Perceptual Audio Quality Assessment

Objective audio quality measurement systems often use perceptual models ...
research
03/18/2022

Towards a Perceptual Model for Estimating the Quality of Visual Speech

Generating realistic lip motions to simulate speech production is key fo...
research
06/08/2020

Zero resource speech synthesis using transcripts derived from perceptual acoustic units

Zerospeech synthesis is the task of building vocabulary independent spee...
research
11/02/2017

Identification of potential Music Information Retrieval technologies for computer-aided jingju singing training

Music Information Retrieval (MIR) technologies have been proven useful i...

Please sign up or login with your details

Forgot password? Click here to reset