Predicting Preferred Dialogue-to-Background Loudness Difference in Dialogue-Separated Audio

05/30/2023
by   Luca Resti, et al.
0

Dialogue Enhancement (DE) enables the rebalancing of dialogue and background sounds to fit personal preferences and needs in the context of broadcast audio. When individual audio stems are unavailable from production, Dialogue Separation (DS) can be applied to the final audio mixture to obtain estimates of these stems. This work focuses on Preferred Loudness Differences (PLDs) between dialogue and background sounds. While previous studies determined the PLD through a listening test employing original stems from production, stems estimated by DS are used in the present study. In addition, a larger variety of signal classes is considered. PLDs vary substantially across individuals (average interquartile range: 5.7 LU). Despite this variability, PLDs are found to be highly dependent on the signal type under consideration, and it is shown that median PLDs can be predicted using objective intelligibility metrics. Two existing baseline prediction methods - intended for use with original stems - displayed a Mean Absolute Error (MAE) of 7.5 LU and 5 LU, respectively. A modified baseline (MAE: 3.2 LU) and an alternative approach (MAE: 2.5 LU) are proposed. Results support the viability of processing final broadcast mixtures with DS and offering an alternative remixing that accounts for median PLDs.

READ FULL TEXT
research
06/25/2020

Dialogue Enhancement in Object-based Audio – Evaluating the Benefit on People above 65

Due to age-related hearing loss, elderly people often struggle with foll...
research
03/23/2023

Better Together: Dialogue Separation and Voice Activity Detection for Audio Personalization in TV

In TV services, dialogue level personalization is key to meeting user pr...
research
11/02/2021

Reduction of Subjective Listening Effort for TV Broadcast Signals with Recurrent Neural Networks

Listening to the audio of TV broadcast signals can be challenging for he...
research
06/05/2022

Sampling Frequency Independent Dialogue Separation

In some DNNs for audio source separation, the relevant model parameters ...
research
07/21/2021

Controlling the Remixing of Separated Dialogue with a Non-Intrusive Quality Estimate

Remixing separated audio sources trades off interferer attenuation again...
research
09/03/2017

Topic Independent Identification of Agreement and Disagreement in Social Media Dialogue

Research on the structure of dialogue has been hampered for years becaus...

Please sign up or login with your details

Forgot password? Click here to reset