Perceived Audiovisual Quality Modelling based on Decision Trees, Genetic Programming and Neural Networks

by Edip Demirbilek, et al.

Our objective is to build machine-learning-based models that predict audiovisual quality directly from a set of correlated parameters extracted from a target quality dataset. We have used the bitstream version of the INRS audiovisual quality dataset, which reflects contemporary real-time configurations for video frame rate, video quantization, noise reduction parameters and network packet loss rate. We have used this dataset to build bitstream perceived-quality estimation models based on Random Forests, Bagging, Deep Learning and Genetic Programming. Taking an empirical approach, we have generated models ranging from very simple to the most complex, depending on the number of features used from the quality dataset. Random Forests and Bagging models have produced the most accurate results overall in terms of RMSE and Pearson correlation coefficient. Deep Learning and Genetic Programming based bitstream models have also achieved good results, but only over a limited range of features. We have also obtained the epsilon-insensitive RMSE for each model and have computed the significance of the difference between the correlation coefficients. Overall, we conclude that bitstream information is worth the effort it takes to compute and helps to build more accurate models for real-time communications. However, it is useful only when the right algorithms are deployed with a carefully selected subset of the features. The dataset and tools developed during this research are publicly available for research and development purposes.
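The abstract names its evaluation metrics without defining them, so the following is a minimal sketch of how they are commonly computed, not the authors' code. It assumes the ITU-T P.1401 style definition of epsilon-insensitive RMSE (each error is reduced by the 95% confidence interval of the subjective score) and a Fisher z-test that treats the two correlation coefficients as independent. The function names, the `ci95` input and the `dof` parameter are all hypothetical choices made for illustration.

```python
import math


def rmse(mos, pred):
    """Root mean squared error between subjective scores and predictions."""
    n = len(mos)
    return math.sqrt(sum((m - p) ** 2 for m, p in zip(mos, pred)) / n)


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def epsilon_insensitive_rmse(mos, pred, ci95, dof=1):
    """RMSE that ignores errors falling inside each score's 95% CI.

    Assumption: P.1401-style definition; `dof` stands for the degrees of
    freedom of any mapping function applied to the predictions.
    """
    n = len(mos)
    clipped = [max(0.0, abs(m - p) - c) for m, p, c in zip(mos, pred, ci95)]
    return math.sqrt(sum(e * e for e in clipped) / (n - dof))


def fisher_z_statistic(r1, r2, n1, n2):
    """z statistic for the difference between two correlation coefficients.

    Assumption: the two coefficients come from independent samples.
    """
    z1, z2 = math.atanh(r1), math.atanh(r2)
    return (z1 - z2) / math.sqrt(1.0 / (n1 - 3) + 1.0 / (n2 - 3))


# Toy values, invented purely for illustration (not from the INRS dataset).
mos = [1.0, 2.0, 3.0, 4.0, 5.0]
pred = [1.5, 2.0, 2.5, 4.0, 4.5]
ci95 = [0.2] * 5
plain = rmse(mos, pred)
relaxed = epsilon_insensitive_rmse(mos, pred, ci95)
```

Under these assumptions the epsilon-insensitive value is never larger than the error computed without the confidence-interval allowance, and an absolute z statistic above about 1.96 would indicate a difference significant at the 5% level.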




