Towards a Perceived Audiovisual Quality Model for Immersive Content
May 19, 2020 · Declared Dead · International Workshop on Quality of Multimedia Experience
"No code URL or promise found in abstract"
Evidence collected by the PWNC Scanner
Authors
Randy Frans Fela, Nick Zacharov, Søren Forchhammer
arXiv ID
2005.09309
Category
cs.MM: Multimedia
Citations
13
Venue
International Workshop on Quality of Multimedia Experience
Last Checked
2 months ago
Abstract
This paper studies the quality of multimedia content focusing on 360 video and ambisonic spatial audio reproduced using a head-mounted display and a multichannel loudspeaker setup. Encoding parameters following basic video quality test conditions for 360 videos were selected and a low-bitrate codec was used for the audio encoder. Three subjective experiments were performed for audio, video, and audiovisual content, respectively. Peak signal-to-noise ratio (PSNR) and its variants for 360 videos were computed to obtain objective quality metrics and subsequently correlated with the subjective video scores. This study shows that a Cross-Format SPSNR-NN has a slightly higher linear and monotonic correlation over all video sequences. Based on the audiovisual model, a power model shows the highest correlation between test data and predicted scores. We concluded that to enable the development of a superior predictive model, a high quality, critical, synchronized audiovisual database is required. Furthermore, comprehensive assessor training may be beneficial prior to the testing to improve the assessors' discrimination ability, particularly with respect to multichannel audio reproduction. In order to further improve the performance of audiovisual quality models for immersive content, in addition to developing broader and critical audiovisual databases, the subjective testing methodology needs to evolve to provide greater resolution and robustness.
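The abstract's evaluation step (compute a PSNR-type metric, then measure its linear and monotonic correlation with subjective scores) can be illustrated with a minimal sketch. It uses plain PSNR on flat pixel sequences rather than the spherical S-PSNR-NN variant named in the abstract, and assumes the "linear" and "monotonic" correlations are Pearson and Spearman coefficients. All function names and the sample values are hypothetical and are not taken from the paper.

```python
import math

def psnr(reference, distorted, peak=255.0):
    """Plain PSNR in dB between two equal-length sequences of pixel values."""
    if len(reference) != len(distorted) or not reference:
        raise ValueError("inputs must be non-empty and of equal length")
    mse = sum((r - d) ** 2 for r, d in zip(reference, distorted)) / len(reference)
    if mse == 0:
        return math.inf  # identical signals have unbounded PSNR
    return 10.0 * math.log10(peak ** 2 / mse)

def pearson(x, y):
    """Pearson coefficient: strength of the linear relationship."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2.0 + 1.0
        for k in range(i, j + 1):
            result[order[k]] = avg
        i = j + 1
    return result

def spearman(x, y):
    """Spearman coefficient: Pearson correlation of the ranks (monotonicity)."""
    return pearson(ranks(x), ranks(y))

# Hypothetical illustrative values, NOT data from the paper.
objective_db = [28.1, 31.4, 34.0, 36.5, 39.2]   # metric value per test condition
subjective = [22.0, 41.0, 58.0, 70.0, 76.0]     # mean subjective score per condition

plcc = pearson(objective_db, subjective)
srocc = spearman(objective_db, subjective)
print(f"PLCC={plcc:.3f}  SROCC={srocc:.3f}")
```

Because the toy scores rise strictly with the metric, the rank correlation is 1 while the linear correlation is slightly lower, which is the distinction the two coefficients are meant to capture.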
Similar Papers
In the same crypt: Multimedia
R.I.P. 👻 Ghosted
Old Age
Quality Assessment of In-the-Wild Videos
R.I.P. 👻 Ghosted
Viewport-Adaptive Navigable 360-Degree Video Delivery
R.I.P. 👻 Ghosted
A Comprehensive Survey on Cross-modal Retrieval
R.I.P. 👻 Ghosted
An Overview of Cross-media Retrieval: Concepts, Methodologies, Benchmarks and Challenges
R.I.P. 👻 Ghosted
A Convolutional Neural Network Approach for Post-Processing in HEVC Intra Coding
Died the same way: 👻 Ghosted
R.I.P. 👻 Ghosted
Language Models are Few-Shot Learners
R.I.P. 👻 Ghosted
PyTorch: An Imperative Style, High-Performance Deep Learning Library
R.I.P. 👻 Ghosted
XGBoost: A Scalable Tree Boosting System
R.I.P. 👻 Ghosted