Streaming VR for Immersion: Quality aspects of Compressed Spatial Audio

Files in This Item:
VSMM2017_camera-ready.pdf (382.22 kB, Adobe PDF)
Title: Streaming VR for Immersion: Quality aspects of Compressed Spatial Audio
Authors: Narbutt, Miroslaw
O’Leary, Sean
Allen, Andrew
Skoglund, Jan
Hines, Andrew
Permanent link: http://hdl.handle.net/10197/9817
Date: 5-Nov-2017
Online since: 2019-04-04T10:11:14Z
Abstract: Delivering a 360-degree soundscape that matches full sphere visuals is an essential aspect of immersive VR. Ambisonics is a full sphere surround sound technique that takes into account the azimuth and elevation of sound sources, portraying source location above and below as well as around the horizontal plane of the listener. In contrast to channel-based methods, ambisonics representation offers the advantage of being independent of a specific loudspeaker set-up. Streaming ambisonics over networks requires efficient encoding techniques that compress the raw audio content without compromising quality of experience (QoE). This work investigates the effect of audio channel compression via the OPUS 1.2 codec on the quality of spatial audio as perceived by listeners. In particular, we evaluate the listening quality and localization accuracy of first-order ambisonic audio (FOA) and third-order ambisonic audio (HOA) compressed at various bitrates (32, 64 and 128 kbps for FOA; 128, 256 and 512 kbps for HOA). To assess the impact of OPUS compression on spatial audio, a number of subjective listening tests were carried out. The sample set for the tests comprises both recorded and synthetic audio clips with a wide range of time-frequency characteristics. To evaluate the localization accuracy of compressed audio, a number of fixed and dynamic (moving vertically and horizontally) source positions were selected for the test samples. The results show that for compressed spatial audio, perceived quality and localization accuracy are influenced more by compression scheme, bitrate and ambisonic order than by sample content. The insights provided by this work into factors and parameters influencing QoE will guide future development of an objective spatial audio quality metric.
Funding Details: European Commission
Science Foundation Ireland
Type of material: Conference Publication
Publisher: International Society on Virtual Systems and MultiMedia
Keywords: Virtual reality; Spatial audio; Ambisonics; Audio coding; Video compression; Opus codec; MUSHRA
Other versions: http://vsmm.org/belfast/
Language: en
Status of Item: Peer reviewed
Conference Details: 23rd International Conference on Virtual Systems and Multimedia: Through the Looking Glass - Back to the Future of Virtual Reality, Belfast, Northern Ireland, 3-5 November 2017
Appears in Collections: Computer Science Research Collection

This item is available under the Attribution-NonCommercial-NoDerivs 3.0 Ireland licence. No item may be reproduced for commercial purposes. For other possible restrictions on use, please refer to the publisher's URL where this is made available, or to notes contained in the item itself. Other terms may apply.