Options
Playout again Sam: Jitter Buffer Playout Adjustments Still an Issue for Speech Quality Prediction Models?
Author(s)
Date Issued
2020-06-12
Date Available
2024-04-22T07:44:14Z
Abstract
Objective speech quality assessment techniques, which use the perceptual models to emulate the human listening perception, have seen several revisions in the recent years. This study investigates the evolution of POLQA and ViSQOL models and scrutinise their latest versions. Prior work had identified weaknesses in both prediction models when presented with speech containing imperceptible playout adjustments. This study follows up the experiments to evaluate the progress and report the progress and the current issues, benchmarked against subjective listening quality scores. The assessment is conducted for all published versions of the POLQA and ViSQOL models and the evolution and improvement offered is analysed. We can conclude that the models have been improved in terms of imperceptible jitter buffer adjustments highlighted in prior work. This study also explores the performance of objective quality models and intelligibility (STOI and POLQA Intelligibility) models for a data set produced with realistic but extreme WebRTC scenarios using a standard and novel WebRTC jitter buffer strategy. An expert listening test was conducted to subjectively evaluate the WebRTC data set. It is observed that the standard WebRTC jitter buffer strategy produces more natural speech while the novel approach offers better intelligibility. The subjective and objective quality results suggest that the speech quality for standard jitter buffer were lower but more consistent than for the novel jitter buffer. The objective intelligibility results were conflicting. A followup study will conduct independent subjective evaluations of quality and intelligibility to further explore the relationship between the objective intelligibility and quality results
Sponsorship
European Commission - European Regional Development Fund
Science Foundation Ireland
Other Sponsorship
Insight Research Centre
Type of Material
Conference Publication
Publisher
IEEE
Copyright (Published Version)
2020 IEEE
Web versions
Language
English
Status of Item
Peer reviewed
Journal
2020 31st Irish Signals and Systems Conference (ISSC)
Conference Details
The 2020 31st Irish Signals and Systems Conference (ISSC), Letterkenny, Ireland (held online due to Coronavirus outbreak), 11-12 June 2020
ISBN
978-1-7281-9418-9
This item is made available under a Creative Commons License
File(s)
Loading...
Name
Playout again Sam- Jitter Buffer Playout Adjustments Still an Issue for Speech Quality Prediction Models_ .pdf
Size
247.49 KB
Format
Adobe PDF
Checksum (MD5)
73d855b4a95ea7368da73778354cf972
Owning collection
Mapped collections