Playout again Sam: Jitter Buffer Playout Adjustments Still an Issue for Speech Quality Prediction Models?

Cinar, Yusuf; Pocta, Peter; Hines, Andrew

doi:10.1109/ISSC49989.2020.9180163

Playout again Sam: Jitter Buffer Playout Adjustments Still an Issue for Speech Quality Prediction Models?

Author(s)

Cinar, Yusuf

Pocta, Peter

Hines, Andrew

Uri

http://hdl.handle.net/10197/25683

Date Issued

2020-06-12

Date Available

2024-04-22T07:44:14Z

Abstract

Objective speech quality assessment techniques, which use the perceptual models to emulate the human listening perception, have seen several revisions in the recent years. This study investigates the evolution of POLQA and ViSQOL models and scrutinise their latest versions. Prior work had identified weaknesses in both prediction models when presented with speech containing imperceptible playout adjustments. This study follows up the experiments to evaluate the progress and report the progress and the current issues, benchmarked against subjective listening quality scores. The assessment is conducted for all published versions of the POLQA and ViSQOL models and the evolution and improvement offered is analysed. We can conclude that the models have been improved in terms of imperceptible jitter buffer adjustments highlighted in prior work. This study also explores the performance of objective quality models and intelligibility (STOI and POLQA Intelligibility) models for a data set produced with realistic but extreme WebRTC scenarios using a standard and novel WebRTC jitter buffer strategy. An expert listening test was conducted to subjectively evaluate the WebRTC data set. It is observed that the standard WebRTC jitter buffer strategy produces more natural speech while the novel approach offers better intelligibility. The subjective and objective quality results suggest that the speech quality for standard jitter buffer were lower but more consistent than for the novel jitter buffer. The objective intelligibility results were conflicting. A followup study will conduct independent subjective evaluations of quality and intelligibility to further explore the relationship between the objective intelligibility and quality results

Sponsorship

European Commission - European Regional Development Fund

Science Foundation Ireland

Other Sponsorship

Insight Research Centre

Type of Material

Conference Publication

Publisher

IEEE

Copyright (Published Version)

2020 IEEE

Subjects

Jitter

WebRTC

Predictive models

Computational modelin...

Benchmark testing

Delay estimation

Standards

DOI

10.1109/ISSC49989.2020.9180163

Web versions

https://www.issc.ie/conference

Language

English

Status of Item

Peer reviewed

Journal

2020 31st Irish Signals and Systems Conference (ISSC)

Conference Details

The 2020 31st Irish Signals and Systems Conference (ISSC), Letterkenny, Ireland (held online due to Coronavirus outbreak), 11-12 June 2020

ISBN

978-1-7281-9418-9

This item is made available under a Creative Commons License

https://creativecommons.org/licenses/by-nc-nd/3.0/ie/

Name

Playout again Sam- Jitter Buffer Playout Adjustments Still an Issue for Speech Quality Prediction Models_ .pdf

Size

247.49 KB

Format

Adobe PDF

Checksum (MD5)

73d855b4a95ea7368da73778354cf972

Owning collection

Insight Research Collection

Mapped collections

Computer Science Research Collection

Options

Playout again Sam: Jitter Buffer Playout Adjustments Still an Issue for Speech Quality Prediction Models?