Options
Speech Quality Factors for Traditional and Neural-Based Low Bit Rate Vocoders
Date Issued
2020-05-28
Date Available
2021-05-26T11:54:57Z
Abstract
This study compares the performances of different algorithms for coding speech at low bit rates. In addition to widely deployed traditional vocoders, a selection of recently developed generative-model-based coders at different bit rates are contrasted. Performance analysis of the coded speech is evaluated for different quality aspects: accuracy of pitch periods estimation, the word error rates for automatic speech recognition, and the influence of speaker gender and coding delays. A number of performance metrics of speech samples taken from a publicly available database were compared with subjective scores. Results from subjective quality assessment do not correlate well with existing full reference speech quality metrics. The results provide valuable insights into aspects of the speech signal that will be used to develop a novel metric to accurately predict speech quality from generative-model-based coders.
Sponsorship
Science Foundation Ireland
Other Sponsorship
Insight Research Centre
Type of Material
Conference Publication
Publisher
IEEE
Copyright (Published Version)
2020 IEEE
Language
English
Status of Item
Peer reviewed
Conference Details
International Conference on Quality of Multimedia Experience (QoMEX), Dublin, Ireland, 26-28 May 2020
ISBN
978-1-7281-5965-2
This item is made available under a Creative Commons License
File(s)
Loading...
Name
insight_publication.pdf
Size
881.8 KB
Format
Adobe PDF
Checksum (MD5)
c57af832b9f7f81606b4009acb810ae0
Owning collection
Mapped collections