Repository logo
  • Log In
    New user? Click here to register.Have you forgotten your password?
University College Dublin
  • Colleges & Schools
  • Statistics
  • All of DSpace
  • Log In
    New user? Click here to register.Have you forgotten your password?
  1. Home
  2. Institutes and Centres
  3. Insight Centre for Data Analytics
  4. Insight Research Collection
  5. Analyzing the performance of autoencoder-based objective quality metrics on audio-visual content
 
  • Details
Options

Analyzing the performance of autoencoder-based objective quality metrics on audio-visual content

File(s)
FileDescriptionSizeFormat
Download insight_publication.pdf2.95 MB
Author(s)
Martinez, Helard 
Farias, Mylène C.Q. 
Hines, Andrew 
Uri
http://hdl.handle.net/10197/11353
Date Issued
30 January 2020
Date Available
29T14:57:27Z April 2020
Abstract
The development of audio-visual quality models faces a number of challenges, including the integration of audio and video sensory channels and the modeling of their interaction characteristics. Commonly, objective quality metrics estimate the quality of a single component (audio or video) of the content. Machine learning techniques, such as autoencoders, offer as a very promising alternative to develop objective assessment models. This paper studies the performance of a group of autoencoder-based objective quality metrics on a diverse set of audio-visual content. To perform this test, we use a large dataset of audio-visual content (The UnB-AV database), which contains degradations in both audio and video components. The database has accompanying subjective scores collected on three separate subjective experiments. We compare our autoencoder-based methods, which take into account both audio and video components (multi-modal), against several objective (single-modal) audio and video quality metrics. The main goal of this work is to verify the gain or loss in performance of these single-modal metrics, when tested on audio-visual sequences.
Sponsorship
Science Foundation Ireland
Other Sponsorship
Insight Research Centre
Type of Material
Conference Publication
Publisher
Society for Imaging Science and Technology
Copyright (Published Version)
2020 Society for Imaging Science and Technology
Keywords
  • Audio quality

  • Video quality

  • Autoencoder

  • No-reference quality ...

  • Audio degradations

  • Video degradations

DOI
10.2352/ISSN.2470-1173.2020.9.IQSP-167
Language
English
Status of Item
Peer reviewed
Description
The 2020 IS&T International Symposium on Electronic Imaging (EI2020), Burlingame, California, 26-30 January 2020
ISSN
2470-1173
This item is made available under a Creative Commons License
https://creativecommons.org/licenses/by-nc-nd/3.0/ie/
Owning collection
Insight Research Collection
Scopus© citations
0
Acquisition Date
Feb 6, 2023
View Details
Views
591
Last Week
1
Last Month
1
Acquisition Date
Feb 6, 2023
View Details
Downloads
187
Last Week
5
Last Month
10
Acquisition Date
Feb 6, 2023
View Details
google-scholar
University College Dublin Research Repository UCD
The Library, University College Dublin, Belfield, Dublin 4
Phone: +353 (0)1 716 7583
Fax: +353 (0)1 283 7667
Email: mailto:research.repository@ucd.ie
Guide: http://libguides.ucd.ie/rru

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement