Options
Exploring Composite Dataset Biases for Heart Sound Classification
Date Issued
2020-12-08
Date Available
2024-02-13T11:24:06Z
Abstract
In the last few years, the automatic classification of heart sounds has been widely studied as a screening method for heart disease. Some of these studies have achieved high accuracies in heart abnormality prediction. However, for such models to assist clinicians in the detection of heart abnormalities, it is of critical importance that they are generalisable, working on unseen real-world data. Despite the importance of generalisability, the presence of bias in the leading heart sound datasets used in these studies has remained unexplored. In this paper, we explore the presence of potential bias in heart sound datasets. Using a small set of spectral features for heart sound representation, we demonstrate experimentally that it is possible to detect sub-datasets of PhysioNet, the leading dataset of the field, with 98% accuracy. We also show that sensors which have been used to capture recordings of each dataset are likely the main cause of the bias in these datasets. Lack of awareness of this bias works against generalised models for heart sound diagnostics. Our findings call for further research on the bias issue in heart sound datasets and its impact on the generalisability of heart abnormality prediction models.
Sponsorship
Science Foundation Ireland
Other Sponsorship
Insight Research Centre
Type of Material
Conference Publication
Publisher
CEUR Workshop Proceedings
Series
CEUR Workshop Proceedings
2771
Copyright (Published Version)
2020 the Authors
Language
English
Status of Item
Peer reviewed
Journal
Longo, L., Rizzo, L., Hunter, E. and Pakrashi, A. (eds.). AICS2020: Artificial Intelligence and Cognitive Science, Proceedings of The 28th Irish Conference on Artificial Intelligence and Cognitive Science Dublin, Republic of Ireland, December 7-8, 2020
Conference Details
The 28th Irish Conference on Artificial Intelligence and Cognitive Science (AICS 2020), Technical University Dublin, Ireland (held online due to coronavirus outbreak), 7-8 December 2020
ISSN
1613-0073
This item is made available under a Creative Commons License
File(s)
Loading...
Name
Exploring Composite Dataset Biases for Heart Sound Classification.pdf
Size
494.31 KB
Format
Adobe PDF
Checksum (MD5)
3173e65432d36ddeba0747dc768239a5
Owning collection