Options
A robust approach to model-based classification based on trimming and constraints
File(s)
File | Description | Size | Format | |
---|---|---|---|---|
insight_publication.pdf | 2.29 MB |
Date Issued
14 August 2019
Date Available
26T07:01:50Z August 2019
Abstract
In a standard classification framework a set of trustworthy learning data are employed to build a decision rule, with the final aim of classifying unlabelled units belonging to the test set. Therefore, unreliable labelled observations, namely outliers and data with incorrect labels, can strongly undermine the classifier performance, especially if the training size is small. The present work introduces a robust modification to the Model-Based Classification framework, employing impartial trimming and constraints on the ratio between the maximum and the minimum eigenvalue of the group scatter matrices. The proposed method effectively handles noise presence in both response and exploratory variables, providing reliable classification even when dealing with contaminated datasets. A robust information criterion is proposed for model selection. Experiments on real and simulated data, artificially adulterated, are provided to underline the benefits of the proposed method.
Sponsorship
Science Foundation Ireland
Other Sponsorship
Insight Research Centre
Type of Material
Journal Article
Publisher
Springer
Journal
Advances in Data Analysis and Classification
Volume
14
Start Page
327
End Page
354
Copyright (Published Version)
2019 Springer
Language
English
Status of Item
Peer reviewed
ISSN
1862-5347
This item is made available under a Creative Commons License
Owning collection
Scopus© citations
7
Acquisition Date
Mar 27, 2023
Mar 27, 2023
Views
595
Acquisition Date
Mar 27, 2023
Mar 27, 2023
Downloads
255
Last Month
1
1
Acquisition Date
Mar 27, 2023
Mar 27, 2023