Options
Variable selection and updating in model-based discriminant analysis for high dimensional data with food authenticity applications
Date Issued
2010-03
Date Available
2011-03-31T10:24:57Z
Abstract
Food authenticity studies are concerned with determining if food samples have been correctly labelled or not. Discriminant analysis methods are an integral part of the methodology for food authentication. Motivated by food authenticity applications, a model-based discriminant analysis method that includes variable selection is presented. The discriminant analysis model is
fitted in a semi-supervised manner using both labeled and unlabeled data. The method is shown to give excellent classification
performance on several high-dimensional multiclass food authenticity datasets
with more variables than observations. The variables selected by the
proposed method provide information about which variables are meaningful for classification purposes. A headlong search strategy for variable selection is shown to be efficient in terms of
computation and achieves excellent classification performance. In
applications to several food authenticity datasets, our proposed
method outperformed default implementations of Random Forests, AdaBoost, transductive SVMs and Bayesian Multinomial Regression by substantial
margins.
fitted in a semi-supervised manner using both labeled and unlabeled data. The method is shown to give excellent classification
performance on several high-dimensional multiclass food authenticity datasets
with more variables than observations. The variables selected by the
proposed method provide information about which variables are meaningful for classification purposes. A headlong search strategy for variable selection is shown to be efficient in terms of
computation and achieves excellent classification performance. In
applications to several food authenticity datasets, our proposed
method outperformed default implementations of Random Forests, AdaBoost, transductive SVMs and Bayesian Multinomial Regression by substantial
margins.
Sponsorship
Science Foundation Ireland
Type of Material
Journal Article
Publisher
Institute of Mathematical Statistics
Journal
Annals of Applied Statistics
Volume
4
Issue
1
Start Page
396
End Page
421
Copyright (Published Version)
2010 The Institute of Mathematical Statistics
Subject – LCSH
Discriminant analysis
Food law and legislation
Food--Labeling
Web versions
Language
English
Status of Item
Peer reviewed
This item is made available under a Creative Commons License
File(s)
Loading...
Name
varselect-headlongonly-final.pdf
Size
456.48 KB
Format
Adobe PDF
Checksum (MD5)
79b2a058ddb67a892fc0b49b00fc70f0
Owning collection