Probabilistic principal component analysis for metabolomic data

Title: Probabilistic principal component analysis for metabolomic data
Authors: Nyamundanda, Gift
Brennan, Lorraine
Gormley, Isobel Claire
Date: 23-Nov-2010
Abstract: Background: Data from metabolomic studies are typically complex and high-dimensional. Principal component analysis (PCA) is currently the most widely used statistical technique for analyzing metabolomic data. However, PCA is limited by the fact that it is not based on a statistical model. Results: Here, probabilistic principal component analysis (PPCA), which addresses some of the limitations of PCA, is reviewed and extended. A novel extension of PPCA, called probabilistic principal component and covariates analysis (PPCCA), is introduced, which provides a flexible approach to jointly modeling metabolomic data and additional covariate information. The use of a mixture of PPCA models for discovering the number of inherent groups in metabolomic data is demonstrated. The jackknife technique is employed throughout to construct confidence intervals for estimated model parameters. The optimal number of principal components is determined using the Bayesian Information Criterion model selection tool, which is modified to address the high dimensionality of the data. Conclusions: The methods presented are illustrated through an application to metabolomic data sets. Jointly modeling metabolomic data and covariates was successfully achieved and has the potential to provide deeper insight into the underlying data structure. Examination of confidence intervals for the model parameters, such as loadings, allows for principled and clear interpretation of the underlying data structure. A software package called MetabolAnalyze, freely available through the R statistical software, has been developed to facilitate implementation of the presented methods in the metabolomics field.
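Illustrative sketch: To make the abstract's central idea concrete, the following Python sketch fits a basic PPCA model by the standard closed-form maximum likelihood solution and compares candidate numbers of components with the ordinary BIC. It is an illustration under stated assumptions only: it does not use or reproduce the MetabolAnalyze package, it uses the standard BIC rather than the modified criterion described in the abstract, it covers neither PPCCA, mixtures of PPCA, nor the jackknife, and the function names (`ppca_ml`, `ppca_loglik`, `ppca_bic`) and the synthetic data are hypothetical.

```python
import numpy as np


def ppca_ml(X, q):
    """Closed-form maximum likelihood fit of a q-component PPCA model.

    Model assumed here: x = W z + mu + noise, with z ~ N(0, I_q) and
    noise ~ N(0, sigma2 * I_p). Requires 1 <= q < p.
    """
    n, p = X.shape
    mu = X.mean(axis=0)
    Xc = X - mu
    S = Xc.T @ Xc / n                      # ML estimate of the covariance
    eigvals, eigvecs = np.linalg.eigh(S)   # eigh returns ascending order
    order = np.argsort(eigvals)[::-1]
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    sigma2 = eigvals[q:].mean()            # average of discarded eigenvalues
    scale = np.sqrt(np.maximum(eigvals[:q] - sigma2, 0.0))
    W = eigvecs[:, :q] * scale             # loadings, up to rotation
    return mu, W, sigma2


def ppca_loglik(X, mu, W, sigma2):
    """Gaussian log-likelihood with covariance C = W W^T + sigma2 * I."""
    n, p = X.shape
    C = W @ W.T + sigma2 * np.eye(p)
    Xc = X - mu
    _, logdet = np.linalg.slogdet(C)
    quad = np.sum(Xc * np.linalg.solve(C, Xc.T).T)
    return -0.5 * (n * p * np.log(2 * np.pi) + n * logdet + quad)


def ppca_bic(X, q):
    """Standard BIC in the convention 2*loglik - k*log(n); larger is better."""
    n, p = X.shape
    mu, W, sigma2 = ppca_ml(X, q)
    # Free parameters: loadings (minus rotational freedom), mean, noise variance.
    k = p * q - q * (q - 1) / 2 + p + 1
    return 2 * ppca_loglik(X, mu, W, sigma2) - k * np.log(n)


# Synthetic example (not metabolomic data): 2 latent components, 8 variables.
rng = np.random.default_rng(0)
W_true = 3.0 * rng.normal(size=(8, 2))
X_demo = rng.normal(size=(500, 2)) @ W_true.T + 0.5 * rng.normal(size=(500, 8))
bic_by_q = {q: ppca_bic(X_demo, q) for q in range(1, 6)}
best_q = max(bic_by_q, key=bic_by_q.get)
print("BIC by q:", bic_by_q)
print("Selected q:", best_q)
```

In this convention the candidate with the largest BIC is selected. The example deliberately uses more observations than variables; metabolomic data often have the reverse, which is the setting the modified criterion in the abstract is said to address.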
Funding Details: Irish Research Council for Science, Engineering and Technology
Health Research Board
Type of material: Journal Article
Publisher: BioMed Central
Journal: BMC Bioinformatics
Volume: 11
Issue: 571
Copyright (published version): 2010 Nyamundanda et al
Keywords: Probabilistic PCA; Metabolomics
Subject LCSH: Principal components analysis
DOI: 10.1186/1471-2105-11-571
Language: en
Status of Item: Peer reviewed
Appears in Collections: Mathematics and Statistics Research Collection


This item is licensed under a Creative Commons License.