Repository logo
  • Log In
    New user? Click here to register.Have you forgotten your password?
University College Dublin
  • Colleges & Schools
  • Statistics
  • All of DSpace
  • Log In
    New user? Click here to register.Have you forgotten your password?
  1. Home
  2. College of Science
  3. School of Mathematics and Statistics
  4. Mathematics and Statistics Research Collection
  5. Probabilistic principal component analysis for metabolomic data
 
  • Details
Options

Probabilistic principal component analysis for metabolomic data

File(s)
FileDescriptionSizeFormat
Download NyamundandaBrennanGormley2010.pdf355.83 KB
Author(s)
Nyamundanda, Gift 
Brennan, Lorraine 
Gormley, Isobel Claire 
Uri
http://hdl.handle.net/10197/2835
Date Issued
23 November 2010
Date Available
10T11:43:31Z March 2011
Abstract
Background: Data from metabolomic studies are typically complex and high-dimensional. Principal component analysis (PCA) is currently the most widely used statistical technique for analyzing metabolomic data. However, PCA is limited by the fact that it is not based on a statistical model. Results: Here, probabilistic principal component analysis (PPCA) which addresses some of the limitations of PCA, is reviewed and extended. A novel extension of PPCA, called probabilistic principal component and covariates analysis (PPCCA), is introduced which provides a flexible approach to jointly model metabolomic data and additional covariate information. The use of a mixture of PPCA models for discovering the number of inherent groups in metabolomic data is demonstrated. The jackknife technique is employed to construct confidence intervals for estimated model parameters throughout. The optimal number of principal components is determined through the use of the Bayesian Information Criterion model selection tool, which is modified to address the high dimensionality of the data. Conclusions: The methods presented are illustrated through an application to metabolomic data sets. Jointly modeling metabolomic data and covariates was successfully achieved and has the potential to provide deeper insight to the underlying data structure. Examination of confidence intervals for the model parameters, such as loadings, allows for principled and clear interpretation of the underlying data structure. A software package called MetabolAnalyze, freely available through the R statistical software, has been developed to facilitate implementation of the presented methods in the metabolomics field.
Sponsorship
Irish Research Council for Science, Engineering and Technology
Health Research Board
Type of Material
Journal Article
Publisher
BioMed Central
Journal
BMC Bioinformatics
Volume
11
Issue
571
Copyright (Published Version)
2010 Nyamundanda et al
Keywords
  • Probabilistic PCA

  • Metabolomics

Subject – LCSH
Principal components analysis
Metabolites--Analysis
DOI
10.1186/1471-2105-11-571
Web versions
http://www.biomedcentral.com/1471-2105/11/571
Language
English
Status of Item
Peer reviewed
ISSN
1471-2105
This item is made available under a Creative Commons License
https://creativecommons.org/licenses/by-nc-sa/1.0/
Owning collection
Mathematics and Statistics Research Collection
Scopus© citations
95
Acquisition Date
Jan 24, 2023
View Details
Views
1644
Acquisition Date
Jan 26, 2023
View Details
Downloads
340
Last Week
2
Acquisition Date
Jan 26, 2023
View Details
google-scholar
University College Dublin Research Repository UCD
The Library, University College Dublin, Belfield, Dublin 4
Phone: +353 (0)1 716 7583
Fax: +353 (0)1 283 7667
Email: mailto:research.repository@ucd.ie
Guide: http://libguides.ucd.ie/rru

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement