Repository logo
  • Log In
    New user? Click here to register.Have you forgotten your password?
University College Dublin
    Colleges & Schools
    Statistics
    All of DSpace
  • Log In
    New user? Click here to register.Have you forgotten your password?
  1. Home
  2. Institutes and Centres
  3. Insight Centre for Data Analytics
  4. Insight Research Collection
  5. GOexpress: an R/Bioconductor package for the identification and visualisation of robust gene ontology signatures through supervised learning of gene expression data
 
  • Details
Options

GOexpress: an R/Bioconductor package for the identification and visualisation of robust gene ontology signatures through supervised learning of gene expression data

Author(s)
Rue-Albrecht, Kévin  
McGettigan, Paul A.  
Hernández, Belinda  
Nalpas, Nicholas C.  
Magee, David A.  
Parnell, Andrew C.  
Gordon, Stephen V.  
MacHugh, David E.  
Uri
http://hdl.handle.net/10197/7882
Date Issued
2016-03-11
Date Available
2016-09-06T12:43:17Z
Abstract
Background: Identification of gene expression profiles that differentiate experimental groups is critical for discovery and analysis of key molecular pathways and also for selection of robust diagnostic or prognostic biomarkers. While integration of differential expression statistics has been used to refine gene set enrichment analyses, such approaches are typically limited to single gene lists resulting from simple two-group comparisons or time-series analyses. In contrast, functional class scoring and machine learning approaches provide powerful alternative methods to leverage molecular measurements for pathway analyses, and to compare continuous and multi-level categorical factors. Results: We introduce GOexpress, a software package for scoring and summarising the capacity of gene ontology features to simultaneously classify samples from multiple experimental groups. GOexpress integrates normalised gene expression data (e.g., from microarray and RNA-seq experiments) and phenotypic information of individual samples with gene ontology annotations to derive a ranking of genes and gene ontology terms using a supervised learning approach. The default random forest algorithm allows interactions between all experimental factors, and competitive scoring of expressed genes to evaluate their relative importance in classifying predefined groups of samples. Conclusions: GOexpress enables rapid identification and visualisation of ontology-related gene panels that robustly classify groups of samples and supports both categorical (e.g., infection status, treatment) and continuous (e.g., time-series, drug concentrations) experimental factors. The use of standard Bioconductor extension packages and publicly available gene ontology annotations facilitates straightforward integration of GOexpress within existing computational biology pipelines.
Sponsorship
Department of Agriculture, Food and the Marine
European Commission - Seventh Framework Programme (FP7)
Science Foundation Ireland
University College Dublin
Type of Material
Journal Article
Publisher
BioMed Central
Journal
BMC Bioinformatics
Volume
17
Issue
126
Start Page
1
End Page
12
Copyright (Published Version)
2016 the Authors
Subjects

Machine learning

Statistics

Gene expression

Gene ontology

Supervised learning

Classification

Microarray

RNA-sequencing

DOI
10.1186/s12859-016-0971-3
Language
English
Status of Item
Peer reviewed
This item is made available under a Creative Commons License
https://creativecommons.org/licenses/by-nc-nd/3.0/ie/
File(s)
Loading...
Thumbnail Image
Name

insight_publication.pdf

Size

1.51 MB

Format

Adobe PDF

Checksum (MD5)

aef427988df37135c3c0d39f6bec74d1

Owning collection
Insight Research Collection
Mapped collections
Agriculture and Food Science Research Collection•
Conway Institute Research Collection•
Mathematics and Statistics Research Collection•
Veterinary Medicine Research Collection

Item descriptive metadata is released under a CC-0 (public domain) license: https://creativecommons.org/public-domain/cc0/.
All other content is subject to copyright.

For all queries please contact research.repository@ucd.ie.

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement