Repository logo
  • Log In
    New user? Click here to register.Have you forgotten your password?
University College Dublin
    Colleges & Schools
    Statistics
    All of DSpace
  • Log In
    New user? Click here to register.Have you forgotten your password?
  1. Home
  2. College of Science
  3. School of Computer Science
  4. Computer Science Research Collection
  5. SCLpred : protein subcellular localization prediction by N-to-1 neural networks
 
  • Details
Options

SCLpred : protein subcellular localization prediction by N-to-1 neural networks

Author(s)
Mooney, Catherine  
Wang, Yong-Hong  
Pollastri, Gianluca  
Uri
http://hdl.handle.net/10197/3443
Date Issued
2011-08-27
Date Available
2012-01-20T14:46:58Z
Abstract
Knowledge of the subcellular location of a protein provides valuable information about its function and possible interaction with other proteins. In the post-genomic era, fast and accurate predictors of subcellular location are required if this abundance of sequence data is to be fully exploited. We have developed a subcellular localization predictor (SCLpred), which predicts the location of a protein into four classes for animals and fungi and five classes for plants (secreted, cytoplasm, nucleus, mitochondrion and chloroplast) using machine learning models trained on large non-redundant sets of protein sequences. The algorithm powering SCLpred is a novel Neural Network (N-to-1 Neural Network, or N1-NN) we have developed, which is capable of mapping whole sequences into single properties (a functional class, in this work) without resorting to predefined transformations, but rather by adaptively compressing the sequence into a hidden feature vector. We benchmark SCLpred against other publicly available predictors using two benchmarks including a new subset of Swiss-Prot Release 2010_06. We show that SCLpred surpasses the state of the art. The N1-NN algorithm is fully general and may be applied to a host of problems of similar shape, that is, in which a whole sequence needs to be mapped into a fixed-size array of properties, and the adaptive compression it operates may shed light on the space of protein sequences.
The predictive systems described in this article are publicly available as a web server at http://distill.ucd.ie/distill/.
Sponsorship
Science Foundation Ireland
Type of Material
Journal Article
Publisher
Oxford University Press
Journal
Bioinformatics
Volume
27
Issue
20
Start Page
2812
End Page
2819
Copyright (Published Version)
Oxford University Press 2011
Subjects

Subcellular localizat...

N-to-1 neural network...

Subject – LCSH
Proteins--Analysis
Neural networks (Computer science)
Amino acid sequence
DOI
10.1093/bioinformatics/btr494
Language
English
Status of Item
Peer reviewed
ISSN
1460-2059 (Online)
1367-4803 (Print)
This item is made available under a Creative Commons License
https://creativecommons.org/licenses/by-nc-sa/1.0/
File(s)
Loading...
Thumbnail Image
Name

Mooney_2011bio.pdf

Size

255.68 KB

Format

Adobe PDF

Checksum (MD5)

e05223b5d1ebda1af3aa89acda4f50e8

Owning collection
Computer Science Research Collection
Mapped collections
CASL Research Collection•
Conway Institute Research Collection

Item descriptive metadata is released under a CC-0 (public domain) license: https://creativecommons.org/public-domain/cc0/.
All other content is subject to copyright.

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement