Repository logo
  • Log In
    New user? Click here to register.Have you forgotten your password?
University College Dublin
    Colleges & Schools
    Statistics
    All of DSpace
  • Log In
    New user? Click here to register.Have you forgotten your password?
  1. Home
  2. College of Science
  3. School of Computer Science
  4. Computer Science Research Collection
  5. SCL-Epred: A generalised de novo eukaryotic protein subcellular localisation predictor
 
  • Details
Options

SCL-Epred: A generalised de novo eukaryotic protein subcellular localisation predictor

Author(s)
Mooney, Catherine  
Cessieux, Amélie  
Shields, Denis C.  
Pollastri, Gianluca  
Uri
http://hdl.handle.net/10197/12167
Date Issued
2013-08
Date Available
2021-05-18T12:16:30Z
Abstract
Knowledge of the subcellular location of a protein provides valuable information about its function, possible interaction with other proteins and drug targetability, among other things. The experimental determination of a protein's location in the cell is expensive, time consuming and open to human error. Fast and accurate predictors of subcellular location have an important role to play if the abundance of sequence data which is now available is to be fully exploited. In the post-genomic era, genomes in many diverse organisms are available. Many of these organisms are important in human and veterinary disease and fall outside of the well-studied plant, animal and fungi groups. We have developed a general eukaryotic subcellular localisation predictor (SCL-Epred) which predicts the location of eukaryotic proteins into three classes which are important, in particular, for determining the drug targetability of a protein - secreted proteins, membrane proteins and proteins that are neither secreted nor membrane. The algorithm powering SCL-Epred is a N-to-1 neural network and is trained on very large non-redundant sets of protein sequences. SCL-Epred performs well on training data achieving a Q of 86 % and a generalised correlation of 0.75 when tested in tenfold cross-validation on a set of 15,202 redundancy reduced protein sequences. The three class accuracy of SCL-Epred and LocTree2, and in particular a consensus predictor comprising both methods, surpasses that of other widely used predictors when benchmarked using a large redundancy reduced independent test set of 562 proteins. SCL-Epred is publicly available at http://distillf.ucd.ie/distill/.
Sponsorship
Science Foundation Ireland
Type of Material
Journal Article
Publisher
Springer
Journal
Amino Acids
Volume
45
Issue
2
Start Page
291
End Page
299
Copyright (Published Version)
2013 Springer
Subjects

Subcellular fractions...

Eukaryotic cells

Humans

Proteins

Membrane proteins

Proteome

Computational biology...

Amino acid sequence

Algorithms

DOI
10.1007/s00726-013-1491-3
Language
English
Status of Item
Peer reviewed
ISSN
0939-4451
This item is made available under a Creative Commons License
https://creativecommons.org/licenses/by-nc-nd/3.0/ie/
File(s)
Loading...
Thumbnail Image
Name

Mooney_2012.pdf

Size

257.43 KB

Format

Adobe PDF

Checksum (MD5)

bf8d8e7b360549d15ab62aac373a97ba

Owning collection
Computer Science Research Collection
Mapped collections
CASL Research Collection•
Conway Institute Research Collection•
Medicine Research Collection

Item descriptive metadata is released under a CC-0 (public domain) license: https://creativecommons.org/public-domain/cc0/.
All other content is subject to copyright.

For all queries please contact research.repository@ucd.ie.

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement