Template-based Recognition of Natively Disordered Regions in Proteins

Author(s)
Vullo, Alessandro  
Roche, Cliona P.  
Pollastri, Gianluca  
URI
http://hdl.handle.net/10197/12405
Date Issued
2012-04
Date Available
2021-08-11T11:14:30Z
Abstract
Disordered proteins are increasingly recognised as a fundamental component of the cellular machinery. Parallel to this, the prediction of protein disorder by computational means has emerged as an aid to the investigation of protein functions. Although predictors of disorder have met with considerable success, it is increasingly clear that further improvements are most likely to come from additional sources of information, to complement patterns extracted from the primary sequence of a protein. In this article, a system for the prediction of protein disorder that relies both on sequence information and on structural information from homologous proteins of known structure (templates) is described. Structural information is introduced directly (as a further input to the predictor) and indirectly through highly reliable template-based predictions of structural features of the protein. The predictive system, based on Support Vector Machines, is tested by rigorous 5-fold cross validation on a large, non-redundant set of proteins extracted from the Protein Data Bank. In these tests the introduction of structural information, which is carefully weighed based on sequence identity between homologues and query, results in large improvements in prediction accuracy. The method, when re-trained on a 2004 version of the PDB, clearly outperforms the algorithms that ranked top at the 2006 CASP competition.
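The abstract states that template-derived structural information is weighted by the sequence identity between each homologue and the query, but it does not give the weighting scheme. The following is a minimal sketch, assuming a simple linear weighting. The function name `identity_weighted_profile`, the `Template` record layout, and the example values are all hypothetical and are not taken from the report.

```python
from typing import List, Optional, Sequence, Tuple

# Hypothetical template record (an assumption, not the report's data format):
# (sequence identity to the query in [0, 1],
#  per-residue disorder labels aligned to the query; None marks an alignment gap).
Template = Tuple[float, Sequence[Optional[int]]]


def identity_weighted_profile(
    templates: List[Template], query_length: int
) -> List[Optional[float]]:
    """Per-residue identity-weighted average of template disorder labels.

    Each template's label is weighted linearly by its sequence identity to
    the query (an assumed scheme), so closer homologues contribute more.
    Positions covered by no template yield None.
    """
    profile: List[Optional[float]] = []
    for pos in range(query_length):
        weighted_sum = 0.0
        total_weight = 0.0
        for identity, labels in templates:
            label = labels[pos]
            if label is None:
                continue  # this template has a gap at this query position
            weighted_sum += identity * label
            total_weight += identity
        profile.append(weighted_sum / total_weight if total_weight > 0 else None)
    return profile


# Illustrative values only: two templates aligned to a 4-residue query.
example_templates: List[Template] = [
    (0.9, [0, 0, 1, None]),  # close homologue
    (0.3, [1, 0, 1, None]),  # distant homologue
]
example_profile = identity_weighted_profile(example_templates, 4)
```

A profile of this kind could be supplied as an additional per-residue input next to sequence-derived features. Whether the report combines templates in this way is not stated in the abstract.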
Sponsorship
Health Research Board
Science Foundation Ireland
Type of Material
Technical Report
Publisher
University College Dublin. School of Computer Science and Informatics
Series
UCD CSI Technical Reports
ucd-csi-2012-01
Copyright (Published Version)
2012 the Authors
Subjects
Intrinsically disorder...
Functional proteomics...
Protein classificatio...
Machine learning
Web versions
https://web.archive.org/web/20080226040105/http:/csiweb.ucd.ie/Research/TechnicalReports.html
Language
English
Status of Item
Not peer reviewed
This item is made available under a Creative Commons License
https://creativecommons.org/licenses/by-nc-nd/3.0/ie/
File(s)
Name
ucd-csi-2012-01.pdf
Size
314.14 KB
Format
Adobe PDF
Checksum (MD5)
b2b96abb03455e08d4b622e39725d4ee
Owning collection
Computer Science and Informatics Technical Reports
Mapped collections
CASL Research Collection

Item descriptive metadata is released under a CC-0 (public domain) license: https://creativecommons.org/public-domain/cc0/.
All other content is subject to copyright.
