Options
Profile-based short linear protein motif discovery
Author(s)
Date Issued
2012-05-18
Date Available
2012-09-05T15:37:09Z
Abstract
Background
Short linear protein motifs are attracting increasing attention as functionally independent sites, typically 3-10 amino acids in length that are enriched in disordered regions of proteins. Multiple methods have recently been proposed to discover over-represented motifs within a set of proteins based on simple regular expressions. Here, we extend these approaches to profile-based methods, which provide a richer motif representation.
Results
The profile motif discovery method MEME performed relatively poorly for motifs in disordered regions of proteins. However, when we applied evolutionary weighting to account for redundancy amongst homologous proteins, and masked out poorly conserved regions of disordered proteins, the performance of MEME is equivalent to that of regular expression methods. However, the two approaches returned different subsets within both a benchmark dataset, and a more realistic discovery dataset.
Conclusions
Profile-based motif discovery methods complement regular expression based methods. Whilst profile-based methods are computationally more intensive, they are likely to discover motifs currently overlooked by regular expression methods.
Short linear protein motifs are attracting increasing attention as functionally independent sites, typically 3-10 amino acids in length that are enriched in disordered regions of proteins. Multiple methods have recently been proposed to discover over-represented motifs within a set of proteins based on simple regular expressions. Here, we extend these approaches to profile-based methods, which provide a richer motif representation.
Results
The profile motif discovery method MEME performed relatively poorly for motifs in disordered regions of proteins. However, when we applied evolutionary weighting to account for redundancy amongst homologous proteins, and masked out poorly conserved regions of disordered proteins, the performance of MEME is equivalent to that of regular expression methods. However, the two approaches returned different subsets within both a benchmark dataset, and a more realistic discovery dataset.
Conclusions
Profile-based motif discovery methods complement regular expression based methods. Whilst profile-based methods are computationally more intensive, they are likely to discover motifs currently overlooked by regular expression methods.
Sponsorship
Science Foundation Ireland
Type of Material
Journal Article
Publisher
BioMed Central
Journal
BMC Bioinformatics
Volume
13
Issue
104
Copyright (Published Version)
2012 Haslam and Shields
Subjects
Subject – LCSH
Protein-protein interactions
Proteins
Language
English
Status of Item
Peer reviewed
ISSN
1471-2105
This item is made available under a Creative Commons License
File(s)
Loading...
Name
Haslam, Shields - 2012.pdf
Size
193.49 KB
Format
Adobe PDF
Checksum (MD5)
96968946098431874410c455131b08e8
Owning collection
Mapped collections