Options
Instability in progressive multiple sequence alignment algorithms
Author(s)
Date Issued
2015-10-09
Date Available
2015-12-16T10:39:03Z
Abstract
Background: Progressive alignment is the standard approach used to align large numbers of sequences. As with all heuristics, this involves a trade off between alignment accuracy and computation time. Results: We examine this trade off and find that, because of a loss of information in the early steps of the approach, the alignments generated by the most common multiple sequence alignment programs are inherently unstable, and simply reversing the order of the sequences in the input file will cause a different alignment to be generated. Although this effect is more obvious with larger numbers of sequences, it can also be seen with data sets in the order of one hundred sequences. We also outline the means to determine the number of sequences in a data set beyond which the probability of instability will become more pronounced. Conclusions: This has major ramifications for both the designers of large-scale multiple sequence alignment algorithms, and for the users of these alignments.
Sponsorship
Science Foundation Ireland
Type of Material
Journal Article
Publisher
BioMed Central
Journal
Algorithms for molecular biology
Volume
10
Issue
26
Start Page
1
End Page
10
Copyright (Published Version)
2015 the Authors
Language
English
Status of Item
Peer reviewed
This item is made available under a Creative Commons License
File(s)
Loading...
Name
art%3A10.1186%2Fs13015-015-0057-1.pdf
Size
967.38 KB
Format
Adobe PDF
Checksum (MD5)
18e48f78e8018ee840ef71c61fa97188
Owning collection
Mapped collections