Repository logo
  • Log In
    New user? Click here to register.Have you forgotten your password?
University College Dublin
  • Colleges & Schools
  • Statistics
  • All of DSpace
  • Log In
    New user? Click here to register.Have you forgotten your password?
  1. Home
  2. College of Health and Agricultural Sciences
  3. School of Medicine
  4. Medicine Research Collection
  5. Making automated multiple alignments of very large numbers of protein sequences
 
  • Details
Options

Making automated multiple alignments of very large numbers of protein sequences

File(s)
FileDescriptionSizeFormat
Download Bioinformatics-2013-Sievers-989-95.pdf710.68 KB
Author(s)
Sievers, Fabian 
Dineen, David 
Wilm, Andreas 
Higgins, Desmond G 
Uri
http://hdl.handle.net/10197/7308
Date Issued
21 February 2013
Date Available
16T10:07:01Z December 2015
Abstract
Motivation: Recent developments in sequence alignment software have made possible multiple sequence alignments (MSAs) of >100 000 sequences in reasonable times. At present, there are no systematic analyses concerning the scalability of the alignment quality as the number of aligned sequences is increased. Results: We benchmarked a wide range of widely used MSA packages using a selection of protein families with some known structures and found that the accuracy of such alignments decreases markedly as the number of sequences grows. This is more or less true of all packages and protein families. The phenomenon is mostly due to the accumulation of alignment errors, rather than problems in guide-tree construction. This is partly alleviated by using iterative refinement or selectively adding sequences. The average accuracy of progressive methods by comparison with structure-based benchmarks can be improved by incorporating information derived from high-quality structural alignments of sequences with solved structures. This suggests that the availability of high quality curated alignments will have to complement algorithmic and/or software developments in the long-term.
Sponsorship
Science Foundation Ireland
Type of Material
Journal Article
Publisher
Oxford University Press
Journal
Bioinformatics
Volume
29
Issue
8
Start Page
989
End Page
995
Copyright (Published Version)
2013 the Author
Keywords
  • DNA sequencing

  • Sequence analysis

DOI
10.1093/bioinformatics/btt093
Language
English
Status of Item
Peer reviewed
This item is made available under a Creative Commons License
https://creativecommons.org/licenses/by-nc-nd/3.0/ie/
Owning collection
Medicine Research Collection
Scopus© citations
42
Acquisition Date
Jan 29, 2023
View Details
Views
1341
Acquisition Date
Jan 29, 2023
View Details
Downloads
320
Last Month
104
Acquisition Date
Jan 29, 2023
View Details
google-scholar
University College Dublin Research Repository UCD
The Library, University College Dublin, Belfield, Dublin 4
Phone: +353 (0)1 716 7583
Fax: +353 (0)1 283 7667
Email: mailto:research.repository@ucd.ie
Guide: http://libguides.ucd.ie/rru

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement