Repository logo
  • Log In
    New user? Click here to register.Have you forgotten your password?
University College Dublin
  • Colleges & Schools
  • Statistics
  • All of DSpace
  • Log In
    New user? Click here to register.Have you forgotten your password?
  1. Home
  2. College of Health and Agricultural Sciences
  3. School of Medicine
  4. Medicine Research Collection
  5. Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega
 
  • Details
Options

Fast, scalable generation of high-quality protein multiple sequence alignments using Clustal Omega

File(s)
FileDescriptionSizeFormat
Download ClustalOmegapaper-mod.pdf428.98 KB
Author(s)
Sievers, Fabian 
Wilm, Andreas 
Dineen, David 
Higgins, Desmond G 
et al. 
Uri
http://hdl.handle.net/10197/7320
Date Issued
11 October 2011
Date Available
18T10:32:12Z December 2015
Abstract
Multiple sequence alignments are fundamental to many sequence analysis methods. Most alignments are computed using the progressive alignment heuristic. These methods are starting to become a bottleneck in some analysis pipelines when faced with data sets of the size of many thousands of sequences. Some methods allow computation of larger data sets while sacrificing quality, and others produce high-quality alignments, but scale badly with the number of sequences. In this paper, we describe a new program called Clustal Omega, which can align virtually any number of protein sequences quickly and that delivers accurate alignments. The accuracy of the package on smaller test cases is similar to that of the high-quality aligners. On larger data sets, Clustal Omega outperforms other packages in terms of execution time and quality. Clustal Omega also has powerful features for adding sequences to and exploiting information in existing alignments, making use of the vast amount of precomputed information in public databases like Pfam.
Sponsorship
Science Foundation Ireland
Type of Material
Journal Article
Publisher
EMBO Press
Journal
Molecular Systems Biology
Volume
7
Issue
539
Start Page
1
End Page
6
Copyright (Published Version)
2011 EMBO and Macmillan Publishers Limited
Keywords
  • Bioinformatics

  • Hidden Markov models

  • Multiple sequence ali...

DOI
10.1038/msb.2011.75
Language
English
Status of Item
Peer reviewed
This item is made available under a Creative Commons License
https://creativecommons.org/licenses/by-nc-nd/3.0/ie/
Owning collection
Medicine Research Collection
Scopus© citations
9036
Acquisition Date
Jan 28, 2023
View Details
Views
2255
Last Week
1
Last Month
43
Acquisition Date
Jan 28, 2023
View Details
Downloads
519
Last Week
1
Last Month
132
Acquisition Date
Jan 28, 2023
View Details
google-scholar
University College Dublin Research Repository UCD
The Library, University College Dublin, Belfield, Dublin 4
Phone: +353 (0)1 716 7583
Fax: +353 (0)1 283 7667
Email: mailto:research.repository@ucd.ie
Guide: http://libguides.ucd.ie/rru

Built with DSpace-CRIS software - Extension maintained and optimized by 4Science

  • Cookie settings
  • Privacy policy
  • End User Agreement