A Greedy Algorithm for Aligning DNA Sequences
Top Cited Papers
- 1 February 2000
- journal article
- research article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 7 (1-2), 203-214
- https://doi.org/10.1089/10665270050081478
Abstract
For aligning DNA sequences that differ only by sequencing errors, or by equivalent errors from other sources, a greedy algorithm can be much faster than traditional dynamic programming approaches and yet produce an alignment that is guaranteed to be theoretically optimal. We introduce a new greedy alignment algorithm with particularly good performance and show that it computes the same alignment as does a certain dynamic programming algorithm, while executing over 10 times faster on appropriate data. An implementation of this algorithm is currently used in a program that assembles the UniGene database at the National Center for Biotechnology Information.Keywords
This publication has 18 references indexed in Scilit:
- Alignment of whole genomesNucleic Acids Research, 1999
- A Computer Program for Aligning a cDNA Sequence with a Genomic DNA SequenceGenome Research, 1998
- Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequenceNature, 1998
- Base-Calling of Automated Sequencer Traces UsingPhred. I. Accuracy AssessmentGenome Research, 1998
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Generation and analysis of 280,000 human expressed sequence tags.Genome Research, 1996
- Basic Local Alignment Search ToolJournal of Molecular Biology, 1990
- Sequence comparison with concave weighting functionsBulletin of Mathematical Biology, 1988
- A file comparison programSoftware: Practice and Experience, 1985
- An improved algorithm for matching biological sequencesJournal of Molecular Biology, 1982