A Greedy Algorithm for Aligning DNA Sequences

Top Cited Papers

1 February 2000

journal article
research article
Published by Mary Ann Liebert Inc in Journal of Computational Biology

Vol. 7 (1-2), 203-214
https://doi.org/10.1089/10665270050081478

Abstract

For aligning DNA sequences that differ only by sequencing errors, or by equivalent errors from other sources, a greedy algorithm can be much faster than traditional dynamic programming approaches and yet produce an alignment that is guaranteed to be theoretically optimal. We introduce a new greedy alignment algorithm with particularly good performance and show that it computes the same alignment as does a certain dynamic programming algorithm, while executing over 10 times faster on appropriate data. An implementation of this algorithm is currently used in a program that assembles the UniGene database at the National Center for Biotechnology Information.

Keywords

This publication has 18 references indexed in Scilit:

Alignment of whole genomes
Nucleic Acids Research, 1999
A Computer Program for Aligning a cDNA Sequence with a Genomic DNA Sequence
Genome Research, 1998
Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence
Nature, 1998
Base-Calling of Automated Sequencer Traces UsingPhred. I. Accuracy Assessment
Genome Research, 1998
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
Nucleic Acids Research, 1997
Generation and analysis of 280,000 human expressed sequence tags.
Genome Research, 1996
Basic Local Alignment Search Tool
Journal of Molecular Biology, 1990
Sequence comparison with concave weighting functions
Bulletin of Mathematical Biology, 1988
A file comparison program
Software: Practice and Experience, 1985
An improved algorithm for matching biological sequences
Journal of Molecular Biology, 1982

Cited by 4259 articles