PatMaN: rapid alignment of short sequences to large databases
Open Access
- 8 May 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 24 (13), 1530-1531
- https://doi.org/10.1093/bioinformatics/btn223
Abstract
Summary: We present a tool suited for searching for many short nucleotide sequences in large databases, allowing for a predefined number of gaps and mismatches. The command-line-driven program implements a non-deterministic automata matching algorithm on a keyword tree of the search strings. Queries both with and without ambiguity codes can be searched. Search time is short for perfect matches, and retrieval time rises exponentially with the number of edits allowed.
Availability: The C++ source code for PatMaN is distributed under the GNU General Public License and has been tested on the GNU/Linux operating system. It is available from http://bioinf.eva.mpg.de/patman.
Contact: pruefer@eva.mpg.de
Supplementary information: Supplementary data are available at Bioinformatics online.
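The idea named in the summary, a keyword tree of the search strings traversed non-deterministically with a bounded number of edits, can be illustrated with a small sketch. This is not PatMaN's C++ implementation: the names `Node`, `build_trie`, `search`, `max_edits` and `max_gaps` are hypothetical, ambiguity codes are not handled, and redundant alignments of the same hit are not pruned. It assumes that a gap counts as one edit and that gaps are additionally capped by their own limit.

```python
from typing import Dict, List, Set, Tuple


class Node:
    """One node of the keyword tree (trie) built from the query strings."""

    def __init__(self) -> None:
        self.children: Dict[str, "Node"] = {}
        self.patterns: List[str] = []  # queries that end at this node


def build_trie(patterns: List[str]) -> Node:
    root = Node()
    for p in patterns:
        node = root
        for ch in p:
            node = node.children.setdefault(ch, Node())
        node.patterns.append(p)
    return root


def search(text: str, patterns: List[str],
           max_edits: int = 0, max_gaps: int = 0) -> Set[Tuple[str, int, int, int]]:
    """Return hits as (pattern, start, end, edits); end is exclusive.

    Illustrative only: every text position starts a fresh traversal, and
    each state (trie node, text position, edits, gaps) may branch into a
    match/mismatch step, a gap in the text, or a gap in the pattern.
    """
    root = build_trie(patterns)
    hits: Set[Tuple[str, int, int, int]] = set()
    for start in range(len(text)):
        stack = [(root, start, 0, 0)]
        seen = set()
        while stack:
            node, pos, edits, gaps = stack.pop()
            key = (id(node), pos, edits, gaps)
            if key in seen:
                continue
            seen.add(key)
            for p in node.patterns:
                hits.add((p, start, pos, edits))
            can_gap = edits < max_edits and gaps < max_gaps
            for ch, child in node.children.items():
                if pos < len(text):
                    # Match costs nothing, mismatch costs one edit.
                    cost = 0 if text[pos] == ch else 1
                    if edits + cost <= max_edits:
                        stack.append((child, pos + 1, edits + cost, gaps))
                if can_gap:
                    # Gap in the text: the pattern character has no partner.
                    stack.append((child, pos, edits + 1, gaps + 1))
            if can_gap and pos < len(text) and node is not root:
                # Gap in the pattern: the text character has no partner.
                stack.append((node, pos + 1, edits + 1, gaps + 1))
    return hits
```

With `max_edits=0` each traversal follows at most one path through the tree, which is consistent with the short search time reported for perfect matches; every additional edit lets each state branch several ways, which is where the exponential growth in retrieval time comes from.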
This publication has 5 references indexed in Scilit:
- Flexible Pattern Matching in Strings. Cambridge University Press (CUP), 2002
- Basic local alignment search tool. Journal of Molecular Biology, 1990
- A finite state machine algorithm for finding restriction sites and other pattern matching applications. Bioinformatics, 1988
- Improved programs for DNA and protein sequence analysis on the IBM personal computer and other standard computer systems. Nucleic Acids Research, 1986
- Efficient string matching. Communications of the ACM, 1975