SeqMap: mapping massive amount of oligonucleotides to the genome
Top Cited Papers
- 12 August 2008
- journal article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 24 (20), 2395-2396
- https://doi.org/10.1093/bioinformatics/btn429
Abstract
SeqMap is a tool for mapping large amount of short sequences to the genome. It is designed for finding all the places in a reference genome where each sequence may come from. This task is essential to the analysis of data from ultra high-throughput sequencing machines. With a carefully designed index-filtering algorithm and an efficient implementation, SeqMap can map tens of millions of short sequences to a genome of several billions of nucleotides. Multiple substitutions and insertions/deletions of the nucleotide bases in the sequences can be tolerated and therefore detected. SeqMap supports FASTA input format and various output formats, and provides command line options for tuning almost every aspect of the mapping process. A typical mapping can be done in a few hours on a desktop PC. Parallel use of SeqMap on a cluster is also very straightforward.Keywords
This publication has 7 references indexed in Scilit:
- MADS: A new and improved method for analysis of differential alternative splicing by exon-tiling microarraysRNA, 2008
- Mapping and quantifying mammalian transcriptomes by RNA-SeqNature Methods, 2008
- Using quality scores and longer reads improves accuracy of Solexa read mappingBMC Bioinformatics, 2008
- SOAP: short oligonucleotide alignment programBioinformatics, 2008
- Detecting near-duplicates for web crawlingPublished by Association for Computing Machinery (ACM) ,2007
- BLAT—The BLAST-Like Alignment ToolGenome Research, 2002
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997