MOM: maximum oligonucleotide mapping
- 19 February 2009
- journal article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 25 (7), 969-970
- https://doi.org/10.1093/bioinformatics/btp092
Abstract
Current short read mapping programs are based on the reasonable premise that most sequencing errors occur near the 3(') end of the read. These programs map reads with either a small number of mismatches in the entire read, or a small number of mismatches in the segment remaining after trimming bases from the 3(') end or a single base from the 5(') end. Though multiple sequencing errors most likely occur near the 3(') end of the reads, they can still occur at the 5(') end of the reads. Trimming from the 3(') end will not be able to map these reads. We have developed a program, Maximum Oligonucleotide Mapping (MOM), based on the concept of query matching that is designed to capture a maximal length match within the short read satisfying the user defined error parameters. This query matching approach thus accommodates multiple sequencing errors at both ends. We demonstrate that this technique achieves greater sensitivity and a higher percentage of uniquely mapped reads when compared to existing programs such as SOAP, MAQ and SHRiMP. Software and Test Data http://mom.csbc.vcu.edu.Keywords
This publication has 3 references indexed in Scilit:
- Substantial biases in ultra-short read data sets from high-throughput DNA sequencingNucleic Acids Research, 2008
- SOAP: short oligonucleotide alignment programBioinformatics, 2008
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997