Detection of splice junctions from paired-end RNA-seq data by SpliceMap
Open Access
- 5 April 2010
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 38 (14), 4570-4578
- https://doi.org/10.1093/nar/gkq211
Abstract
Alternative splicing is a prevalent post-transcriptional process that is important to normal cellular function and is also involved in human diseases. Newly developed second-generation sequencing techniques provide high-throughput data (RNA-seq data) for studying alternative splicing events in different types of cells. Here, we present a computational method, SpliceMap, to detect splice junctions from RNA-seq data. This method does not depend on any existing annotation of gene structures and is capable of finding novel splice junctions with high sensitivity and specificity. It can handle long reads (50–100 nt) and can exploit paired-read information to improve mapping accuracy. Several parameters are included in the output to indicate the reliability of each predicted junction and to help filter out false predictions. We applied SpliceMap to analyze 23 million paired 50-nt reads from human brain tissue. The results show that, at this depth of sequencing, RNA-seq can support reliable detection of splice junctions, except for those present at very low levels. Compared to current methods, SpliceMap can achieve 12% higher sensitivity without sacrificing specificity.
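To make the idea of annotation-free junction detection concrete, the following is a minimal toy sketch of a generic split-read search. It is not SpliceMap's algorithm, which the abstract does not describe. The function name `find_junctions`, the parameters `min_intron` and `max_intron`, and the example sequences are all hypothetical. The sketch assumes exact matching, forward-strand reads, and canonical GT...AG introns, and it ignores paired-read information.

```python
def find_junctions(genome, read, min_intron=4, max_intron=50):
    """Toy split-read search (illustrative only, not SpliceMap itself).

    Returns (donor, acceptor) pairs of 0-based genome indices, where
    `donor` is the first intron base and `acceptor` is the first base
    of the downstream exon.
    """
    hits = []
    half = len(read) // 2
    seed = read[:half]  # assume the first half of the read lies in one exon
    start = genome.find(seed)
    while start != -1:
        # Extend the seed match base by base along the genome.
        ext = half
        while (ext < len(read) and start + ext < len(genome)
               and genome[start + ext] == read[ext]):
            ext += 1
        if ext < len(read):  # the read does not match contiguously here
            # The extension may overshoot the true junction, so try
            # every split point between the seed end and the mismatch.
            for k in range(half, ext + 1):
                donor = start + k
                if genome[donor:donor + 2] != "GT":
                    continue  # require the canonical donor signal
                rest = read[k:]
                last = min(donor + max_intron, len(genome) - len(rest))
                for acceptor in range(donor + min_intron, last + 1):
                    if (genome[acceptor - 2:acceptor] == "AG"
                            and genome[acceptor:acceptor + len(rest)] == rest):
                        hits.append((donor, acceptor))
        start = genome.find(seed, start + 1)
    return hits


# Hypothetical example: two exons separated by a 14-nt GT...AG intron.
exon1 = "CCATGACTTCAAGC"
intron = "GTAAACCCTTTTAG"
exon2 = "TGGACCATTCGGAA"
genome = "TTTT" + exon1 + intron + exon2 + "TTTT"
read = exon1[-8:] + exon2[:8]  # a read spanning the exon-exon junction

print(find_junctions(genome, read))  # [(18, 32)]
```

A practical tool would additionally tolerate mismatches, handle both strands, search much larger intron ranges, and score each candidate junction. The abstract's reliability parameters and paired-read constraints serve that role in SpliceMap.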