Full-length transcriptome assembly from RNA-Seq data without a reference genome
Top Cited Papers
Open Access
- 15 May 2011
- journal article
- research article
- Published by Springer Nature in Nature Biotechnology
- Vol. 29 (7), 644-652
- https://doi.org/10.1038/nbt.1883
Abstract
Reconstructing full-length transcripts from high-throughput RNA sequencing data is difficult without a reference genome sequence. Grabherr et al. describe Trinity, an algorithm for assembling full-length transcripts from short reads without first mapping the reads to a genome sequence. Massively parallel sequencing of cDNA has enabled deep and efficient probing of transcriptomes. Current approaches for transcript reconstruction from such data often rely on aligning reads to a reference genome, and are thus unsuitable for samples with a partial or missing reference genome. Here we present the Trinity method for de novo assembly of full-length transcripts and evaluate it on samples from fission yeast, mouse and whitefly, whose reference genome is not yet available. By efficiently constructing and analyzing sets of de Bruijn graphs, Trinity fully reconstructs a large fraction of transcripts, including alternatively spliced isoforms and transcripts from recently duplicated genes. Compared with other de novo transcriptome assemblers, Trinity recovers more full-length transcripts across a broad range of expression levels, with a sensitivity similar to methods that rely on genome alignments. Our approach provides a unified solution for transcriptome reconstruction in any sample, especially in the absence of a reference genome.Keywords
This publication has 35 references indexed in Scilit:
- Comprehensive comparative analysis of strand-specific RNA sequencing methodsNature Methods, 2010
- Ab initio reconstruction of cell type–specific transcriptomes in mouse reveals the conserved multi-exonic structure of lincRNAsNature Biotechnology, 2010
- Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiationNature Biotechnology, 2010
- Genome-wide synteny through highly sensitive sequence alignment: SatsumaBioinformatics, 2010
- Ab initio construction of a eukaryotic transcriptome by massively parallel mRNA sequencingProceedings of the National Academy of Sciences, 2009
- Bidirectional promoters generate pervasive transcription in yeastNature, 2009
- Alternative isoform regulation in human tissue transcriptomesNature, 2008
- Natural history and evolutionary principles of gene duplication in fungiNature, 2007
- Understanding alternative splicing: towards a cellular codeNature Reviews Molecular Cell Biology, 2005
- BLAT—The BLAST-Like Alignment ToolGenome Research, 2002