Velvet: Algorithms for de novo short read assembly using de Bruijn graphs
- 18 March 2008
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 18 (5), 821-829
- https://doi.org/10.1101/gr.074492.107
Abstract
We have developed a new set of algorithms, collectively called “Velvet,” to manipulate de Bruijn graphs for genomic sequence assembly. A de Bruijn graph is a compact representation based on short words (k-mers) that is ideal for high coverage, very short read (25–50 bp) data sets. Applying Velvet to very short reads and paired-ends information only, one can produce contigs of significant length, up to 50-kb N50 length in simulations of prokaryotic data and 3-kb N50 on simulated mammalian BACs. When applied to real Solexa data sets without read pairs, Velvet generated contigs of ∼8 kb in a prokaryote and 2 kb in a mammalian BAC, in close agreement with our simulated results without read-pair information. Velvet represents a new approach to assembly that can leverage very short reads in combination with read pairs to produce useful assemblies.
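To make the two technical terms in the abstract concrete, the following is a minimal Python sketch of a k-mer de Bruijn graph and of the N50 statistic. It is an illustration only and not Velvet's implementation: the function names `build_de_bruijn` and `n50`, the toy reads, and the choice of k = 4 are all assumptions made for this example, and the sketch ignores reverse complements, sequencing errors, and read pairs, which a real assembler has to handle.

```python
from collections import defaultdict


def build_de_bruijn(reads, k):
    """Toy de Bruijn graph: each node is a k-mer, and an edge joins two
    k-mers that appear consecutively (overlapping by k-1 bases) in a read.
    Hypothetical helper for illustration, not Velvet's data structure."""
    graph = defaultdict(set)
    for read in reads:
        # Reads shorter than k + 1 contribute no edges.
        for i in range(len(read) - k):
            graph[read[i:i + k]].add(read[i + 1:i + k + 1])
    return dict(graph)


def n50(contig_lengths):
    """Length L such that contigs of length >= L together cover at
    least half of the total assembled bases. Returns 0 for no contigs."""
    total = sum(contig_lengths)
    running = 0
    for length in sorted(contig_lengths, reverse=True):
        running += length
        if 2 * running >= total:
            return length
    return 0


# Two overlapping toy reads (assumed data, not from the paper).
reads = ["ACGTAC", "CGTACG"]
graph = build_de_bruijn(reads, k=4)
for kmer, successors in sorted(graph.items()):
    print(kmer, "->", sorted(successors))

print("N50:", n50([2, 3, 4, 5, 10]))
```

Because the two reads share k-mers, they collapse onto a single path in the graph (ACGT, CGTA, GTAC, TACG), which is why the representation stays compact at high coverage: redundant reads add no new nodes.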