SPAdes: A New Genome Assembly Algorithm and Its Applications to Single-Cell Sequencing
Top Cited Papers
- 1 May 2012
- journal article
- research article
- Published by Mary Ann Liebert Inc in Journal of Computational Biology
- Vol. 19 (5), 455-477
- https://doi.org/10.1089/cmb.2012.0021
Abstract
The lion's share of bacteria in various environments cannot be cloned in the laboratory and thus cannot be sequenced using existing technologies. A major goal of single-cell genomics is to complement gene-centric metagenomic data with whole-genome assemblies of uncultivated organisms. Assembly of single-cell data is challenging because of highly non-uniform read coverage as well as elevated levels of sequencing errors and chimeric reads. We describe SPAdes, a new assembler for both single-cell and standard (multicell) assembly, and demonstrate that it improves on the recently released E+V−SC assembler (specialized for single-cell data) and on popular assemblers Velvet and SoapDeNovo (for multicell data). SPAdes generates single-cell assemblies, providing information about genomes of uncultivatable bacteria that vastly exceeds what may be obtained via traditional metagenomics studies. SPAdes is available online (http://bioinf.spbau.ru/spades). It is distributed as open source software.Keywords
This publication has 41 references indexed in Scilit:
- Single-cell dissection of transcriptional heterogeneity in human colon tumorsNature Biotechnology, 2011
- Paired de Bruijn Graphs: A Novel Approach for Incorporating Mate Pair Information into Genome AssemblersJournal of Computational Biology, 2011
- Efficient de novo assembly of single-cell bacterial genomes from short-read data setsNature Biotechnology, 2011
- Error correction of high-throughput sequencing datasets with non-uniform coverageBioinformatics, 2011
- Whole-genome molecular haplotyping of single cellsNature Biotechnology, 2011
- High-quality draft assemblies of mammalian genomes from massively parallel sequence dataProceedings of the National Academy of Sciences, 2010
- Automated de novo protein sequencing of monoclonal antibodiesNature Biotechnology, 2008
- Genomic sequencing of single microbial cells from environmental samplesCurrent Opinion in Microbiology, 2008
- Dissecting biological “dark matter” with single-cell genetic analysis of rare and uncultivated TM7 microbes from the human mouthProceedings of the National Academy of Sciences, 2007
- A New Algorithm for DNA Sequence AssemblyJournal of Computational Biology, 1995