Accurate and comprehensive sequencing of personal genomes
Open Access
- 19 July 2011
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 21 (9), 1498-1505
- https://doi.org/10.1101/gr.123638.111
Abstract
As whole-genome sequencing becomes commoditized and we begin to sequence and analyze personal genomes for clinical and diagnostic purposes, it is necessary to understand what constitutes a complete sequencing experiment for determining genotypes and detecting single-nucleotide variants. Here, we show that the current recommendation of ∼30× coverage is not adequate to produce genotype calls across a large fraction of the genome with acceptably low error rates. Our results are based on analyses of a clinical sample sequenced on two related Illumina platforms, GAIIx and HiSeq 2000, to a very high depth (126×). We used these data to establish genotype-calling filters that dramatically increase accuracy. We also empirically determined how the callable portion of the genome varies as a function of the amount of sequence data used. These results help provide a “sequencing guide” for future whole-genome sequencing decisions and metrics by which coverage statistics should be reported.