Discovery and genotyping of genome structural polymorphism by sequencing on a population scale
Open Access
- 13 February 2011
- journal article
- research article
- Published by Springer Science and Business Media LLC in Nature Genetics
- Vol. 43 (3), 269-276
- https://doi.org/10.1038/ng.768
Abstract
Steven McCarroll and colleagues report an analytical framework for characterizing genome deletion polymorphism in populations, applied here to the low coverage genome sequences of 168 individuals from the 1000 Genomes Project. Their population-aware analysis enables structural inference with greater accuracy than previous methods. Accurate and complete analysis of genome variation in large populations will be required to understand the role of genome variation in complex disease. We present an analytical framework for characterizing genome deletion polymorphism in populations using sequence data that are distributed across hundreds or thousands of genomes. Our approach uses population-level concepts to reinterpret the technical features of sequence data that often reflect structural variation. In the 1000 Genomes Project pilot, this approach identified deletion polymorphism across 168 genomes (sequenced at 4× average coverage) with sensitivity and specificity unmatched by other algorithms. We also describe a way to determine the allelic state or genotype of each deletion polymorphism in each genome; the 1000 Genomes Project used this approach to type 13,826 deletion polymorphisms (48–995,664 bp) at high accuracy in populations. These methods offer a way to relate genome structural polymorphism to complex disease in populations.Keywords
This publication has 27 references indexed in Scilit:
- Genotype ImputationAnnual Review of Genomics and Human Genetics, 2009
- Personalized copy number and segmental duplication maps using next-generation sequencingNature Genetics, 2009
- Sensitive and accurate detection of copy number variants using read depth of coverageGenome Research, 2009
- Six new loci associated with body mass index highlight a neuronal influence on body weight regulationNature Genetics, 2008
- Integrated detection and population-genetic analysis of SNPs and copy number variationNature Genetics, 2008
- Deletion polymorphism upstream of IRGM associated with altered IRGM expression and Crohn's diseaseNature Genetics, 2008
- Mapping short DNA sequencing reads and calling variants using mapping quality scoresGenome Research, 2008
- A new multipoint method for genome-wide association studies by imputation of genotypesNature Genetics, 2007
- A haplotype map of the human genomeNature, 2005
- Fine-scale structural variation of the human genomeNature Genetics, 2005