Automating resequencing-based detection of insertion-deletion polymorphisms
- 19 November 2006
- journal article
- technical report
- Published by Springer Nature in Nature Genetics
- Vol. 38 (12), 1457-1462
- https://doi.org/10.1038/ng1925
Abstract
Structural and insertion-deletion (indel) variants have received considerable recent attention, partly because of their phenotypic consequences. Among these variants, the most common are small indels ( ∼ 1–30 bp). Identifying and genotyping indels using sequence traces obtained from diploid samples requires extensive manual review, which makes large-scale studies inconvenient. We report a new algorithm, implemented in available software (PolyPhred version 6.0), to help automate detection and genotyping of indels from sequence traces. The algorithm identifies heterozygous individuals, which permits the discovery of low-frequency indels. It finds 80% of all indel polymorphisms with almost no false positives and finds 97% with a false discovery rate of 10%. Additionally, genotyping accuracy exceeds 99%, and it correctly infers indel length in 96% of the cases. Using this approach, we identify indels in the HapMap ENCODE regions, providing the first report of these polymorphisms in this data set.Keywords
This publication has 32 references indexed in Scilit:
- Common deletions and SNPs are in linkage disequilibrium in the human genomeNature Genetics, 2005
- Common deletion polymorphisms in the human genomeNature Genetics, 2005
- A high-resolution survey of deletion polymorphism in the human genomeNature Genetics, 2005
- Identification and functional characterization of a novel 27-bp deletion in the macroglycopeptide-coding region of the GPIBA gene resulting in platelet-type von Willebrand diseaseBlood, 2005
- Fine-scale structural variation of the human genomeNature Genetics, 2005
- Comprehensive identification and characterization of diallelic insertion–deletion polymorphisms in 330 human candidate genesHuman Molecular Genetics, 2004
- Familial PAX8 Small Deletion (c.989_992delACCC) Associated with Extreme Phenotype VariabilityJournal of Clinical Endocrinology & Metabolism, 2004
- Large-Scale Copy Number Polymorphism in the Human GenomeScience, 2004
- Functional annotation of a novel NFKB1 promoter polymorphism that increases risk for ulcerative colitisHuman Molecular Genetics, 2004
- On the formation of spontaneous deletions: The importance of short sequence homologies in the generation of large deletionsCell, 1982