Identification of genetic variants using bar-coded multiplexed sequencing
Open Access
- 14 September 2008
- journal article
- research article
- Published by Springer Nature in Nature Methods
- Vol. 5 (10), 887-893
- https://doi.org/10.1038/nmeth.1251
Abstract
Targeted regions of the human genome are resequenced in multiplex with Illumina technology, and the pipeline is evaluated for polymorphism discovery and genotyping. We developed a generalized framework for multiplexed resequencing of targeted human genome regions on the Illumina Genome Analyzer using degenerate indexed DNA bar codes ligated to fragmented DNA before sequencing. Using this method, we simultaneously sequenced the DNA of multiple HapMap individuals at several Encyclopedia of DNA Elements (ENCODE) regions. We then evaluated the use of Bayes factors for discovering and genotyping polymorphisms. For polymorphisms that were either previously identified within the Single Nucleotide Polymorphism database (dbSNP) or visually evident upon re-inspection of archived ENCODE traces, we observed a false positive rate of 11.3% using strict thresholds for predicting variants and 69.6% for lax thresholds. Conversely, false negative rates were 10.8–90.8%, with false negatives at stricter cut-offs occurring at lower coverage (90% of genetic variants are discoverable using multiplexed sequencing provided sufficient coverage at the polymorphic base.Keywords
This publication has 30 references indexed in Scilit:
- Error-correcting barcoded primers for pyrosequencing hundreds of samples in multiplexNature Methods, 2008
- Genome-wide in situ exon capture for selective resequencingNature Genetics, 2007
- A second generation human haplotype map of over 3.1 million SNPsNature, 2007
- Direct selection of human genomic loci by microarray hybridizationNature Methods, 2007
- Multiplex amplification of large sets of human exonsNature Methods, 2007
- Microarray-based genomic selection for high-throughput resequencingNature Methods, 2007
- A pyrosequencing-tailored nucleotide barcode design unveils opportunities for large-scale sample multiplexingNucleic Acids Research, 2007
- Targeted high-throughput sequencing of tagged nucleic acid samplesNucleic Acids Research, 2007
- Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot projectNature, 2007
- Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controlsNature, 2007