Haplotype-resolved genome sequencing of a Gujarati Indian individual
Top Cited Papers
- 1 January 2011
- journal article
- research article
- Published by Springer Nature in Nature Biotechnology
- Vol. 29 (1), 59-63
- https://doi.org/10.1038/nbt.1740
Abstract
Sequencing a human genome using next-generation methods does not distinguish between the two copies of each chromosome. Kitzman et al. determine a haplotype-resolved genome sequence by efficiently constructing and sequencing long-insert clones that cover the diploid genome with a low likelihood of overlap. Haplotype information is essential to the complete description and interpretation of genomes1, genetic diversity2 and genetic ancestry3. Although individual human genome sequencing is increasingly routine4, nearly all such genomes are unresolved with respect to haplotype. Here we combine the throughput of massively parallel sequencing5 with the contiguity information provided by large-insert cloning6 to experimentally determine the haplotype-resolved genome of a South Asian individual. A single fosmid library was split into a modest number of pools, each providing ∼3% physical coverage of the diploid genome. Sequencing of each pool yielded reads overwhelmingly derived from only one homologous chromosome at any given location. These data were combined with whole-genome shotgun sequence to directly phase 94% of ascertained heterozygous single nucleotide polymorphisms (SNPs) into long haplotype blocks (N50 of 386 kilobases (kbp)). This method also facilitates the analysis of structural variation, for example, to anchor novel insertions7,8 to specific locations and haplotypes.Keywords
This publication has 33 references indexed in Scilit:
- A map of human genome variation from population-scale sequencingNature, 2010
- Integrating common and rare genetic variation in diverse human populationsNature, 2010
- Exome sequencing identifies the cause of a mendelian disorderNature Genetics, 2009
- Personalized copy number and segmental duplication maps using next-generation sequencingNature Genetics, 2009
- Targeted capture and massively parallel sequencing of 12 human exomesNature, 2009
- The diploid genome sequence of an Asian individualNature, 2008
- Next-generation DNA sequencingNature Biotechnology, 2008
- Evolutionary toggling of the MAPT 17q21.31 inversion regionNature Genetics, 2008
- Mapping and sequencing of structural variation from eight human genomesNature, 2008
- Initial sequencing and analysis of the human genomeNature, 2001