MaCH: using sequence and genotype data to estimate haplotypes and unobserved genotypes

Top Cited Papers

Open Access

5 November 2010

journal article
research article
Published by Wiley in Genetic Epidemiology

Vol. 34 (8), 816-834
https://doi.org/10.1002/gepi.20533

Abstract

Genome‐wide association studies (GWAS) can identify common alleles that contribute to complex disease susceptibility. Despite the large number of SNPs assessed in each study, the effects of most common SNPs must be evaluated indirectly using either genotyped markers or haplotypes thereof as proxies. We have previously implemented a computationally efficient Markov Chain framework for genotype imputation and haplotyping in the freely available MaCH software package. The approach describes sampled chromosomes as mosaics of each other and uses available genotype and shotgun sequence data to estimate unobserved genotypes and haplotypes, together with useful measures of the quality of these estimates. Our approach is already widely used to facilitate comparison of results across studies as well as meta‐analyses of GWAS. Here, we use simulations and experimental genotypes to evaluate its accuracy and utility, considering choices of genotyping panels, reference panel configurations, and designs where genotyping is replaced with shotgun sequencing. Importantly, we show that genotype imputation not only facilitates cross study analyses but also increases power of genetic association studies. We show that genotype imputation of common variants using HapMap haplotypes as a reference is very accurate using either genome‐wide SNP data or smaller amounts of data typical in fine‐mapping studies. Furthermore, we show the approach is applicable in a variety of populations. Finally, we illustrate how association analyses of unobserved variants will benefit from ongoing advances such as larger HapMap reference panels and whole genome shotgun sequencing technologies.Genet. Epidemiol. 34: 816‐834, 2010.

Keywords

This publication has 52 references indexed in Scilit:

A genome-wide association study on African-ancestry populations for asthma
Journal of Allergy and Clinical Immunology, 2009
Genotype-Imputation Accuracy across Worldwide Human Populations
American Journal of Human Genetics, 2009
Six new loci associated with body mass index highlight a neuronal influence on body weight regulation
Nature Genetics, 2008
Identification of ten loci associated with height highlights new biological pathways in human growth
Nature Genetics, 2008
A second generation human haplotype map of over 3.1 million SNPs
Nature, 2007
A new multipoint method for genome-wide association studies by imputation of genotypes
Nature Genetics, 2007
A worldwide survey of haplotype variation and linkage disequilibrium in the human genome
Nature Genetics, 2006
In silico method for inferring genotypes in pedigrees
Nature Genetics, 2006
A haplotype map of the human genome
Nature, 2005
Efficiency and power in genetic association studies
Nature Genetics, 2005

Cited by 1783 articles