Variance component model to account for sample structure in genome-wide association studies
Top Cited Papers
- 7 March 2010
- journal article
- research article
- Published by Springer Nature in Nature Genetics
- Vol. 42 (4), 348-354
- https://doi.org/10.1038/ng.548
Abstract
Eleazar Eskin and colleagues report a variance component model for correcting for sample structure in association studies. The EMMAX program is publicly available and may be used for analysis of genome-wide association study datasets. Although genome-wide association studies (GWASs) have identified numerous loci associated with complex traits, imprecise modeling of the genetic relatedness within study samples may cause substantial inflation of test statistics and possibly spurious associations. Variance component approaches, such as efficient mixed-model association (EMMA), can correct for a wide range of sample structures by explicitly accounting for pairwise relatedness between individuals, using high-density markers to model the phenotype distribution; but such approaches are computationally impractical. We report here a variance component approach implemented in publicly available software, EMMA eXpedited (EMMAX), that reduces the computational time for analyzing large GWAS data sets from years to hours. We apply this method to two human GWAS data sets, performing association analysis for ten quantitative traits from the Northern Finland Birth Cohort and seven common diseases from the Wellcome Trust Case Control Consortium. We find that EMMAX outperforms both principal component analysis and genomic control in correcting for sample structure.Keywords
This publication has 56 references indexed in Scilit:
- Finding the missing heritability of complex diseasesNature, 2009
- Case‐control association testing in the presence of unknown relationshipsGenetic Epidemiology, 2009
- Genotype‐based matching to correct for population stratification in large‐scale case‐control genetic association studiesGenetic Epidemiology, 2009
- The Genome-wide Patterns of Variation Expose Significant Substructure in a Founder PopulationAmerican Journal of Human Genetics, 2008
- Estimation of significance thresholds for genomewide association scansGenetic Epidemiology, 2008
- Genome-wide association study identifies novel breast cancer susceptibility lociNature, 2007
- Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controlsNature, 2007
- A high-resolution HLA and SNP haplotype map for disease association studies in the extended human MHCNature Genetics, 2006
- Principal components analysis corrects for stratification in genome-wide association studiesNature Genetics, 2006
- A unified mixed-model method for association mapping that accounts for multiple levels of relatednessNature Genetics, 2005