Case‐control association testing in the presence of unknown relationships

30 March 2009

journal article
research article
Published by Wiley in Genetic Epidemiology

Vol. 33 (8), 668-678
https://doi.org/10.1002/gepi.20418

Abstract

Genome‐wide association studies yield inflated false‐positive rates when unrecognized cryptic relatedness exists. A number of methods have been proposed for testing association between markers and disease with a correction for known pedigree‐based relationships. However, in most case‐control studies, relationships are unknown, yet the design is predicated on the assumption of at least ancestral relatedness among cases. Here, we focus on adjusting for cryptic relatedness when the genealogy of the sample is unknown, particularly in samples from isolated populations, where cryptic relatedness may be problematic. We estimate cryptic relatedness using maximum‐likelihood methods and use a corrected χ² test with estimated kinship coefficients for testing in the context of unknown cryptic relatedness. Estimated kinship coefficients precisely characterize the relatedness between truly related people, but are biased for unrelated pairs. The proposed test substantially reduces spurious positive results, producing a uniform null distribution of P‐values. In particular, when pedigree information is missing, estimated kinship coefficients can still be used to correct for non‐independence among individuals. The corrected test was applied to real data sets from genetic isolates and produced a distribution of P‐values that was close to uniform. Thus, the proposed test corrects the non‐uniform distribution of P‐values obtained with the uncorrected test, which illustrates the advantage of the approach on real data. Genet. Epidemiol. 33:668–678, 2009.
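
The idea of a relatedness-corrected χ² test can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a single biallelic marker, a hypothetical function name `corrected_chi2`, and a matrix `phi` with entries 2 × (estimated kinship coefficient) off the diagonal and 1 + (inbreeding coefficient) on the diagonal. Under those assumptions, the null variance of the case-control allele-frequency difference is scaled by the relatedness structure instead of treating individuals as independent.

```python
import math
import numpy as np

def corrected_chi2(genotypes, is_case, phi):
    """Allele-frequency contrast test with a relatedness-adjusted null variance.

    genotypes: copies of the reference allele per individual (0, 1 or 2).
    is_case:   truthy for cases, falsy for controls.
    phi:       assumed n x n matrix, 2 * kinship off-diagonal and
               1 + inbreeding coefficient on the diagonal.
    Returns (statistic, p_value) for a 1-df chi-square reference distribution.
    """
    y = np.asarray(genotypes, dtype=float) / 2.0   # per-individual allele frequency
    case = np.asarray(is_case, dtype=bool)
    phi = np.asarray(phi, dtype=float)

    n_case, n_ctrl = case.sum(), (~case).sum()
    # Contrast weights: mean of cases minus mean of controls.
    w = np.where(case, 1.0 / n_case, -1.0 / n_ctrl)

    diff = w @ y                                   # p_case - p_control
    p = y.mean()                                   # naive pooled allele frequency
    if p <= 0.0 or p >= 1.0:
        raise ValueError("marker is monomorphic in this sample")

    # Null variance: Var(y) = 0.5 * p * (1 - p) * phi, so Var(w'y) uses w' phi w.
    var0 = 0.5 * p * (1.0 - p) * (w @ phi @ w)
    stat = diff ** 2 / var0
    p_value = math.erfc(math.sqrt(stat / 2.0))     # upper tail of chi-square, 1 df
    return stat, p_value
```

With `phi` set to the identity matrix, the statistic reduces to the usual allelic test for unrelated, outbred individuals; positive kinship among cases inflates the null variance and so shrinks the statistic.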
