The Power of Association Studies to Detect the Contribution of Candidate Genetic Loci to Variation in Complex Traits
- 1 August 1999
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 9 (8), 720-731
- https://doi.org/10.1101/gr.9.8.720
Abstract
The statistical power of five association study test statistics (two haplotype-based tests, two marker-based tests, and the Transmission Disequilibrium Test–Q5) to detect single nucleotide polymorphism (SNP)/phenotype associations in a linkage–disequilibrium-based candidate gene scan employing a number of SNPs is examined. Power is estimated as a function of realistic parameters expected to affect the likelihood of detecting a significant association: the number of SNPs examined, the scaled recombination size of the region examined, the proportion of variance in the trait attributable to a hidden causative polymorphism within the region, and the number of individuals or families examined. For the different combinations of parameter values, power is estimated from a large number of realizations of a simulated coalescent describing a single random mating population with mutation, random genetic drift, and recombination. This explicit population genetics model results in a distribution of DNA marker heterozygosities and linkage disequilibria that are likely to resemble those expected in actual population samples. The study concludes that (1) marker-based permutation tests are more powerful than simple haplotype-based tests, (2) there is sufficient power to detect the presence of causative polymorphisms of small effect if on the order of 500 individuals are sampled, (3) greater power is achieved by increasing the sample size than by increasing the number of polymorphisms, (4) association studies are generally more powerful than transmission disequilibrium-based tests, and (5) for the range of parameters considered association studies have a low repeatability unless sample sizes are on the order of 500 individuals. Estimates of 4Nc for a number of gene regions and human populations will be of use in determining the density of SNPs that are likely to be required for successful association studies.This publication has 32 references indexed in Scilit:
- Haplotype Structure and Population Genetic Inferences from Nucleotide-Sequence Variation in Human Lipoprotein LipaseAmerican Journal of Human Genetics, 1998
- Large-Scale Identification, Mapping, and Genotyping of Single-Nucleotide Polymorphisms in the Human GenomeScience, 1998
- Variations on a Theme: Cataloging Human DNA Sequence VariationScience, 1997
- The Future of Genetic Studies of Complex Human DiseasesScience, 1996
- Ancestral Inference from Samples of DNA Sequences with RecombinationJournal of Computational Biology, 1996
- Naturally Occurring Variation in Bristle Number and DNA Polymorphisms at the scabrous Locus of Drosophila melanogasterScience, 1994
- Genetic Dissection of Complex TraitsScience, 1994
- Estimating the recombination parameter of a finite population model without selectionGenetics Research, 1987
- Properties of a neutral allele model with intragenic recombinationTheoretical Population Biology, 1983
- Linkage disequilibrium in finite populationsTheoretical and Applied Genetics, 1968