Statistical analysis strategies for association studies involving rare variants
- 13 October 2010
- journal article
- research article
- Published by Springer Nature in Nature Reviews Genetics
- Vol. 11 (11), 773-785
- https://doi.org/10.1038/nrg2867
Abstract
We review the motivation for exploring the role of rare variants in phenotypic expression. There are several problems with capturing the effects of rare variants in association studies using current statistical analysis methods. We discuss the concept and use of collapsing sets of rare variants into predictors of phenotypic expression, to aid statistical analyses of rare variant associations. Functional annotations of specific variants and genomic regions can be used to define collapsed sets of rare variants. A range of statistical analysis models and inference-making procedures could be exploited to assess the association between rare variants and phenotypic expression. We discuss the relative merits of these approaches. We compare moving window and defined region approaches to the analysis of rare variant effects. We discuss the importance for rare variant analysis of the flexibility of statistical analysis models and methods in accommodating factors, including common variants, interactions between variants, beneficial and deleterious effects of variants and environmental factors.Keywords
This publication has 142 references indexed in Scilit:
- Interpretation of Association Signals and Identification of Causal Variants from Genome-wide Association StudiesAmerican Journal of Human Genetics, 2010
- Pooled Association Tests for Rare Variants in Exon-Resequencing StudiesAmerican Journal of Human Genetics, 2010
- Mapping Allele-Specific DNA Methylation: A New Tool for Maximizing Information from GWASAmerican Journal of Human Genetics, 2010
- Detecting rare variants for complex traits using family and unrelated dataGenetic Epidemiology, 2009
- An evaluation of statistical approaches to rare variant analysis in genetic association studiesGenetic Epidemiology, 2009
- Association tests using kernel‐based measures of multi‐locus genotype similarity between individualsGenetic Epidemiology, 2009
- Generalized linear modeling with regularization for detecting common disease rare haplotype associationGenetic Epidemiology, 2008
- Power comparisons between similarity‐based multilocus association methods, logistic regression, and score tests for haplotypesGenetic Epidemiology, 2008
- Understanding the accuracy of statistical haplotype inference with sequence data of known phaseGenetic Epidemiology, 2007
- Subsets of SNPs define rare genotype classes that predict ischemic heart diseaseHuman Genetics, 2006