Prediction of individual genetic risk to disease from genome-wide association studies
Open Access
- 4 September 2007
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 17 (10), 1520-1528
- https://doi.org/10.1101/gr.6665407
Abstract
Empirical studies suggest that the effect sizes of individual causal risk alleles underlying complex genetic diseases are small, with most genotype relative risks in the range of 1.1–2.0. Although the increased risk of disease for a carrier is small for any single locus, knowledge of multiple risk alleles throughout the genome could allow the identification of individuals who are at high risk. In this study, we investigate the number and effect size of risk loci that underlie complex disease constrained by the disease parameters of prevalence and heritability. Then we quantify the value of prediction of genetic risk to disease using a range of realistic combinations of the number, size, and distribution of risk effects that underlie complex diseases. We propose an approach to assess the genetic risk of a disease in healthy individuals, based on dense genome-wide SNP panels. We test this approach using simulation. When the number of loci contributing to the disease is >50, a large case-control study is needed to identify a set of risk loci for use in predicting the disease risk of healthy people not included in the case-control study. For diseases controlled by 1000 loci of mean relative risk of only 1.04, a case-control study with 10,000 cases and controls can lead to selection of ∼75 loci that explain >50% of the genetic variance. The 5% of people with the highest predicted risk are three to seven times more likely to suffer the disease than the population average, depending on heritability and disease prevalence. Whether an individual with known genetic risk develops the disease depends on known and unknown environmental factors.
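To illustrate how many small per-locus effects can add up to a large spread in individual risk, here is a minimal simulation sketch. It is not the authors' procedure: it assumes a simple multiplicative model in which every locus has the same hypothetical per-allele relative risk (`rel_risk`) and risk-allele frequency (`allele_freq`), it treats all causal loci as known rather than selected from a case-control study, and it ignores prevalence, heritability, and environmental factors. All function and parameter names are illustrative.

```python
import math
import random


def simulate_relative_risks(n_people, n_loci, rel_risk, allele_freq, seed=0):
    """Return each person's genetic risk relative to the population mean.

    Assumption: risks combine multiplicatively across loci and across the
    two allele copies at each locus (a simplification for illustration).
    """
    rng = random.Random(seed)
    log_rr = math.log(rel_risk)
    raw = []
    for _ in range(n_people):
        # Count risk alleles over two independent copies at each locus.
        n_risk = sum(
            (rng.random() < allele_freq) + (rng.random() < allele_freq)
            for _ in range(n_loci)
        )
        raw.append(math.exp(n_risk * log_rr))
    mean_raw = sum(raw) / len(raw)
    # Rescale so that the population average risk equals 1.
    return [r / mean_raw for r in raw]


def top_fraction_mean(risks, fraction=0.05):
    """Mean relative risk among the highest-risk fraction of people."""
    ranked = sorted(risks, reverse=True)
    k = max(1, int(len(ranked) * fraction))
    return sum(ranked[:k]) / k


# Hypothetical parameters, chosen only to keep the run fast.
risks = simulate_relative_risks(
    n_people=2000, n_loci=100, rel_risk=1.1, allele_freq=0.3
)
top5 = top_fraction_mean(risks)
print(f"mean risk of top 5% relative to population average: {top5:.2f}")
```

Because this sketch uses true risks with every locus known, its top-5% figure is not comparable to the predicted-risk results reported in the abstract; it only shows the mechanism by which small relative risks compound.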