Marker selection for genetic case–control association studies
- 23 April 2009
- journal article
- research article
- Published by Springer Nature in Nature Protocols
- Vol. 4 (5), 743-752
- https://doi.org/10.1038/nprot.2009.38
Abstract
Association studies can focus on candidate gene(s), a particular genomic region, or adopt a genome-wide association approach, each of which has implications for marker selection. The strategy for marker selection will affect the statistical power of the study to detect a disease association and is a crucial element of study design. The abundant single nucleotide polymorphisms (SNPs) are the markers of choice in genetic case--control association studies. The genotypes of neighboring SNPs are often highly correlated ('in linkage disequilibrium', LD) within a population, which is utilized for selecting specific 'tagSNPs' to serve as proxies for other nearby SNPs in high LD. General guidelines for SNP selection in candidate genes/regions and genome-wide studies are provided in this protocol, along with illustrative examples. Publicly available web-based resources are utilized to browse and retrieve data, and software, such as Haploview and Goldsurfer2, is applied to investigate LD and to select tagSNPs.Keywords
This publication has 23 references indexed in Scilit:
- Evaluating the Effects of Imputation on the Power, Coverage, and Cost Efficiency of Genome-wide SNP PlatformsAmerican Journal of Human Genetics, 2008
- Genome-wide association studies: progress and potential for drug discovery and developmentNature Reviews Drug Discovery, 2008
- A second generation human haplotype map of over 3.1 million SNPsNature, 2007
- Challenges and standards in integrating surveys of structural variationNature Genetics, 2007
- A new multipoint method for genome-wide association studies by imputation of genotypesNature Genetics, 2007
- Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot projectNature, 2007
- Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controlsNature, 2007
- A haplotype map of the human genomeNature, 2005
- A map of human genome sequence variation containing 1.42 million single nucleotide polymorphismsNature, 2001
- Initial sequencing and analysis of the human genomeNature, 2001