Database mining for selection of SNP markers useful in admixture mapping
Open Access
- 14 February 2009
- journal article
- research article
- Published by Springer Nature in BioData Mining
- Vol. 2 (1), 1
- https://doi.org/10.1186/1756-0381-2-1
Abstract
New technologies make it possible for the first time to genotype hundreds of thousands of SNPs simultaneously. A wealth of genomic information in the form of publicly available databases is underutilized as a potential resource for uncovering functionally relevant markers underlying complex human traits. Given the huge amount of SNP data available from the annotation of human genetic variation, data mining is a reasonable approach to investigating the number of SNPs that are informative for ancestry information. The distribution and density of SNPs across the genome of African and European populations were extensively investigated by using the HapMap, Affymetrix, and Illumina SNP databases. We exploited these resources by mining the data available from each of these databases to prioritize potential candidate SNPs useful for admixture mapping in complex human diseases and traits. Over 4 million SNPs were compared between Africans and Europeans on the basis of a pre-specified recommended allele frequency difference (delta) value of >or= 0.3. The method identified 15% of HapMap, 11% of Affymetrix, and 14% of Illumina SNP sets as candidate SNPs, termed ancestry informative markers (AIMs). These AIM panels with assigned rs numbers, allele frequencies in each ethnic group, delta value, and map positions are all posted on our website http://www.ssg.uab.edu/downloads/admixture_mapping/SNPAIMs.txt. All marker information in this data set is freely and publicly available without restriction. The selected SNP sets represent valuable resources for admixture mapping studies. The overlap between selected AIMs by this single measure of marker informativeness in the different platforms is discussed.Keywords
This publication has 37 references indexed in Scilit:
- Relative Impact of Nucleotide and Copy Number Variation on Gene Expression PhenotypesScience, 2007
- Will admixture mapping work to find disease genes?Philosophical Transactions Of The Royal Society B-Biological Sciences, 2005
- Mapping by admixture linkage disequilibrium: advances, limitations and guidelinesNature Reviews Genetics, 2005
- Genome-wide association studies for common diseases and complex traitsNature Reviews Genetics, 2005
- Ethnic-Difference Markers for Use in Mapping by Admixture Linkage DisequilibriumAmerican Journal of Human Genetics, 2002
- Markers for Mapping by Admixture Linkage Disequilibrium in African American and Hispanic PopulationsAmerican Journal of Human Genetics, 2001
- Estimating African American Admixture Proportions by Use of Population-Specific AllelesAmerican Journal of Human Genetics, 1998
- Ethnic-affiliation estimation by use of population-specific DNA markers.1997
- Mapping genes underlying ethnic differences in disease risk by linkage disequilibrium in recently admixed populations.1997
- Mapping by admixture linkage disequilibrium in human populations: limits and guidelines.1994