Single-nucleotide polymorphisms in genes relating to homocysteine metabolism: how applicable are public SNP databases to a typical European population?

Open Access

20 October 2004

journal article
Published by Springer Science and Business Media LLC in European Journal of Human Genetics

Vol. 13 (1), 86-95
https://doi.org/10.1038/sj.ejhg.5201282

Abstract

To facilitate the association studies in complex diseases characterized by hyperhomocysteinemia, we collected structural and frequency data on single-nucleotide polymorphism (SNPs) in 24 genes relating to homocysteine metabolism. Firstly, we scanned approximately 1.2 Mbp of sequence in the NCBI SNP database (dbSNP) build 110 and we detected 1353 putative SNPs with an average in silico genic density of 1:683. Out of 112 putative SNPs in coding regions (cSNPs), we selected a subset of 42 cSNPs and we assessed the applicability of the NCBI dbSNP to the Czech population - a typical representative of European Caucasians - by determining the frequency of the putative cSNPs experimentally by PCR-RFLP or ARMS-PCR in at least 110 control Czech chromosomes. As only 25 of the 42 analyzed cSNPs met the criterion of >/=1% frequency, the positive predictive value of the NCBI data set for our population reached 60%, which is similar to other studies. The correlation of SNP frequency between Czechs and other Caucasians - obtained from NCBI and/or literature - was stronger (r(2)=0.90 for 20 cSNPs) than between Czechs and general NCBI database entries (r(2)=0.73 for 27 cSNPs). Moreover, frequencies of all 20 putative cSNPs, for which data in Caucasians were available, were congruently below or above the 1% frequency criterion both in Czechs and in other Caucasians. In summary, our study shows that the NCBI dbSNP is a useful tool for selecting cSNPs for genetic studies of hyperhomocysteinemia in European populations, although experimental validation of SNPs should be performed, especially if the cSNP entry lacks any frequency data in Caucasians.

Keywords

This publication has 27 references indexed in Scilit:

Quality and completeness of SNP databases
Nature Genetics, 2003
Characterization of 458 single nucleotide polymorphisms of disease candidate genes in the Korean population
Journal of Human Genetics, 2003
Cystathionine β-synthase polymorphisms and hyperhomocysteinaemia: an association study
European Journal of Human Genetics, 2003
A Polymorphism, R653Q, in the Trifunctional Enzyme Methylenetetrahydrofolate Dehydrogenase/Methenyltetrahydrofolate Cyclohydrolase/Formyltetrahydrofolate Synthetase Is a Maternal Genetic Risk Factor for Neural Tube Defects: Report of the Birth Defects Research Group
American Journal of Human Genetics, 2002
Data mining of public SNP databases for the selection of intragenic SNPs
Human Mutation, 2002
Single-nucleotide polymorphisms in the public domain: how useful are they?
Nature Genetics, 2001
A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms
Nature, 2001
Data mining: Efficiency of using sequence databases for polymorphism discovery
Human Mutation, 2001
dbSNP: the NCBI database of genetic variation
Nucleic Acids Research, 2001
Betaine-Homocysteine Methyltransferase (BHMT): Genomic Sequencing and Relevance to Hyperhomocysteinemia and Vascular Disease in Humans
Molecular Genetics and Metabolism, 2000

Cited by 14 articles