Data mining: Efficiency of using sequence databases for polymorphism discovery

23 January 2001

Published by Hindawi Limited in Human Mutation

Vol. 17 (2), 141-150
https://doi.org/10.1002/1098-1004(200102)17:2<141::aid-humu6>3.0.co;2-1

Abstract

An open question in research on single nucleotide polymorphisms (SNPs) is what percentage of true SNPs is found by in silico pre-screening. To address it, we selected 13 genes and determined the complete collection of “true” polymorphisms, that is, polymorphisms detected experimentally in these genes, either in our laboratory using denaturing high-performance liquid chromatography (DHPLC) and fluorescent sequencing, or in other laboratories using comparable methods. The genes studied by our group were PTGS2, IGFBP1, IGFBP3, and CYP19. GenBank sequence information was then aligned using two methods, and the sequence differences were termed “candidate” polymorphisms. Comparing the SNPs obtained experimentally with those obtained in silico, we found that in silico methods are relatively specific (up to 55% of candidate SNPs found by SNPFinder were also detected experimentally) but have low sensitivity (no more than 27% of true SNPs were found by in silico methods). Hum Mutat 17:141–150, 2001.
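
The comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline and does not reproduce SNPFinder: the function names, the toy alignment, and the "true" positions below are all hypothetical and chosen only to show how the two reported proportions are computed. Candidate sites are taken to be alignment columns where the aligned sequences disagree; the fraction of candidates confirmed experimentally corresponds to what the abstract calls specificity, and the fraction of true SNPs recovered corresponds to sensitivity.

```python
def candidate_positions(aligned_seqs):
    """Return 0-based columns where pre-aligned sequences disagree.

    Gaps ('-') and ambiguous bases ('N') are ignored. This is an assumption
    of the sketch, not a rule taken from the paper.
    """
    positions = []
    for col, bases in enumerate(zip(*aligned_seqs)):
        observed = {b.upper() for b in bases} - {"-", "N"}
        if len(observed) > 1:
            positions.append(col)
    return positions


def compare_snp_sets(candidates, true_snps):
    """Compare in silico candidates against experimentally detected SNPs."""
    candidates, true_snps = set(candidates), set(true_snps)
    confirmed = candidates & true_snps
    return {
        # share of candidate SNPs that were also detected experimentally
        "fraction_confirmed": len(confirmed) / len(candidates) if candidates else 0.0,
        # share of true SNPs that the in silico screen recovered
        "sensitivity": len(confirmed) / len(true_snps) if true_snps else 0.0,
    }


# Toy data only: four made-up aligned sequences and made-up "true" positions.
toy_alignment = [
    "ACGTACGTAC",
    "ACGTACGTAC",
    "ACCTACGTAC",
    "ACGTACGAAC",
]
toy_true_positions = {2, 4, 5, 9}

toy_candidates = candidate_positions(toy_alignment)
toy_result = compare_snp_sets(toy_candidates, toy_true_positions)
print(toy_candidates, toy_result)
```

In this toy case one of the two candidate columns matches a "true" site, so half of the candidates are confirmed while only a quarter of the true sites are recovered. The same pattern of reasonable confirmation but low recovery is what the abstract reports, at different magnitudes, for the real data.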
