Detecting single-feature polymorphisms using oligonucleotide arrays and robustified projection pursuit
Open Access
- 23 August 2005
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 21 (20), 3852-3858
- https://doi.org/10.1093/bioinformatics/bti640
Abstract
Motivation: Genomic DNA was hybridized to oligonucleotide microarrays to identify single-feature polymorphisms (SFP) for Arabidopsis, which has a genome size of ∼130 Mb. However, that method does not work well for organisms such as barley, with a much larger 5200 Mb genome. In the present study, we demonstrate SFP detection using a small number of replicate datasets and complex RNA as a surrogate for barley DNA. To identify single probes defining SFPs in the data, we developed a method using robustified projection pursuit (RPP). This method first evaluates, for each probe set, the overall differentiation of signal intensities between two genotypes and then measures the contribution of the individual probes within the probe set to the overall differentiation. Results: RNA from whole seedlings with and without dehydration stress provided ‘present’ calls for ∼75% of probe sets. Using triplicated data, among the 5% of ‘present’ probe sets identified as most likely to contain at least one SFP probe, at least 80% are correctly predicted. This was determined by direct sequencing of PCR amplicons derived from barley genomic DNA. Using a 5 percentile cutoff, we defined 2007 SFP probes contained in 1684 probe sets by combining three parental genotype comparisons: Steptoe versus Morex, Morex versus Barke and Oregon Wolfe Barley Dominant versus Recessive. Availability: The algorithm is available upon request from the corresponding author. Contact:xinping.cui@ucr.edu Supplementary Information:http://faculty.ucr.edu/~xpcuiKeywords
This publication has 15 references indexed in Scilit:
- Single-feature polymorphism discovery in the barley transcriptomeGenome Biology, 2005
- Simultaneous genotyping, gene-expression measurement, and detection of allele-specific expression with oligonucleotide arraysGenome Research, 2005
- A New Resource for Cereal Genomics: 22K Barley GeneChip Comes of AgePlant Physiology, 2004
- Solving the riddle of the bright mismatches: Labeling and effective binding in oligonucleotide arraysPhysical Review E, 2003
- Summaries of Affymetrix GeneChip probe level dataNucleic Acids Research, 2003
- Large-Scale Identification of Single-Feature Polymorphisms in Complex GenomesGenome Research, 2003
- Robust estimators for expression analysisBioinformatics, 2002
- Model-based analysis of oligonucleotide arrays: Expression index computation and outlier detectionProceedings of the National Academy of Sciences, 2000
- A view of plant dehydrins using antibodies specific to the carboxy terminal peptidePlant Molecular Biology, 1993
- Unmasking Multivariate Outliers and Leverage PointsJournal of the American Statistical Association, 1990