A single-array preprocessing method for estimating full-resolution raw copy numbers from all Affymetrix genotyping arrays including GenomeWideSNP 5 & 6
Open Access
- 17 June 2009
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 25 (17), 2149-2156
- https://doi.org/10.1093/bioinformatics/btp371
Abstract
Motivation: High-resolution copy-number (CN) analysis has in recent years gained much attention, not only for the purpose of identifying CN aberrations associated with a certain phenotype, but also for identifying CN polymorphisms. In order for such studies to be successful and cost effective, the statistical methods have to be optimized. We propose a single-array preprocessing method for estimating full-resolution total CNs. It is applicable to all Affymetrix genotyping arrays, including the recent ones that also contain non-polymorphic probes. A reference signal is only needed at the last step when calculating relative CNs. Results: As with our method for earlier generations of arrays, this one controls for allelic crosstalk, probe affinities and PCR fragment-length effects. Additionally, it also corrects for probe sequence effects and co-hybridization of fragments digested by multiple enzymes that takes place on the latest chips. We compare our method with Affymetrix's CN5 method and the dChip method by assessing how well they differentiate between various CN states at the full resolution and various amounts of smoothing. Although CRMA v2 is a single-array method, we observe that it performs as well as or better than alternative methods that use data from all arrays for their preprocessing. This shows that it is possible to do online analysis in large-scale projects where additional arrays are introduced over time. Availability: A bounded-memory implementation that can process any number of arrays is available in the open source R package aroma.affymetrix. Contact:hb@stat.berkeley.edu Supplementary information: Supplementary data are available at Bioinformatics online.Keywords
This publication has 18 references indexed in Scilit:
- A single-sample method for normalizing and combining full-resolution copy numbers from multiple platforms, labs and analysis methodsBioinformatics, 2009
- High-resolution mapping of copy-number alterations with massively parallel sequencingNature Methods, 2008
- Estimation and assessment of raw copy numbers at the single locus levelBioinformatics, 2008
- Free energy of DNA duplex formation on short oligonucleotide microarraysNucleic Acids Research, 2006
- Global variation in copy number in the human genomeNature, 2006
- A haplotype map of the human genomeNature, 2005
- A Model-Based Background Adjustment for Oligonucleotide Expression ArraysJournal of the American Statistical Association, 2004
- Sensitivity of Microarray Oligonucleotide Probes: Variability and Effect of Base CompositionThe Journal of Physical Chemistry B, 2004
- The International HapMap ProjectNature, 2003
- Solving the riddle of the bright mismatches: Labeling and effective binding in oligonucleotide arraysPhysical Review E, 2003