PICNIC: an algorithm to predict absolute allelic copy number variation with microarray cancer data
Open Access
- 15 October 2009
- journal article
- research article
- Published by Oxford University Press (OUP) in Biostatistics
- Vol. 11 (1), 164-175
- https://doi.org/10.1093/biostatistics/kxp045
Abstract
High-throughput oligonucleotide microarrays are commonly employed to investigate genetic disease, including cancer. The algorithms employed to extract genotypes and copy number variation function optimally for diploid genomes usually associated with inherited disease. However, cancer genomes are aneuploid in nature leading to systematic errors when using these techniques. We introduce a preprocessing transformation and hidden Markov model algorithm bespoke to cancer. This produces genotype classification, specification of regions of loss of heterozygosity, and absolute allelic copy number segmentation. Accurate prediction is demonstrated with a combination of independent experimental techniques. These methods are exemplified with affymetrix genome-wide SNP6.0 data from 755 cancer cell lines, enabling inference upon a number of features of biological interest. These data and the coded algorithm are freely available for download.Keywords
This publication has 35 references indexed in Scilit:
- Integrated genotype calling and association analysis of SNPs, common copy number polymorphisms and rare CNVsNature Genetics, 2008
- Estimating Genome-Wide Copy Number Using Allele-Specific Mixture ModelsJournal of Computational Biology, 2008
- Array painting reveals a high frequency of balanced translocations in breast cancer cell lines that break in cancer-relevant genesOncogene, 2007
- Characterizing the cancer genome in lung adenocarcinomaNature, 2007
- PennCNV: An integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping dataGenome Research, 2007
- Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controlsNature, 2007
- QuantiSNP: an Objective Bayes Hidden-Markov Model to detect and accurately map copy number variation using SNP genotyping dataNucleic Acids Research, 2007
- Genotyping and annotation of Affymetrix SNP arraysNucleic Acids Research, 2006
- High-resolution genomic profiling of chromosomal aberrations using Infinium whole-genome genotypingGenome Research, 2006
- A tutorial on hidden Markov models and selected applications in speech recognitionProceedings of the IEEE, 1989