A robust statistical method for case-control association testing with copy number variation

7 September 2008

journal article
research article
Published by Springer Nature in Nature Genetics

Vol. 40 (10), 1245-1252
https://doi.org/10.1038/ng.206

Abstract

Matt Hurles and colleagues present a general statistical framework for copy number variation (CNV) association tests in a case-control study design. They show that existing strategies for CNV association with binary disease phenotypes are complicated by differential errors and poor clustering quality. Here they report new methods, robust to these factors, which apply likelihood ratio testing to constrained Gaussian mixture models of quantitative CNV signals in cases and controls. Their methods are assay and platform independent, and implemented in freely available CNVtools software. Copy number variation (CNV) is pervasive in the human genome and can play a causal role in genetic diseases. The functional impact of CNV cannot be fully captured through linkage disequilibrium with SNPs. These observations motivate the development of statistical methods for performing direct CNV association studies. We show through simulation that current tests for CNV association are prone to false-positive associations in the presence of differential errors between cases and controls, especially if quantitative CNV measurements are noisy. We present a statistical framework for performing case-control CNV association studies that applies likelihood ratio testing of quantitative CNV measurements in cases and controls. We show that our methods are robust to differential errors and noisy data and can achieve maximal theoretical power. We illustrate the power of these methods for testing for association with binary and quantitative traits, and have made this software available as the R package CNVtools.

Keywords

This publication has 23 references indexed in Scilit:

Psoriasis is associated with increased β-defensin genomic copy number
Nature Genetics, 2007
Copy-number variation and association studies of human disease
Nature Genetics, 2007
A new multipoint method for genome-wide association studies by imputation of genotypes
Nature Genetics, 2007
Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls
Nature, 2007
Gene Copy-Number Variation and Associated Polymorphisms of Complement Component C4 in Human Systemic Lupus Erythematosus (SLE): Low Copy Number Is a Risk Factor for and High Copy Number Is a Protective Factor against SLE Susceptibility in European Americans
American Journal of Human Genetics, 2007
Global variation in copy number in the human genome
Nature, 2006
A Chromosome 8 Gene-Cluster Polymorphism with Low Human Beta-Defensin 2 Gene Copy Number Predisposes to Crohn Disease of the Colon
American Journal of Human Genetics, 2006
Copy number polymorphism in Fcgr3 predisposes to glomerulonephritis in rats and humans
Nature, 2006
Fine-scale structural variation of the human genome
Nature Genetics, 2005
High frequencies of α-thalassaemia are the result of natural selection by malaria
Nature, 1986

Cited by 151 articles