Accurate Discovery of Expression Quantitative Trait Loci Under Confounding From Spurious and Genuine Regulatory Hotspots
- 1 December 2008
- journal article
- research article
- Published by Oxford University Press (OUP) in Genetics
- Vol. 180 (4), 1909-1925
- https://doi.org/10.1534/genetics.108.094201
Abstract
In genomewide mapping of expression quantitative trait loci (eQTL), it is widely believed that thousands of genes are trans-regulated by a small number of genomic regions called “regulatory hotspots,” resulting in “trans-regulatory bands” in an eQTL map. As several recent studies have demonstrated, technical confounding factors such as batch effects can complicate eQTL analysis by causing many spurious associations including spurious regulatory hotspots. Yet little is understood about how these technical confounding factors affect eQTL analyses and how to correct for these factors. Our analysis of data sets with biological replicates suggests that it is this intersample correlation structure inherent in expression data that leads to spurious associations between genetic loci and a large number of transcripts inducing spurious regulatory hotspots. We propose a statistical method that corrects for the spurious associations caused by complex intersample correlation of expression measurements in eQTL mapping. Applying our intersample correlation emended (ICE) eQTL mapping method to mouse, yeast, and human identifies many more cis associations while eliminating most of the spurious trans associations. The concordances of cis and trans associations have consistently increased between different replicates, tissues, and populations, demonstrating the higher accuracy of our method to identify real genetic effects.Keywords
This publication has 52 references indexed in Scilit:
- Integrating large-scale functional genomic data to dissect the complexity of yeast regulatory networksNature Genetics, 2008
- SNPs matter: impact on detection of differential expressionNature Methods, 2007
- Regulatory network construction in Arabidopsis by using genome-wide gene expression quantitative trait lociProceedings of the National Academy of Sciences, 2007
- Common genetic variants account for differences in gene expression among ethnic groupsNature Genetics, 2007
- Identifying regulatory mechanisms using individual variation reveals key role for chromatin modificationProceedings of the National Academy of Sciences, 2006
- Principal components analysis corrects for stratification in genome-wide association studiesNature Genetics, 2006
- A unified mixed-model method for association mapping that accounts for multiple levels of relatednessNature Genetics, 2005
- Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profilesProceedings of the National Academy of Sciences, 2005
- The International HapMap ProjectNature, 2003
- Trans-acting regulatory variation in Saccharomyces cerevisiae and the role of transcription factorsNature Genetics, 2003