Understanding mechanisms underlying human gene expression variation with RNA sequencing
Top Cited Papers
- 10 March 2010
- journal article
- research article
- Published by Springer Nature in Nature
- Vol. 464 (7289), 768-772
- https://doi.org/10.1038/nature08872
Abstract
There is currently much interest in the understanding of genetic mechanisms that underlie variation at the gene expression level. Two groups reporting in this issue of Nature use RNA sequencing to study global gene expression in two contrasting populations. Pickrell et al. sequenced RNA from 69 lymphoblastoid cell lines derived from unrelated Nigerian individuals who have been extensively genotyped as part of the HapMap Project. By pooling data from all the individuals it was possible to identify many genetic determinants of variation in gene expression. Montgomery et al. characterize the mRNA fraction of RNA isolated from lymphoblastoid cell lines derived from 63 HapMap individuals of Caucasian origin. They obtain a fine-scale view of the transcriptome and identify genetic variants that affect alternative splicing. There is much interest in understanding the genetic mechanisms that underlie individual variations in gene expression. Here, RNA sequencing has been used to study gene expression in lymphoblastoid cell lines derived from Nigerian individuals for whom extensive genotype information is known. Numerous genetic determinants of variation in gene expression were identified, including variation in transcription, splicing and allele-specific expression. Understanding the genetic mechanisms underlying natural variation in gene expression is a central goal of both medical and evolutionary genetics, and studies of expression quantitative trait loci (eQTLs) have become an important tool for achieving this goal1. Although all eQTL studies so far have assayed messenger RNA levels using expression microarrays, recent advances in RNA sequencing enable the analysis of transcript variation at unprecedented resolution. We sequenced RNA from 69 lymphoblastoid cell lines derived from unrelated Nigerian individuals that have been extensively genotyped by the International HapMap Project2. By pooling data from all individuals, we generated a map of the transcriptional landscape of these cells, identifying extensive use of unannotated untranslated regions and more than 100 new putative protein-coding exons. Using the genotypes from the HapMap project, we identified more than a thousand genes at which genetic variation influences overall expression levels or splicing. We demonstrate that eQTLs near genes generally act by a mechanism involving allele-specific expression, and that variation that influences the inclusion of an exon is enriched within and near the consensus splice sites. Our results illustrate the power of high-throughput sequencing for the joint analysis of variation in transcription, splicing and allele-specific expression across individuals.Keywords
This publication has 31 references indexed in Scilit:
- Transcriptome genetics using second generation sequencing in a Caucasian populationNature, 2010
- Widespread Shortening of 3′UTRs by Alternative Cleavage and Polyadenylation Activates Oncogenes in Cancer CellsCell, 2009
- Alternative isoform regulation in human tissue transcriptomesNature, 2008
- Mapping and quantifying mammalian transcriptomes by RNA-SeqNature Methods, 2008
- A second generation human haplotype map of over 3.1 million SNPsNature, 2007
- Population genomics of human gene expressionNature Genetics, 2007
- Genetic variants regulating ORMDL3 expression contribute to the risk of childhood asthmaNature, 2007
- Genetics of global gene expressionNature Reviews Genetics, 2006
- Systematic discovery of regulatory motifs in human promoters and 3′ UTRs by comparison of several mammalsNature, 2005
- Natural variation in human gene expression assessed in lymphoblastoid cellsNature Genetics, 2003