Global discovery of primate-specific genes in the human genome
- 21 July 2009
- journal article
- research article
- Published in Proceedings of the National Academy of Sciences
- Vol. 106 (29), 12019-12024
- https://doi.org/10.1073/pnas.0904569106
Abstract
The genomic basis of primate phenotypic uniqueness remains obscure, despite increasing genome and transcriptome sequence data availability. Although factors such as segmental duplications and positive selection have received much attention as potential drivers of primate phenotypes, single-copy primate-specific genes are poorly characterized. To discover such genes genomewide, we screened a catalog of 38,037 human transcriptional units (TUs), compiled from EST and cDNA sequences in conjunction with the FANTOM3 transcriptome project. We identified 131 TUs from transcribed sequences residing within primate-specific insertions in 9-species sequence alignments and outside of segmental duplications. Exons of 120 (92%) of the TUs contained interspersed repeats, indicating that repeat insertions may have contributed to primate-specific gene genesis. Fifty-nine (46%) primate-specific TUs may encode proteins. Although primate-specific TU transcript lengths were comparable to known human gene mRNA lengths overall, 92 (70%) primate-specific TUs were single-exon. Thirty-two (24%) primate-specific TUs were localized to subtelomeric and pericentromeric regions. Forty (31%) of the TUs were nested in introns of known genes, indicating that primate-specific TUs may arise within older, protein-coding regions. Primate-specific TUs were preferentially expressed in reproductive organs and tissues (P < 0.011), consistent with the expectation that emergence of new, lineage-specific genes may accompany speciation or reproduction. Of the 33 primate-specific TUs with human Affymetrix microarray probe support, 21 were differentially expressed in human teratozoospermia. In addition to elucidating the likely functional relevance of primate-specific TUs to reproduction, we present a set of primate-specific genes for future functional studies, and we implicate nonduplicated pericentromeric and subtelomeric regions in gene genesis.
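The screening criterion stated in the abstract (a TU must reside within a primate-specific insertion and outside of segmental duplications) can be illustrated with a minimal interval filter. This is only a sketch under stated assumptions, not the authors' pipeline: it assumes the primate-specific insertions and segmental duplications have already been reduced to genomic intervals, and the names `Interval`, `contains`, `overlaps`, `screen_tus`, and all coordinates are hypothetical toy values.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Interval:
    """A genomic interval; coordinates are 0-based, half-open (assumed convention)."""
    chrom: str
    start: int
    end: int


def contains(outer: Interval, inner: Interval) -> bool:
    """True if `inner` lies entirely within `outer` on the same chromosome."""
    return (outer.chrom == inner.chrom
            and outer.start <= inner.start
            and inner.end <= outer.end)


def overlaps(a: Interval, b: Interval) -> bool:
    """True if `a` and `b` share at least one base on the same chromosome."""
    return a.chrom == b.chrom and a.start < b.end and b.start < a.end


def screen_tus(tus, insertions, seg_dups):
    """Keep TUs inside some insertion and overlapping no segmental duplication.

    `tus` maps a TU name to its Interval; the other arguments are lists of
    Intervals. Returns the retained TU names in sorted order.
    """
    kept = []
    for name, tu in tus.items():
        in_insertion = any(contains(ins, tu) for ins in insertions)
        in_seg_dup = any(overlaps(sd, tu) for sd in seg_dups)
        if in_insertion and not in_seg_dup:
            kept.append(name)
    return sorted(kept)


# Toy data (hypothetical, for illustration only).
tus = {
    "TU_a": Interval("chr1", 100, 200),  # inside an insertion, no duplication
    "TU_b": Interval("chr1", 500, 600),  # inside an insertion, but duplicated
    "TU_c": Interval("chr2", 50, 80),    # not inside any insertion
}
insertions = [Interval("chr1", 50, 250), Interval("chr1", 450, 650)]
seg_dups = [Interval("chr1", 550, 700)]

print(screen_tus(tus, insertions, seg_dups))  # ['TU_a']
```

The linear scan over all intervals is chosen for readability; at the scale of the 38,037 TUs mentioned in the abstract, an interval tree or sorted sweep would be the more practical choice.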