The relationship between non‐protein‐coding DNA and eukaryotic complexity
- 12 February 2007
- Vol. 29 (3), 288-299
- https://doi.org/10.1002/bies.20544
Abstract
There are two intriguing paradoxes in molecular biology: the inconsistent relationship between organismal complexity and (1) cellular DNA content and (2) the number of protein‐coding genes, referred to as the C‐value and G‐value paradoxes, respectively. The C‐value paradox may be largely explained by varying ploidy. The G‐value paradox is more problematic, as the extent of protein coding sequence remains relatively static over a wide range of developmental complexity. We show by analysis of sequenced genomes that the relative amount of non‐protein‐coding sequence increases consistently with complexity. We also show that the distribution of introns in complex organisms is non‐random. Genes composed of large amounts of intronic sequence are significantly overrepresented amongst genes that are highly expressed in the nervous system, and amongst genes downregulated in embryonic stem cells and cancers. We suggest that the informational paradox in complex organisms may be explained by the expansion of cis‐acting regulatory elements and genes specifying trans‐acting non‐protein‐coding RNAs. BioEssays 29: 288–299, 2007.