Grammatical model of the regulation of gene expression.
- 15 October 1992
- journal article
- Published by Proceedings of the National Academy of Sciences in Proceedings of the National Academy of Sciences
- Vol. 89 (20), 9405-9409
- https://doi.org/10.1073/pnas.89.20.9405
Abstract
Based on a formal proof that justifies the search for generative grammars in the study of gene regulation, a linguistic formalization of an exhaustive data base of Escherichia coli sigma 70 promoters and their regulatory binding sites has been initiated. The grammar presented here generates all the arrays of the collection plus those that are predicted as consistent with the principles of regulation of sigma 70 promoters. "Systems of regulation," sets of regulatory sites that collaborate in a mechanism of regulation, are represented by means of syntactic categories. A small set of phrase structure rules restricted by an X-bar principle and by a hierarchical, c-command relation generates a representation of arrays of sites of regulation where the selection of the protein(s) identifying the system(s) of regulation occurs. Based on the features of the proteins, optional duplicated proximal and remote sites are generated by means of transformational rules. Consistency with the data, the predictions that the grammar generates, and important similarities and differences with some aspects of the generative theory of natural language are discussed.Keywords
This publication has 9 references indexed in Scilit:
- Heterologous cooperativity in Escherichia coli. The CytR repressor both contacts DNA and the cAMP receptor protein when binding to the deoP2 promoter.Journal of Biological Chemistry, 1991
- Control site location and transcriptional regulation in Escherichia coli.1991
- The search for a grammatical theory of gene regulation is formally justified by showing the inadequacy of context-free grammarsBioinformatics, 1991
- A syntactic representation of units of genetic information—A syntax of units of genetic informationJournal of Theoretical Biology, 1991
- Reconstructionist molecular biology.1991
- Stringent spacing requirements for transcription activation by CRPCell, 1990
- A complex nucleoprotein structure involved in activation of transcription of two divergent Escherichia coli promotersJournal of Molecular Biology, 1989
- Formal language theory and DNA: An analysis of the generative capacity of specific recombinant behaviorsBulletin of Mathematical Biology, 1987
- Mechanism of action of the lexA gene product.Proceedings of the National Academy of Sciences, 1981