Application of machine learning to structural molecular biology
- 29 June 1994
- journal article
- Published by The Royal Society in Philosophical Transactions Of The Royal Society B-Biological Sciences
- Vol. 344 (1310), 365-371
- https://doi.org/10.1098/rstb.1994.0075
Abstract
A technique of machine learning, inductive logic programming implemented in the program GOLEM, has been applied to three problems in structural molecular biology. These problems are: the prediction of protein secondary structure; the identification of rules governing the arrangement of β-sheet strands in the tertiary folding of proteins; and the modelling of a quantitative structure-activity relationship (QSAR) of a series of drugs. For secondary structure prediction and the QSAR, GOLEM yielded predictions comparable with contemporary approaches including neural networks. Rules for β-strand arrangement are derived and it is planned to contrast their accuracy with those obtained by human inspection. In all three studies GOLEM discovered rules that provided insight into the stereochemistry of the system. We conclude that machine learning, used together with human intervention, will provide a powerful tool to discover patterns in biological sequences and structures.