Sesam: A relational database for structure and sequence of macromolecules
- 1 September 1991
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 11 (1), 59-76
- https://doi.org/10.1002/prot.340110108
Abstract
A system is described that provides ways of integrating data on protein structure, sequence, and survey results, with molecular graphics and molecular mechanics software. Its major component is the relational database SESAM, presently implemented under the commercial package SYBASE. By desin, the database allows full integration—within the same data organization—of raw data on protein structure, sequence, ligands, and heterogroups, obtained from the Brookhaven Protein Databank, with pure sequence information available from other databanks such as SWISS-PROT. It contains in addition higher level descriptions of structural and topological properties, as well as survey results, obtained by executing specialized computer programs. Aside from the very useful attribute of closely combining structural and nonstructural information, other important features distinguish it from analogous systems developed elsewhere. It includes a molecular dictionary with complete description of geometric properties and energy parameters used in modeling and conformational energy calculations. Using this dictionary, structural data are validated by checking for localized inconsistencies in atomic coordinates, atomic symbols, chirality definitions, and flagging errors and incomplete entries. Because of both the dictionary and the validation procedures, SESAM can be readily interfaced with conventional molecular graphics and mechanics software packages, or with other specialized application programs. With the aid of appropriate interfaces, data access is sufficiently fast for SESAM to be interrogated interactively. Prototypes of user interfaces, as well as an interface with the molecular graphics package BRUGEL, are described and the power of the system is illustrated in applications such as homology-based protein modeling, computer-aided protein design, protein structure predictions, analysis of local structure motifs, and of relationships between protein sequence and structure.Keywords
This publication has 49 references indexed in Scilit:
- Between objectivity and subjectivityNature, 1990
- Construction of side-chains in homology modellingJournal of Molecular Biology, 1989
- Reshaping the antibody combining site by CDR replacement—tailoring or tinkering to fit?Protein Engineering, Design and Selection, 1988
- Knowledge-based prediction of protein structures and the design of novel moleculesNature, 1987
- The Dynamics of ProteinsScientific American, 1986
- Construction of a model for the three-dimensional structure of human renal renin.Hypertension, 1985
- Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical featuresBiopolymers, 1983
- The functional data model and the data languages DAPLEXACM Transactions on Database Systems, 1981
- Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteinsJournal of Molecular Biology, 1978
- The protein data bank: A computer-based archival file for macromolecular structuresJournal of Molecular Biology, 1977