Sesam: A relational database for structure and sequence of macromolecules

1 September 1991

journal article
research article
Published by Wiley in Proteins-Structure Function and Bioinformatics

Vol. 11 (1), 59-76
https://doi.org/10.1002/prot.340110108

Abstract

A system is described that provides ways of integrating data on protein structure, sequence, and survey results, with molecular graphics and molecular mechanics software. Its major component is the relational database SESAM, presently implemented under the commercial package SYBASE. By desin, the database allows full integration—within the same data organization—of raw data on protein structure, sequence, ligands, and heterogroups, obtained from the Brookhaven Protein Databank, with pure sequence information available from other databanks such as SWISS-PROT. It contains in addition higher level descriptions of structural and topological properties, as well as survey results, obtained by executing specialized computer programs. Aside from the very useful attribute of closely combining structural and nonstructural information, other important features distinguish it from analogous systems developed elsewhere. It includes a molecular dictionary with complete description of geometric properties and energy parameters used in modeling and conformational energy calculations. Using this dictionary, structural data are validated by checking for localized inconsistencies in atomic coordinates, atomic symbols, chirality definitions, and flagging errors and incomplete entries. Because of both the dictionary and the validation procedures, SESAM can be readily interfaced with conventional molecular graphics and mechanics software packages, or with other specialized application programs. With the aid of appropriate interfaces, data access is sufficiently fast for SESAM to be interrogated interactively. Prototypes of user interfaces, as well as an interface with the molecular graphics package BRUGEL, are described and the power of the system is illustrated in applications such as homology-based protein modeling, computer-aided protein design, protein structure predictions, analysis of local structure motifs, and of relationships between protein sequence and structure.

Keywords

This publication has 49 references indexed in Scilit:

Between objectivity and subjectivity
Nature, 1990
Construction of side-chains in homology modelling
Journal of Molecular Biology, 1989
Reshaping the antibody combining site by CDR replacement—tailoring or tinkering to fit?
Protein Engineering, Design and Selection, 1988
Knowledge-based prediction of protein structures and the design of novel molecules
Nature, 1987
The Dynamics of Proteins
Scientific American, 1986
Construction of a model for the three-dimensional structure of human renal renin.
Hypertension, 1985
Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features
Biopolymers, 1983
The functional data model and the data languages DAPLEX
ACM Transactions on Database Systems, 1981
Analysis of the accuracy and implications of simple methods for predicting the secondary structure of globular proteins
Journal of Molecular Biology, 1978
The protein data bank: A computer-based archival file for macromolecular structures
Journal of Molecular Biology, 1977

Cited by 28 articles