The SUPERFAMILY database in 2007: families and functions
Open Access
- 10 November 2006
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 35 (Database), D308-D313
- https://doi.org/10.1093/nar/gkl910
Abstract
The SUPERFAMILY database provides protein domain assignments, at the SCOP ‘superfamily’ level, for the predicted protein sequences in over 400 completed genomes. A superfamily groups together domains of different families which have a common evolutionary ancestor based on structural, functional and sequence data. SUPERFAMILY domain assignments are generated using an expert-curated set of profile hidden Markov models. All models and structural assignments are available for browsing and download from the authors' web page. The web interface includes services such as domain architectures and alignment details for all protein assignments, searchable domain combinations, domain occurrence network visualization, detection of over- or under-represented superfamilies for a given genome by comparison with other genomes, assignment of manually submitted sequences and keyword searches. In this update we describe the SUPERFAMILY database and outline two major developments: (i) incorporation of family level assignments and (ii) a superfamily-level functional annotation. The SUPERFAMILY database can be used for general protein evolution and superfamily-specific studies, genomic annotation, and structural genomics target suggestion and assessment.