CDD: specific functional annotation with the Conserved Domain Database

Open Access

1 January 2009

journal article
Published by Oxford University Press (OUP) in Nucleic Acids Research

Vol. 37 (Database), D205-D210
https://doi.org/10.1093/nar/gkn845

Abstract

NCBI's Conserved Domain Database (CDD) is a collection of multiple sequence alignments and derived database search models, which represent protein domains conserved in molecular evolution. The collection can be accessed at http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml, and is also part of NCBI's Entrez query and retrieval system, cross-linked to numerous other resources. CDD provides annotation of domain footprints and conserved functional sites on protein sequences. Precalculated domain annotation can be retrieved for protein sequences tracked in NCBI's Entrez system, and CDD's collection of models can be queried with novel protein sequences via the CD-Search service at http://www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi. Starting with the latest version of CDD, v2.14, information from redundant and homologous domain models is summarized at a superfamily level, and domain annotation on proteins is flagged as either 'specific' (identifying molecular function with high confidence) or as 'non-specific' (identifying superfamily membership only).
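
The superfamily-level summary and the 'specific' / 'non-specific' flag described above can be pictured with a small data-structure sketch. This is only an illustration under stated assumptions, not CDD's actual procedure: the `DomainHit` record, its field names, the identifiers in the example, and the tie-breaking rule (prefer a specific hit, then the longer footprint) are all hypothetical and are not taken from the paper or from any NCBI API.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    """One domain annotation on a protein (hypothetical record layout)."""
    model: str        # made-up domain model identifier
    superfamily: str  # made-up superfamily identifier
    start: int        # footprint start on the protein sequence
    end: int          # footprint end on the protein sequence
    specific: bool    # True = 'specific', False = 'non-specific'


def label(hit: DomainHit) -> str:
    """Return the flag wording used in the abstract."""
    return "specific" if hit.specific else "non-specific"


def summarize_by_superfamily(hits):
    """Collapse hits from related models to one representative per superfamily.

    Assumed rule for this sketch only: prefer a specific hit, then the
    hit with the longer footprint.
    """
    grouped = defaultdict(list)
    for hit in hits:
        grouped[hit.superfamily].append(hit)
    return {
        superfamily: max(members, key=lambda h: (h.specific, h.end - h.start))
        for superfamily, members in grouped.items()
    }


# Made-up example: two related models hit the same region, one hits elsewhere.
example_hits = [
    DomainHit("model_A", "sf_1", 10, 120, specific=False),
    DomainHit("model_B", "sf_1", 15, 110, specific=True),
    DomainHit("model_C", "sf_2", 200, 300, specific=False),
]

summary = summarize_by_superfamily(example_hits)
for superfamily, hit in sorted(summary.items()):
    print(superfamily, hit.model, label(hit))
```

In this toy input, the two overlapping models in `sf_1` collapse to a single annotation, while `sf_2` keeps its only hit and is reported as superfamily membership alone.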

Cited by 944 articles