New developments in the InterPro database
Open Access
- 3 January 2007
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 35 (Database), D224-D228
- https://doi.org/10.1093/nar/gkl841
Abstract
InterPro is an integrated resource for protein families, domains and functional sites, which integrates the following protein signature databases: PROSITE, PRINTS, ProDom, Pfam, SMART, TIGRFAMs, PIRSF, SUPERFAMILY, Gene3D and PANTHER. The latter two new member databases have been integrated since the last publication in this journal. There have been several new developments in InterPro, including an additional reading field, new database links, extensions to the web interface and additional match XML files. InterPro has always provided matches to UniProtKB proteins on the website and in the match XML file on the FTP site. Additional matches to proteins in UniParc (UniProt archive) are now available for download in the new match XML files only. The latest InterPro release (13.0) contains more than 13 000 entries, covering over 78% of all proteins in UniProtKB. The database is available for text- and sequence-based searches via a webserver (http://www.ebi.ac.uk/interpro), and for download by anonymous FTP (ftp://ftp.ebi.ac.uk/pub/databases/interpro). The InterProScan search tool is now also available via a web service at http://www.ebi.ac.uk/Tools/webservices/WSInterProScan.html.
This publication has 22 references indexed in Scilit:
- Gene3D: modelling protein structure, function and evolution. Nucleic Acids Research, 2006
- SMART 5: domains in the context of genomes and networks. Nucleic Acids Research, 2006
- The PROSITE database. Nucleic Acids Research, 2006
- Pfam: clans, web tools and services. Nucleic Acids Research, 2006
- InterPro, progress and status in 2005. Nucleic Acids Research, 2004
- The ProDom database of protein domain families: more emphasis on 3D. Nucleic Acids Research, 2004
- PIRSF: family classification system at the Protein Information Resource. Nucleic Acids Research, 2004
- The TIGRFAMs database of protein families. Nucleic Acids Research, 2003
- PRINTS and its automatic supplement, prePRINTS. Nucleic Acids Research, 2003
- Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structure. Journal of Molecular Biology, 2001