The InterPro database, an integrated documentation resource for protein families, domains and functional sites
Top Cited Papers
- 1 January 2001
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 29 (1), 37-40
- https://doi.org/10.1093/nar/29.1.37
Abstract
Signature databases are vital tools for identifying distant relationships in novel sequences and hence for inferring protein function. InterPro is an integrated documentation resource for protein families, domains and functional sites, which amalgamates the efforts of the PROSITE, PRINTS, Pfam and ProDom database projects. Each InterPro entry includes a functional description, annotation, literature references and links back to the relevant member database(s). Release 2.0 of InterPro (October 2000) contains over 3000 entries, representing families, domains, repeats and sites of post-translational modification encoded by a total of 6804 different regular expressions, profiles, fingerprints and Hidden Markov Models. Each InterPro entry lists all the matches against;SWISS-PROT and TrEMBL (more than 1 000 000 hits from 462 500 proteins in SWISS-PROT and TrEMBL). The database is accessible for text- and sequence-based searches at http://www.ebi.ac.uk/interpro/. Questions can be emailed to interhelp@ebi.ac.uk.Keywords
This publication has 13 references indexed in Scilit:
- Comparative Genomics of the EukaryotesScience, 2000
- The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000Nucleic Acids Research, 2000
- ProDom and ProDom-CG: tools for protein domain analysis and whole genome comparisonsNucleic Acids Research, 2000
- The Pfam Protein Families DatabaseNucleic Acids Research, 2000
- Increased coverage of protein families with the Blocks Database serversNucleic Acids Research, 2000
- PRINTS-S: the database formerly known as PRINTSNucleic Acids Research, 2000
- A novel method for automatic functional annotation of proteins.Bioinformatics, 1999
- SMART, a simple modular architecture research tool: Identification of signaling domainsProceedings of the National Academy of Sciences, 1998
- Efficient discovery of conserved patterns using a pattern graph.Bioinformatics, 1997
- [8] SRS: Information retrieval system for molecular biology data banksMethods in Enzymology, 1996