Using OrthoMCL to Assign Proteins to OrthoMCL‐DB Groups or to Cluster Proteomes Into New Ortholog Groups
Open Access
- 1 September 2011
- journal article
- unit
- Published by Wiley in Current Protocols in Bioinformatics
- Vol. 35 (1), 6.12.1-6.12.19
- https://doi.org/10.1002/0471250953.bi0612s35
Abstract
OrthoMCL is an algorithm for grouping proteins into ortholog groups based on their sequence similarity. OrthoMCL‐DB is a public database that allows users to browse and view ortholog groups that were pre‐computed using the OrthoMCL algorithm. Version 4 of this database contained 116,536 ortholog groups clustered from 1,270,853 proteins obtained from 88 eukaryotic genomes, 16 archaean genomes, and 34 bacterial genomes. Future versions of OrthoMCL‐DB will include more proteomes as more genomes are sequenced. Here, we describe how you can group your proteins of interest into ortholog clusters using two different means provided by the OrthoMCL system. The OrthoMCL‐DB Web site has a tool for uploading and grouping a set of protein sequences, typically representing a proteome. This method maps the uploaded proteins to existing groups in OrthoMCL‐DB. Alternatively, if you have proteins from a set of genomes that need to be grouped, you can download, install, and run the stand‐alone OrthoMCL software. Curr. Protoc. Bioinform. 35:6.12.1‐6.12.19. © 2011 by John Wiley & Sons, Inc.Keywords
This publication has 9 references indexed in Scilit:
- Assessing Performance of Orthology Detection Strategies Applied to Eukaryotic GenomesPLOS ONE, 2007
- OrthoMCL-DB: querying a comprehensive multi-species collection of ortholog groupsNucleic Acids Research, 2006
- Genome sequence of the enterobacterial phytopathogen Erwinia carotovora subsp atroseptica and characterization of virulence factorsProceedings of the National Academy of Sciences of the United States of America, 2004
- The Pfam protein families databaseNucleic Acids Research, 2004
- pkn22 (alr2502) encoding a putative Ser/Thr kinase in the cyanobacterium Anabaena sp. PCC 7120 is induced by both iron starvation and oxidative stress and regulates the expression of isiAFEBS Letters, 2003
- OrthoMCL: Identification of Ortholog Groups for Eukaryotic GenomesGenome Research, 2003
- An efficient algorithm for large-scale detection of protein familiesNucleic Acids Research, 2002
- Gene Ontology: tool for the unification of biologyNature Genetics, 2000
- Basic local alignment search toolJournal of Molecular Biology, 1990