IMG ER: a system for microbial genome annotation expert review and curation
Open Access
- 27 June 2009
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 25 (17), 2271-2278
- https://doi.org/10.1093/bioinformatics/btp393
Abstract
Motivation: A rapidly increasing number of microbial genomes are sequenced by organizations worldwide and are eventually included into various public genome data resources. The quality of the annotations depends largely on the original dataset providers, with erroneous or incomplete annotations often carried over into the public resources and difficult to correct.

Results: We have developed an Expert Review (ER) version of the Integrated Microbial Genomes (IMG) system, with the goal of supporting systematic and efficient revision of microbial genome annotations. IMG ER provides tools for the review and curation of annotations of both new and publicly available microbial genomes within IMG's rich integrated genome framework. New genome datasets are included into IMG ER prior to their public release either with their native annotations or with annotations generated by IMG ER's annotation pipeline. IMG ER tools allow addressing annotation problems detected with IMG's comparative analysis tools, such as genes missed by gene prediction pipelines or genes without an associated function. Over the past year, IMG ER was used for improving the annotations of about 150 microbial genomes.

Contact: vmmarkowitz@lbl.gov

Supplementary information: Supplementary data are available at Bioinformatics online.

This publication has 26 references indexed in Scilit.