Annotating the human genome with Disease Ontology
Open Access
- 7 July 2009
- journal article
- research article
- Published by Springer Nature in BMC Genomics
- Vol. 10 (S1), 1-8
- https://doi.org/10.1186/1471-2164-10-s1-s6
Abstract
Background: The human genome has been extensively annotated with Gene Ontology for biological functions, but minimally computationally annotated for diseases. Results: We used the Unified Medical Language System (UMLS) MetaMap Transfer tool (MMTx) to discover gene-disease relationships from the GeneRIF database. We utilized a comprehensive subset of UMLS, which is disease-focused and structured as a directed acyclic graph (the Disease Ontology), to filter and interpret results from MMTx. The results were validated against the Homayouni gene collection using recall and precision measurements. We compared our results with the widely used Online Mendelian Inheritance in Man (OMIM) annotations. Conclusion: The validation data set suggests a 91% recall rate and 97% precision rate of disease annotation using GeneRIF, in contrast with a 22% recall and 98% precision using OMIM. Our thesaurus-based approach allows for comparisons to be made between disease containing databases and allows for increased accuracy in disease identification through synonym matching. The much higher recall rate of our approach demonstrates that annotating human genome with Disease Ontology and GeneRIF for diseases dramatically increases the coverage of the disease annotation of human genome.Keywords
This publication has 23 references indexed in Scilit:
- Inherited disorder phenotypes: controlled annotation and statistical analysis for knowledge mining from gene listsBMC Bioinformatics, 2005
- Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profilesProceedings of the National Academy of Sciences, 2005
- GFINDer: genetic disease and phenotype location statistical analysis and mining of dynamically annotated gene listsNucleic Acids Research, 2005
- The Mammalian Phenotype Ontology as a tool for annotating, analyzing and comparing phenotypic informationGenome Biology, 2004
- Using literature-based discovery to identify disease candidate genesInternational Journal of Medical Informatics, 2004
- The Genetic Association DatabaseNature Genetics, 2004
- Mining the Biomedical Literature in the Genomic Era: An OverviewJournal of Computational Biology, 2003
- Analysis of Genomic and Proteomic Data Using Advanced Literature MiningJournal of Proteome Research, 2003
- DAVID: Database for Annotation, Visualization, and Integrated DiscoveryGenome Biology, 2003
- Gene Ontology: tool for the unification of biologyNature Genetics, 2000