The outcomes of pathway database computations depend on pathway ontology
Open Access
- 1 January 2006
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 34 (13), 3687-3697
- https://doi.org/10.1093/nar/gkl438
Abstract
Different biological notions of pathways are used in different pathway databases. Those pathway ontologies significantly impact pathway computations. Computational users of pathway databases will obtain different results depending on the pathway ontology used by the databases they employ, and different pathway ontologies are preferable for different end uses. We explore differences in pathway ontologies by comparing the BioCyc and KEGG ontologies. The BioCyc ontology defines a pathway as a conserved, atomic module of the metabolic network of a single organism, i.e. often regulated as a unit, whose boundaries are defined at high-connectivity stable metabolites. KEGG pathways are on average 4.2 times larger than BioCyc pathways, and combine multiple biological processes from different organisms to produce a substrate-centered reaction mosaic. We compared KEGG and BioCyc pathways using genome context methods, which determine the functional relatedness of pairs of genes. For each method we employed, a pair of genes randomly selected from a BioCyc pathway is more likely to be related by that method than is a pair of genes randomly selected from a KEGG pathway, supporting the conclusion that the BioCyc pathway conceptualization is closer to a single conserved biological process than is that of KEGG.
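The pair-based comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `linked_pair_fraction`, the toy gene identifiers, and the choice to pool gene pairs across all pathways of a database are assumptions made here for illustration. `linked_pairs` stands in for the output of any one genome context method.

```python
from itertools import combinations


def linked_pair_fraction(pathways, linked_pairs):
    """Return the fraction of same-pathway gene pairs that are linked.

    pathways: dict mapping a pathway name to a list of gene identifiers.
    linked_pairs: iterable of (gene_a, gene_b) pairs that some genome
        context method reports as functionally related (hypothetical input).
    Assumption: pairs are pooled over all pathways, so larger pathways
    contribute more pairs than smaller ones.
    """
    linked = {frozenset(pair) for pair in linked_pairs}
    total = 0
    hits = 0
    for genes in pathways.values():
        # Enumerate every unordered pair of distinct genes in the pathway.
        for a, b in combinations(sorted(set(genes)), 2):
            total += 1
            if frozenset((a, b)) in linked:
                hits += 1
    return hits / total if total else 0.0


# Toy data, invented for illustration only (not taken from the paper).
small_pathways = {"P1": ["g1", "g2", "g3"], "P2": ["g4", "g5"]}
large_pathways = {"M1": ["g1", "g2", "g3", "g4", "g5"]}
links = [("g1", "g2"), ("g2", "g3"), ("g4", "g5")]

small_score = linked_pair_fraction(small_pathways, links)
large_score = linked_pair_fraction(large_pathways, links)
```

In this toy case the same five genes score higher when grouped into two small pathways than when merged into one large pathway, because merging adds gene pairs that the method does not link. Whether a real comparison should pool pairs or average per pathway is a design choice the abstract does not specify.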