Probabilistic inference of molecular networks from noisy data sources

Open Access

10 February 2004

journal article
research article
Published by Oxford University Press (OUP) in Bioinformatics

Vol. 20 (8), 1205-1213
https://doi.org/10.1093/bioinformatics/bth061

Abstract

Summary: Information on molecular networks, such as networks of interacting proteins, comes from diverse sources that differ markedly in the distribution and quantity of their errors. Here, we introduce a probabilistic model for predicting protein interactions from heterogeneous data sources. The model describes the stochastic generation of protein–protein interaction networks with real-world properties, as well as the generation of two heterogeneous sources of protein-interaction information: research results automatically extracted from the literature and yeast two-hybrid experiments. Based on the domain composition of proteins, we use the model to predict interactions for pairs of proteins for which no experimental data are available. We further explore the limits of prediction when the experimental data cover only part of the underlying protein networks. The approach extends naturally to other types of biological data sources.
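
To make the idea of combining noisy sources concrete, the following is a minimal sketch and not the model described in the article. It assumes that each source reports an interaction independently given the true state of the protein pair, and that each source has a known true-positive and false-positive rate. The function name, the prior, and all rates are hypothetical values chosen for illustration; the article's model additionally covers network generation and domain composition, which are omitted here.

```python
def posterior_interaction(prior, observations):
    """Posterior probability that two proteins interact.

    Illustrative assumption: sources are conditionally independent
    given the true interaction state (a naive-Bayes simplification).

    prior        -- assumed prior probability of an interaction
    observations -- list of (reported, true_positive_rate, false_positive_rate),
                    one tuple per data source (all rates are hypothetical)
    """
    likelihood_if_interacting = 1.0
    likelihood_if_not = 1.0
    for reported, tpr, fpr in observations:
        # Probability of this report if the pair truly interacts.
        likelihood_if_interacting *= tpr if reported else (1.0 - tpr)
        # Probability of this report if the pair does not interact.
        likelihood_if_not *= fpr if reported else (1.0 - fpr)

    numerator = prior * likelihood_if_interacting
    return numerator / (numerator + (1.0 - prior) * likelihood_if_not)


# Hypothetical error rates for two sources: literature mining and yeast two-hybrid.
literature_hit = (True, 0.6, 0.05)
two_hybrid_hit = (True, 0.3, 0.02)

p_one_source = posterior_interaction(0.01, [literature_hit])
p_both_sources = posterior_interaction(0.01, [literature_hit, two_hybrid_hit])
```

Under these assumed rates, agreement between the two sources yields a higher posterior than either report alone, which is the intuition behind integrating heterogeneous evidence rather than trusting any single source.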

Cited by 26 articles