A text-mining analysis of the human phenome
- 22 February 2006
- journal article
- research article
- Published by Springer Nature in European Journal of Human Genetics
- Vol. 14 (5), 535-542
- https://doi.org/10.1038/sj.ejhg.5201585
Abstract
A number of large-scale efforts are underway to define the relationships between genes and proteins in various species. However, few attempts have been made to systematically classify all such relationships at the phenotype level. It is also unknown whether such a phenotype map would carry biologically meaningful information. We have used text mining to classify over 5000 human phenotypes contained in the Online Mendelian Inheritance in Man database. We find that similarity between phenotypes reflects biological modules of interacting, functionally related genes. These similarities are positively correlated with a number of measures of gene function, including relatedness at the level of protein sequence, protein motifs, functional annotation, and direct protein–protein interaction. Phenotype grouping reflects the modular nature of human disease genetics. Thus, phenotype mapping may be used to predict candidate genes for diseases as well as functional relations between genes and proteins. Such predictions will further improve if a unified system of phenotype descriptors is developed. The phenotype similarity data are accessible through a web interface at http://www.cmbi.ru.nl/MimMiner/.
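As a rough illustration of what text-based phenotype similarity can look like, the sketch below scores pairs of free-text descriptions by the cosine similarity of their word-count vectors. This is an assumption chosen for illustration, not the procedure described in the abstract, which does not specify the features or the similarity measure used. The function names and the three descriptions are hypothetical and are not taken from OMIM.

```python
import math
import re
from collections import Counter


def term_vector(text):
    """Return lowercased word counts for one free-text phenotype description."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine_similarity(a, b):
    """Cosine of the angle between two sparse count vectors (0.0 if either is empty)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical, made-up descriptions for illustration only; not OMIM records.
phenotypes = {
    "A": "progressive muscle weakness and muscle wasting",
    "B": "muscle weakness with cardiac involvement",
    "C": "retinal degeneration and night blindness",
}

vectors = {name: term_vector(text) for name, text in phenotypes.items()}
sim_ab = cosine_similarity(vectors["A"], vectors["B"])
sim_ac = cosine_similarity(vectors["A"], vectors["C"])

# A and B share disease-related terms, so they score higher than A and C,
# which overlap only on the word "and".
print(f"A-B: {sim_ab:.3f}  A-C: {sim_ac:.3f}")
```

A realistic pipeline would likely need stop-word filtering, a controlled vocabulary, and term weighting before such scores could be compared against gene-level measures; none of that is attempted here.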