The Human Genome Project: a paradigm for information management in the life sciences

Abstract
The major product of the Human Genome Project will be a series of linked data sets containing the genetic and physical location of all genes on each chromosome, plus the complete nucleotide sequence of the genome for humans and several model organisms. Here we summarize the current status of attempts to collect, analyze, and distribute this information in an electronically accessible form. Although formidable problems remain to be solved in the acquisition and adequate representation of the genetic, physical, and biological data, this project is a model for the rapid dissemination of genome and related information in biology and medicine.