Estimate of human gene number provided by genome-wide analysis using Tetraodon nigroviridis DNA sequence
- 1 June 2000
- journal article
- research article
- Published by Springer Nature in Nature Genetics
- Vol. 25 (2), 235-238
- https://doi.org/10.1038/76118
Abstract
The number of genes in the human genome is unknown, with estimates ranging from 50,000 to 90,000 (refs 1, 2), and to more than 140,000 according to unpublished sources. We have developed ‘Exofish’, a procedure based on homology searches, to identify human genes quickly and reliably. This method relies on the sequence of another vertebrate, the pufferfish Tetraodon nigroviridis, to detect conserved sequences with a very low background. Similar to Fugu rubripes , a marine pufferfish proposed by Brenner et al.3 as a model for genomic studies, T. nigroviridis is a more practical alternative4 with a genome also eight times more compact than that of human. Many comparisons have been made between F. rubripes and human DNA that demonstrate the potential of comparative genomics using the pufferfish genome5. Application of Exofish to the December version of the working draft sequence of the human genome and to Unigene showed that the human genome contains 28,000–34,000 genes, and that Unigene contains less than 40% of the protein-coding fraction of the human genome.Keywords
This publication has 12 references indexed in Scilit:
- The DNA sequence of human chromosome 22Nature, 1999
- Generation and Analysis of 25 Mb of Genomic DNA from the Pufferfish Fugu rubripes by Sequence ScanningGenome Research, 1999
- Tandem repeats finder: a program to analyze DNA sequencesNucleic Acids Research, 1999
- A Physical Map of 30,000 Human GenesScience, 1998
- Tetraodon fluviatilis,a New Puffer Fish Model for Genome StudiesGenomics, 1997
- How many genes in the human genome?Nature Genetics, 1994
- Number of CpG islands and genes in human and mouse.Proceedings of the National Academy of Sciences, 1993
- Characterization of the pufferfish (Fugu) genome as a compact model vertebrate genomeNature, 1993
- Basic local alignment search toolJournal of Molecular Biology, 1990
- Identification of common molecular subsequencesJournal of Molecular Biology, 1981