Prediction of the Coding Sequences of Unidentified Human Genes. III. The Coding Sequences of 40 New Genes (KIAA0081-KIAA0120) Deduced by Analysis of cDNA Clones from Human Cell Line KG-1

Abstract
We isolated full-length cDNA clones from size-fractionated cDNA libraries of the human immature myeloid cell line KG-1 and newly predicted the coding sequences of 40 genes. A computer search of the GenBank/EMBL databases indicated that the sequences of 14 genes were unrelated to any reported genes, while the remaining 26 genes carried some sequences with similarities to known genes. Significant transmembrane domains were identified in 17 genes, and protein motifs matching those in the PROSITE motif database were identified in 11 genes. Northern hybridization analysis with 18 different cells and tissues demonstrated that 10 genes were apparently expressed in a cell-specific or tissue-specific manner. Of the predicted genes, half were isolated from the medium-sized cDNA library and the other half from the small-sized cDNA library, with average sizes of 4 kb and 1.4 kb, respectively. As judged by Northern hybridization profiles, small-sized cDNAs appeared to be expressed more ubiquitously and abundantly in various tissues than medium-sized cDNAs.