Generation and analysis of 280,000 human expressed sequence tags.
Open Access
- 1 September 1996
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 6 (9), 807-828
- https://doi.org/10.1101/gr.6.9.807
Abstract
We report the generation of 319,311 single-pass sequencing reactions (known as expressed sequence tags, or ESTs) obtained from the 5' and 3' ends of 194,031 human cDNA clones. Our goal has been to obtain tag sequences from many different genes and to deposit these in the publicly accessible Data Base for Expressed Sequence Tags. Highly efficient automatic screening of the data allows deposition of the annotated sequences without delay. Sequences have been generated from 26 oligo(dT) primed directionally cloned libraries, of which 18 were normalized. The libraries were constructed using mRNA isolated from 17 different tissues representing three developmental states. Comparisons of a subset of our data with nonredundant human mRNA and protein data bases show that the ESTs represent many known sequences and contain many that are novel. Analysis of protein families using Hidden Markov Models confirms this observation and supports the contention that although normalization reduces significantly the relative abundance of redundant cDNA clones, it does not result in the complete removal of members of gene families.Keywords
This publication has 34 references indexed in Scilit:
- Genome sequencing: The complete code for a eukaryotic cellCurrent Biology, 1996
- Regional assignment of EST sequences on human chromosome 13Cytogenetic and Genome Research, 1996
- Maximum Discrimination Hidden Markov Models of Sequence ConsensusJournal of Computational Biology, 1995
- Isolation of Novel and Known Genes from a Human Fetal Cochlear cDNA Library Using Subtractive Hybridization and Differential ScreeningGenomics, 1994
- 2.2 Mb of contiguous nucleotide sequence from chromosome III of C. elegansNature, 1994
- Hidden Markov Models in Computational BiologyJournal of Molecular Biology, 1994
- cDNA analyses in the human genome projectGene, 1993
- Basic Local Alignment Search ToolJournal of Molecular Biology, 1990
- Primer-Directed Enzymatic Amplification of DNA with a Thermostable DNA PolymeraseScience, 1988
- A new troponin T and cDNA clones for 13 different muscle proteins, found by shotgun sequencingNature, 1983