Analysis of the mouse transcriptome based on functional annotation of 60,770 full-length cDNAs
Open Access
- 5 December 2002
- journal article
- research article
- Published by Springer Nature in Nature
- Vol. 420 (6915), 563-573
- https://doi.org/10.1038/nature01266
Abstract
Only a small proportion of the mouse genome is transcribed into mature messenger RNA transcripts. There is an international collaborative effort to identify all full-length mRNA transcripts from the mouse, and to ensure that each is represented in a physical collection of clones. Here we report the manual annotation of 60,770 full-length mouse complementary DNA sequences. These are clustered into 33,409 ‘transcriptional units’, contributing 90.1% of a newly established mouse transcriptome database. Of these transcriptional units, 4,258 are new protein-coding and 11,665 are new non-coding messages, indicating that non-coding RNA is a major component of the transcriptome. 41% of all transcriptional units showed evidence of alternative splicing. In protein-coding transcripts, 79% of splice variations altered the protein product. Whole-transcriptome analyses resulted in the identification of 2,431 sense–antisense pairs. The present work, completely supported by physical clones, provides the most comprehensive survey of a mammalian transcriptome so far, and is a valuable resource for functional genomics.