PipeOnline 2.0: automated EST processing and functional data sorting
Open Access
- 1 November 2002
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 30 (21), 4761-4769
- https://doi.org/10.1093/nar/gkf585
Abstract
Expressed sequence tags (ESTs) are generated and deposited in the public domain, as redundant, un‐annotated, single‐pass reactions, with virtually no biological content. PipeOnline automatically analyses and transforms large collections of raw DNA‐sequence data from chromatograms or FASTA files by calling the quality of bases, screening and removing vector sequences, assembling and rewriting consensus sequences of redundant input files into a unigene EST data set and finally through translation, amino acid sequence similarity searches, annotation of public databases and functional data. PipeOnline generates an annota ted database, retaining the processed unigene sequence, clone/file history, alignments with similar sequences, and proposed functional classification, if available. Functional annotation is automatic and based on a novel method that relies on homology of amino acid sequence multiplicity within GenBank records. Records are examined through a function ordered browser or keyword queries with automated export of results. PipeOnline offers customization for individual projects (MyPipeOnline), automated updating and alert service. PipeOnline is available at http://stress‐genomics.org.Keywords
This publication has 26 references indexed in Scilit:
- Cross-Referencing Eukaryotic Genomes: TIGR Orthologous Gene Alignments (TOGA)Genome Research, 2002
- Genome analysis with gene-indexing databasesPharmacology & Therapeutics, 2001
- Assembly, Annotation, and Integration of UNIGENE Clusters into the Human Genome DraftGenome Research, 2001
- STACK: Sequence Tag Alignment and Consensus KnowledgebaseNucleic Acids Research, 2001
- The TIGR Gene Indices: analysis of gene transcript sequences in highly sampled eukaryotic speciesNucleic Acids Research, 2001
- Mendel-GFDb and Mendel-ESTS: databases of plant gene families and ESTs annotated with gene family numbers and gene family namesNucleic Acids Research, 2001
- Recent developments and future directions in computational genomicsFEBS Letters, 2000
- Gene Ontology: tool for the unification of biologyNature Genetics, 2000
- EST databases as multi-conditional gene expression datasetsPacific Symposium on Biocomputing, 1999
- Automated genome sequence analysis and annotation.Bioinformatics, 1999