PipeOnline 2.0: automated EST processing and functional data sorting

Open Access

1 November 2002

journal article
Published by Oxford University Press (OUP) in Nucleic Acids Research

Vol. 30 (21), 4761-4769
https://doi.org/10.1093/nar/gkf585

Abstract

Expressed sequence tags (ESTs) are generated and deposited in the public domain, as redundant, un‐annotated, single‐pass reactions, with virtually no biological content. PipeOnline automatically analyses and transforms large collections of raw DNA‐sequence data from chromatograms or FASTA files by calling the quality of bases, screening and removing vector sequences, assembling and rewriting consensus sequences of redundant input files into a unigene EST data set and finally through translation, amino acid sequence similarity searches, annotation of public databases and functional data. PipeOnline generates an annotated database, retaining the processed unigene sequence, clone/file history, alignments with similar sequences, and proposed functional classification, if available. Functional annotation is automatic and based on a novel method that relies on homology of amino acid sequence multiplicity within GenBank records. Records are examined through a function ordered browser or keyword queries with automated export of results. PipeOnline offers customization for individual projects (MyPipeOnline), automated updating and alert service. PipeOnline is available at http://stress-genomics.org.
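
The processing order described in the abstract (FASTA input, vector removal, collapsing redundant reads into a unigene set, translation) can be illustrated with a toy sketch. This is not PipeOnline's code: the function names, the `VECTOR` fragment, and the example clones are all hypothetical, exact prefix matching stands in for real vector screening, and grouping identical sequences stands in for true assembly. Base calling and similarity searches are omitted.

```python
from itertools import product

# Standard genetic code, built from the conventional TCAG codon ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def parse_fasta(text):
    """Return {name: sequence} from FASTA-formatted text."""
    records, name = {}, None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].split()[0]
            records[name] = ""
        elif name is not None:
            records[name] += line.upper()
    return records


def trim_vector(seq, vector):
    """Remove an exact vector prefix (assumption: real screening aligns)."""
    return seq[len(vector):] if seq.startswith(vector) else seq


def collapse_redundant(records):
    """Group clones with identical sequences: {sequence: [clone names]}.

    A stand-in for assembly that still keeps the clone history.
    """
    unigenes = {}
    for name, seq in records.items():
        unigenes.setdefault(seq, []).append(name)
    return unigenes


def translate(seq, frame=0):
    """Translate one forward reading frame; unknown codons become 'X'."""
    codons = (seq[i:i + 3] for i in range(frame, len(seq) - 2, 3))
    return "".join(CODON_TABLE.get(c, "X") for c in codons)


# Hypothetical input: two redundant clones and one distinct clone.
fasta = """>clone1
GAATTCATGGCTTGA
>clone2
GAATTCATGGCTTGA
>clone3
ATGAAATAA
"""
VECTOR = "GAATTC"  # hypothetical vector fragment, for illustration only

records = {n: trim_vector(s, VECTOR) for n, s in parse_fasta(fasta).items()}
unigenes = collapse_redundant(records)
proteins = {seq: translate(seq) for seq in unigenes}
```

Here `unigenes` maps each distinct trimmed sequence to the clones that produced it, mirroring the clone/file history the abstract says is retained, and `proteins` holds the frame-0 translation that a similarity search would take as its query.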

