The Ensembl genome database project

Open Access

1 January 2002

journal article
research article
Published by Oxford University Press (OUP) in Nucleic Acids Research

Vol. 30 (1), 38-41
https://doi.org/10.1093/nar/30.1.38

Abstract

The Ensembl (http://www.ensembl.org/) database project provides a bioinformatics framework to organise biology around the sequences of large genomes. It is a comprehensive source of stable automatic annotation of the human genome sequence, with confirmed gene predictions that have been integrated with external data sources, and is available as either an interactive web site or as flat files. It is also an open source software engineering project to develop a portable system able to handle very large genomes and associated requirements from sequence analysis to data storage and visualisation. The Ensembl site is one of the leading sources of human genome sequence annotation and provided much of the analysis for publication by the international human genome project of the draft genome. The Ensembl system is being installed around the world in both companies and academic sites on machines ranging from supercomputers to laptops.

Cited by 1317 articles