An intermediate grade of finished genomic sequence suitable for comparative analyses

Open Access

12 October 2004

journal article
research article
Published by Cold Spring Harbor Laboratory in Genome Research

Vol. 14 (11), 2235-2244
https://doi.org/10.1101/gr.2648404

Abstract

Although the cost of generating draft-quality genomic sequence continues to decline, refining that sequence by the process of “sequence finishing” remains expensive. Near-perfect finished sequence is an appropriate goal for the human genome and a small set of reference genomes; however, such a high-quality product cannot be cost-justified for large numbers of additional genomes, at least for the foreseeable future. Here we describe the generation and quality of an intermediate grade of finished genomic sequence (termed comparative-grade finished sequence), which is tailored for use in multispecies sequence comparisons. Our analyses indicate that this sequence is of very high quality (with the residual gaps and errors mostly falling within repetitive elements) and reflects 99% of the total sequence. Importantly, comparative-grade sequence finishing requires ∼40-fold fewer reagents and ∼10-fold less personnel effort than the generation of near-perfect finished sequence, such as that produced for the human genome. Although the approach is applied here to finishing sequence derived from individual bacterial artificial chromosome (BAC) clones, one could envision establishing routines for refining sequences emanating from whole-genome shotgun sequencing projects to a similar quality level. Our experience to date demonstrates that comparative-grade sequence finishing represents a practical and affordable option for sequence refinement en route to comparative analyses.
