High-Throughput Gene Mapping in Caenorhabditis elegans
- 1 July 2002
- journal article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 12 (7), 1100-1105
- https://doi.org/10.1101/gr.208902
Abstract
We describe a new computer system, calledARACHNE, for assembling genome sequence using paired-end whole-genome shotgun reads. ARACHNEhas several key features, including an efficient and sensitive procedure for finding read overlaps, a procedure for scoring overlaps that achieves high accuracy by correcting errors before assembly, read merger based on forward-reverse links, and detection of repeat contigs by forward-reverse link inconsistency. To testARACHNE, we created simulated reads providing ∼10-fold coverage of the genomes of H. influenzae, S. cerevisiae, and D. melanogaster, as well as human chromosomes 21 and 22. The assemblies of these simulated reads yielded nearly complete coverage of the respective genomes, with a small number of contigs joined into a smaller number of supercontigs (or scaffolds). For example, analysis of the D. melanogaster genome yielded ∼98% coverage with an N50 contig length of 324 kb and an N50 supercontig length of 5143 kb. The assembly accuracy was high, although not perfect: small errors occurred at a frequency of roughly 1 per 1 Mb (typically, deletion of ∼1 kb in size), with a very small number of other misassemblies. The assembly was rapid: the Drosophilaassembly required only 21 hours on a single 667 MHz processor and used 8.4 Gb of memory.Keywords
This publication has 20 references indexed in Scilit:
- A genomic bias for genotype–environment interactions in C. elegansMolecular Systems Biology, 2012
- Development and characterization of genome-wide single nucleotide polymorphism markers in the green alga Chlamydomonas reinhardtii.2001
- Development and Characterization of Genome-Wide Single Nucleotide Polymorphism Markers in the Green Alga Chlamydomonas reinhardtiiPlant Physiology, 2001
- Methods for Genotyping Single Nucleotide PolymorphismsAnnual Review of Genomics and Human Genetics, 2001
- Rapid gene mapping in Caenorhabditis elegans using a high density polymorphism mapNature Genetics, 2001
- Working in the Post-Genomic C. elegans WorldCell, 2001
- Large-scale analysis of gene function in Caenorhabditis elegans by high-throughput RNAiCurrent Biology, 2001
- WormBase: network access to the genome and biology of Caenorhabditis elegansNucleic Acids Research, 2001
- Genome Sequence of the Nematode C. elegans : A Platform for Investigating BiologyScience, 1998
- Positional cloning moves from perditional to traditionalNature Genetics, 1995