Mauve: Multiple Alignment of Conserved Genomic Sequence With Rearrangements
Top Cited Papers
Open Access
- 1 July 2004
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 14 (7), 1394-1403
- https://doi.org/10.1101/gr.2289704
Abstract
As genomes evolve, they undergo large-scale evolutionary processes that present a challenge to sequence comparison not posed by short sequences. Recombination causes frequent genome rearrangements, horizontal transfer introduces new sequences into bacterial chromosomes, and deletions remove segments of the genome. Consequently, each genome is a mosaic of unique lineage-specific segments, regions shared with a subset of other genomes and segments conserved among all the genomes under consideration. Furthermore, the linear order of these segments may be shuffled among genomes. We present methods for identification and alignment of conserved genomic DNA in the presence of rearrangements and horizontal transfer. Our methods have been implemented in a software package called Mauve. Mauve has been applied to align nine enterobacterial genomes and to determine global rearrangement structure in three mammalian genomes. We have evaluated the quality of Mauve alignments and drawn comparison to other methods through extensive simulations of genome evolution.Keywords
This publication has 45 references indexed in Scilit:
- Complete Genome Sequence and Comparative Genomics ofShigella flexneriSerotype 2a Strain 2457TInfection and Immunity, 2003
- LAGAN and Multi-LAGAN: Efficient Tools for Large-Scale Multiple Alignment of Genomic DNAGenome Research, 2003
- Genome Rearrangements in Mammalian Evolution: Lessons From Human and Mouse GenomesGenome Research, 2002
- Bayesian Phylogenetic Inference from Animal Mitochondrial Genome ArrangementsJournal of the Royal Statistical Society Series B: Statistical Methodology, 2002
- Caloramator viterbensis sp. nov., a novel thermophilic, glycerol-fermenting bacterium isolated from a hot spring in ItalyInternational Journal of Systematic and Evolutionary Microbiology, 2002
- A Linear-Time Algorithm for Computing Inversion Distance between Signed Permutations with an Experimental StudyJournal of Computational Biology, 2001
- Genome rearrangement by replication-directed translocationNature Genetics, 2000
- T-coffee: a novel method for fast and accurate multiple sequence alignment 1 1Edited by J. ThorntonJournal of Molecular Biology, 2000
- The Complete Genome Sequence of Escherichia coli K-12Science, 1997
- CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choiceNucleic Acids Research, 1994