MultiPipMaker and supporting tools: alignments and analysis of multiple genomic DNA sequences
- 1 July 2003
- journal article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 31 (13), 3518-3524
- https://doi.org/10.1093/nar/gkg579
Abstract
Analysis of multiple sequence alignments can generate important, testable hypotheses about the phylogenetic history and cellular function of genomic sequences. We describe the MultiPipMaker server, which aligns multiple, long genomic DNA sequences quickly and with good sensitivity (available at http://bio.cse.psu.edu/ since May 2001). Alignments are computed between a contiguous reference sequence and one or more secondary sequences, which can be finished or draft sequence. The outputs include a stacked set of percent identity plots, called a MultiPip, comparing the reference sequence with subsequent sequences, and a nucleotide-level multiple alignment. New tools are provided to search MultiPipMaker output for conserved matches to a user-specified pattern and for conserved matches to position weight matrices that describe transcription factor binding sites (singly and in clusters). We illustrate the use of MultiPipMaker to identify candidate regulatory regions in WNT2 and then demonstrate by transfection assays that they are functional. Analysis of the alignments also confirms the phylogenetic inference that horses are more closely related to cats than to cows.
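To make the idea of a percent identity plot concrete, the following is a minimal sketch of how percent identity could be computed for each gap-free segment of a pairwise alignment, positioned along the reference sequence. It is an illustration only and not the server's implementation: the function name `gap_free_segments`, the use of `-` as the gap character, and the 0-based half-open coordinates are all assumptions made for this example.

```python
def gap_free_segments(ref_aln, sec_aln):
    """Return (ref_start, ref_end, percent_identity) for each gap-free run.

    ref_aln and sec_aln are two rows of a pairwise alignment (hypothetical
    input format, '-' marks a gap). Coordinates are 0-based, half-open
    positions in the ungapped reference sequence.
    """
    if len(ref_aln) != len(sec_aln):
        raise ValueError("aligned rows must have equal length")

    segments = []
    ref_pos = 0      # current position in the ungapped reference
    start = None     # reference coordinate where the current run began
    matches = 0      # identical columns in the current run
    length = 0       # total columns in the current run

    for r, s in zip(ref_aln.upper(), sec_aln.upper()):
        if r == "-" or s == "-":
            # A gap in either row closes the current gap-free run.
            if length:
                segments.append((start, start + length, 100.0 * matches / length))
            start, matches, length = None, 0, 0
        else:
            if start is None:
                start = ref_pos
            matches += (r == s)
            length += 1
        if r != "-":
            ref_pos += 1  # only reference bases advance the coordinate

    if length:
        segments.append((start, start + length, 100.0 * matches / length))
    return segments


if __name__ == "__main__":
    for seg in gap_free_segments("ACGT-ACGTAC", "ACGAAAC-TAC"):
        print(seg)
```

Each returned tuple corresponds to one horizontal line in a plot of this kind: its extent along the reference gives the x-range and its percent identity gives the height. Stacking one such set of segments per secondary sequence mirrors the layout the abstract describes.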