Maximum Discrimination Hidden Markov Models of Sequence Consensus

1 January 1995

journal article
research article
Published by Mary Ann Liebert Inc in Journal of Computational Biology

Vol. 2 (1), 9-23
https://doi.org/10.1089/cmb.1995.2.9

Abstract

We introduce a maximum discrimination method for building hidden Markov models (HMMs) of protein or nucleic acid primary sequence consensus. The method compensates for biased representation in sequence data sets, superseding the need for sequence weighting methods. Maximum discrimination HMMs are more sensitive for detecting distant sequence homologs than various other HMM methods or BLAST when tested on globin and protein kinase catalytic domain sequences. Key words: hidden Markov model; database searching; sequence consensus; sequence weighting

Keywords

This publication has 32 references indexed in Scilit:

Hidden Markov models of biological primary sequence information.
Proceedings of the National Academy of Sciences, 1994
The PROSITE dictionary of sites and patterns in proteins, its current status
Nucleic Acids Research, 1993
Comprehensive sequence analysis of the 182 predicted open reading frames of yeast chromosome III
Protein Science, 1992
One thousand families for the molecular biologist
Nature, 1992
Polar zipper sequence in the high-affinity hemoglobin of Ascaris suum: amino acid sequence and structural interpretation.
Proceedings of the National Academy of Sciences of the United States of America, 1992
Amino acid substitution matrices from an information theoretic perspective
Journal of Molecular Biology, 1991
Basic local alignment search tool
Journal of Molecular Biology, 1990
[25] Protein multiple sequence alignment and flexible pattern matching
Published by Elsevier BV ,1990
Weights for data related by a tree
Journal of Molecular Biology, 1989
Determinants of a protein fold: Unique features of the globin amino acid sequences
Journal of Molecular Biology, 1987

Cited by 171 articles