Mixture autoregressive hidden Markov models for speech signals

1 December 1985

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Acoustics, Speech, and Signal Processing

Vol. 33 (6), 1404-1413
https://doi.org/10.1109/tassp.1985.1164727

Abstract

In this paper a signal modeling technique based upon finite mixture autoregressive probabilistic functions of Markov chains is developed and applied to the problem of speech recognition, particularly speaker-independent recognition of isolated digits. Two types of mixture probability densities are investigated: finite mixtures of Gaussian autoregressive densities (GAM) and nearest-neighbor partitioned finite mixtures of Gaussian autoregressive densities (PGAM). In the former (GAM), the observation density in each Markov state is simply a (stochastically constrained) weighted sum of Gaussian autoregressive densities, while in the latter (PGAM) it involves nearest-neighbor decoding which in effect, defines a set of partitions on the observation space. In this paper we discuss the signal modeling methodology and give experimental results on speaker independent recognition of isolated digits. We also discuss the potential use of the modeling technique for other applications.

Keywords

This publication has 18 references indexed in Scilit:

Some Properties of Continuous Hidden Markov Model Representations
AT&T Technical Journal, 1985
Recognition of Isolated Digits Using Hidden Markov Models With Continuous Mixture Densities
AT&T Technical Journal, 1985
A Probabilistic Distance Measure for Hidden Markov Models
AT&T Technical Journal, 1985
An Introduction to the Application of the Theory of Probabilistic Functions of a Markov Process to Automatic Speech Recognition
Bell System Technical Journal, 1983
Rate-distortion speech coding with a minimum discrimination information distortion measure
IEEE Transactions on Information Theory, 1981
A simplified, robust training procedure for speaker trained, isolated word recognition systems
The Journal of the Acoustical Society of America, 1980
Speech coding based upon vector quantization
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1980
Speaker-independent recognition of isolated words using clustering techniques
IEEE Transactions on Acoustics, Speech, and Signal Processing, 1979
Linear Prediction of Speech
Communication and Cybernetics, 1976
The viterbi algorithm
Proceedings of the IEEE, 1973

Cited by 200 articles