Mixture autoregressive hidden Markov models for speech signals
- 1 December 1985
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Acoustics, Speech, and Signal Processing
- Vol. 33 (6), 1404-1413
- https://doi.org/10.1109/tassp.1985.1164727
Abstract
In this paper a signal modeling technique based upon finite mixture autoregressive probabilistic functions of Markov chains is developed and applied to the problem of speech recognition, particularly speaker-independent recognition of isolated digits. Two types of mixture probability densities are investigated: finite mixtures of Gaussian autoregressive densities (GAM) and nearest-neighbor partitioned finite mixtures of Gaussian autoregressive densities (PGAM). In the former (GAM), the observation density in each Markov state is simply a (stochastically constrained) weighted sum of Gaussian autoregressive densities, while in the latter (PGAM) it involves nearest-neighbor decoding which in effect, defines a set of partitions on the observation space. In this paper we discuss the signal modeling methodology and give experimental results on speaker independent recognition of isolated digits. We also discuss the potential use of the modeling technique for other applications.Keywords
This publication has 18 references indexed in Scilit:
- Some Properties of Continuous Hidden Markov Model RepresentationsAT&T Technical Journal, 1985
- Recognition of Isolated Digits Using Hidden Markov Models With Continuous Mixture DensitiesAT&T Technical Journal, 1985
- A Probabilistic Distance Measure for Hidden Markov ModelsAT&T Technical Journal, 1985
- An Introduction to the Application of the Theory of Probabilistic Functions of a Markov Process to Automatic Speech RecognitionBell System Technical Journal, 1983
- Rate-distortion speech coding with a minimum discrimination information distortion measureIEEE Transactions on Information Theory, 1981
- A simplified, robust training procedure for speaker trained, isolated word recognition systemsThe Journal of the Acoustical Society of America, 1980
- Speech coding based upon vector quantizationIEEE Transactions on Acoustics, Speech, and Signal Processing, 1980
- Speaker-independent recognition of isolated words using clustering techniquesIEEE Transactions on Acoustics, Speech, and Signal Processing, 1979
- Linear Prediction of SpeechCommunication and Cybernetics, 1976
- The viterbi algorithmProceedings of the IEEE, 1973