Formant estimation for speech recognition

1 January 1998

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Speech and Audio Processing

Vol. 6 (1), 36-48
https://doi.org/10.1109/89.650308

Abstract

This paper presents a new method for estimating formant frequencies. The formant model is based on a digital resonator, with each resonator representing a segment of the short-time power spectrum. The complete spectrum is modeled by a set of digital resonators connected in parallel. An algorithm based on dynamic programming produces both the model parameters and the segment boundaries that optimally match the spectrum. The method was evaluated in experimental tests on the TI digit string database. The main results are: (1) the presented approach produces reliable estimates of formant frequencies across a wide range of sounds and speakers; and (2) the estimated formant frequencies were used in a number of variants for recognition, the best of which resulted in a string error rate of 4.2% on the adult corpus of the TI digit string database.
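
The abstract names the ingredients (one model per spectral segment, boundaries chosen by dynamic programming) but gives no equations, so the following is only a rough sketch of the segmentation idea, not the paper's algorithm. It assumes a stand-in per-segment cost, the power-weighted spread of each segment around its centroid, in place of the digital-resonator fit described above, and it reports the centroid bin as a crude formant estimate. The function name `segment_spectrum` and all variable names are hypothetical.

```python
def segment_spectrum(power, num_segments):
    """Split a power spectrum into contiguous segments by dynamic programming.

    ASSUMPTION: the per-segment cost is the power-weighted spread around the
    segment centroid. This is a stand-in for the resonator model error used
    in the paper, which the abstract does not specify.
    Returns (bounds, centroids); segment t covers bins bounds[t]..bounds[t+1]-1.
    """
    n = len(power)
    # Prefix sums of p, p*k and p*k^2 give each segment cost in O(1).
    s0, s1, s2 = [0.0], [0.0], [0.0]
    for k, p in enumerate(power):
        s0.append(s0[-1] + p)
        s1.append(s1[-1] + p * k)
        s2.append(s2[-1] + p * k * k)

    def fit(i, j):
        """Cost and centroid of the segment covering bins i..j-1."""
        m0 = s0[j] - s0[i]
        if m0 <= 0.0:
            return 0.0, (i + j - 1) / 2.0  # empty segment: use the midpoint
        m1 = s1[j] - s1[i]
        m2 = s2[j] - s2[i]
        return max(m2 - m1 * m1 / m0, 0.0), m1 / m0

    inf = float("inf")
    # best[s][j]: minimal total cost of covering bins 0..j-1 with s segments.
    best = [[inf] * (n + 1) for _ in range(num_segments + 1)]
    back = [[0] * (n + 1) for _ in range(num_segments + 1)]
    best[0][0] = 0.0
    for s in range(1, num_segments + 1):
        for j in range(s, n + 1):
            for i in range(s - 1, j):
                if best[s - 1][i] == inf:
                    continue
                total = best[s - 1][i] + fit(i, j)[0]
                if total < best[s][j]:
                    best[s][j] = total
                    back[s][j] = i

    # Backtrack from the last bin to recover the segment boundaries.
    bounds = [n]
    j = n
    for s in range(num_segments, 0, -1):
        j = back[s][j]
        bounds.append(j)
    bounds.reverse()
    centroids = [fit(bounds[t], bounds[t + 1])[1] for t in range(num_segments)]
    return bounds, centroids


# Toy spectrum with two peaks, centered on bins 4 and 14.
toy = [0.0] * 20
toy[3], toy[4], toy[5] = 1.0, 2.0, 1.0
toy[13], toy[14], toy[15] = 1.0, 2.0, 1.0
bounds, centroids = segment_spectrum(toy, 2)
```

On the toy spectrum, the two segments separate the peaks and the centroids land on the peak bins. A real implementation would replace `fit` with the error of a resonator fitted to each segment and would convert bin indices to Hz using the sampling rate.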

Cited by 67 articles