Formant estimation for speech recognition
- 1 January 1998
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Speech and Audio Processing
- Vol. 6 (1), 36-48
- https://doi.org/10.1109/89.650308
Abstract
This paper presents a new method for estimating formant frequencies. The formant model is based on a digital resonator. Each resonator represents a segment of the short-time power spectrum. The complete spectrum is modeled by a set of digital resonators connected in parallel. An algorithm based on dynamic programming produces both the model parameters and the segment boundaries that optimally match the spectrum. We used this method in experimental tests that were carried out on the TI digit string data base. The main results of the experimental tests are: (1) the presented approach produces reliable estimates of formant frequencies across a wide range of sounds and speakers; and (2) the estimated formant frequencies were used in a number of variants for recognition. The best set-up resulted in a string error rate of 4.2% on the adult corpus of the TI digit string data base.
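The dynamic-programming idea in the abstract, choosing segment boundaries and per-segment parameters jointly so that the total mismatch is minimal, can be illustrated with a minimal sketch. This is not the paper's algorithm: the per-segment cost below is a stand-in (power-weighted spread of frequency around the segment centroid) rather than a digital-resonator fit, and the names `segment_spectrum`, `power`, and `n_segments` are hypothetical. The centroid of each segment serves only as a crude proxy for a resonance frequency.

```python
import numpy as np

def segment_spectrum(power, n_segments):
    """Split a power spectrum into contiguous segments by dynamic programming.

    Assumption: the segment cost is the power-weighted squared deviation of
    the bin index from the segment centroid. The paper fits a digital
    resonator per segment instead; this cost is only a simple stand-in.
    Returns (bounds, centroids), where bounds = [0, b1, ..., n].
    """
    power = np.asarray(power, dtype=float)
    n = len(power)
    freqs = np.arange(n, dtype=float)

    # Prefix sums of the 0th, 1st and 2nd moments give O(1) segment costs.
    c0 = np.concatenate(([0.0], np.cumsum(power)))
    c1 = np.concatenate(([0.0], np.cumsum(power * freqs)))
    c2 = np.concatenate(([0.0], np.cumsum(power * freqs ** 2)))

    def cost(i, j):
        # Cost of one segment covering bins i .. j-1.
        m0 = c0[j] - c0[i]
        if m0 <= 0.0:
            return 0.0
        m1 = c1[j] - c1[i]
        m2 = c2[j] - c2[i]
        return max(m2 - m1 * m1 / m0, 0.0)

    # best[k, j]: minimal total cost of covering bins 0 .. j-1 with k segments.
    best = np.full((n_segments + 1, n + 1), np.inf)
    back = np.zeros((n_segments + 1, n + 1), dtype=int)
    best[0, 0] = 0.0
    for k in range(1, n_segments + 1):
        for j in range(k, n + 1):
            for i in range(k - 1, j):
                cand = best[k - 1, i] + cost(i, j)
                if cand < best[k, j]:
                    best[k, j] = cand
                    back[k, j] = i

    # Backtrack from the full spectrum to recover the optimal boundaries.
    bounds = [n]
    j = n
    for k in range(n_segments, 0, -1):
        j = back[k, j]
        bounds.append(j)
    bounds.reverse()

    centroids = []
    for i, j in zip(bounds[:-1], bounds[1:]):
        m0 = c0[j] - c0[i]
        centroids.append((c1[j] - c1[i]) / m0 if m0 > 0 else 0.5 * (i + j - 1))
    return bounds, centroids


# Synthetic spectrum with three peaks (bin indices, not Hz).
bins = np.arange(128)
spectrum = sum(np.exp(-0.5 * ((bins - c) / 3.0) ** 2) for c in (20, 60, 100))
bounds, centroids = segment_spectrum(spectrum, 3)
print(bounds, [round(c, 1) for c in centroids])
```

In this toy case each segment ends up containing one peak, and the segment centroids land close to the peak positions. Replacing `cost` with the error of a resonator fit would leave the dynamic-programming recursion itself unchanged.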
This publication has 19 references indexed in Scilit:
- A database for speaker-independent digit recognition. Published by Institute of Electrical and Electronics Engineers (IEEE), 2005
- A comparison of several acoustic representations for speech recognition with degraded and undegraded speech. Published by Institute of Electrical and Electronics Engineers (IEEE), 2003
- Speaker dependent and independent speech recognition experiments with an auditory model. Published by Institute of Electrical and Electronics Engineers (IEEE), 2003
- A model for efficient formant estimation. Published by Institute of Electrical and Electronics Engineers (IEEE), 1996
- Improvements in connected digit recognition using linear discriminant analysis and mixture densities. Published by Institute of Electrical and Electronics Engineers (IEEE), 1993
- Improved acoustic modeling with Bayesian learning. Published by Institute of Electrical and Electronics Engineers (IEEE), 1992
- Formant tracking using hidden Markov models and vector quantization. IEEE Transactions on Acoustics, Speech, and Signal Processing, 1986
- Software for a cascade/parallel formant synthesizer. The Journal of the Acoustical Society of America, 1980
- A method for segmenting acoustic patterns, with applications to automatic speech recognition. Published by Institute of Electrical and Electronics Engineers (IEEE), 1977
- Linear prediction: A tutorial review. Proceedings of the IEEE, 1975