Speech Data Rate Reduction Part I: Applicability of Modern Estimation Theory
- 1 March 1973
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Aerospace and Electronic Systems
- Vol. AES-9 (2), 130-138
- https://doi.org/10.1109/TAES.1973.309760
Abstract
Efficient coding of continuous speech signals for digital representation has attracted much interest in recent years. The underlying aim of efficient coding methods is to reduce the channel capacity required to represent a signal to meet a specific reconstruction fidelity criterion. To achieve this objective, modern speech data compression techniques rely on two very similar procedures. One procedure uses predictive deconvolution which subtracts from the current signal value that portion which can be predicted from its past and thus removes redundancy in the speech by removing sequential correlation. The signal thus requires fewer bits for equivalent quantization error. The second procedure involves identification of a complete mathematical model of the speech producing mechanism. This involves determination of the characteristics of the source that drives this transfer function. Data reduction is again achieved since the rate of change of the parameters of the speech model is much smaller than the rate of change of the speech waveform. This paper develops these data reduction procedures in terms of modern estimation theory, specifically a Kalman filter model, and illustrates the utility of this model as an analysis tool by means of an example based on a uniform tube which provides a qualitative assessment of the potential of the technique for application to real speech signals.Keywords
This publication has 5 references indexed in Scilit:
- Improved quantizer for adaptive predictive coding of speech signals at low bit ratesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Speech Analysis Synthesis and PerceptionPublished by Springer Nature ,1972
- Speech Analysis and Synthesis by Linear Prediction of the Speech WaveThe Journal of the Acoustical Society of America, 1971
- Synthetic voices for computersIEEE Spectrum, 1970
- Quantizing for minimum distortionIEEE Transactions on Information Theory, 1960