Speech Data Rate Reduction Part I: Applicability of Modern Estimation Theory

1 March 1973

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Aerospace and Electronic Systems

Vol. AES-9 (2), 130-138
https://doi.org/10.1109/TAES.1973.309760

Abstract

Efficient coding of continuous speech signals for digital representation has attracted much interest in recent years. The underlying aim of efficient coding methods is to reduce the channel capacity required to represent a signal to meet a specific reconstruction fidelity criterion. To achieve this objective, modern speech data compression techniques rely on two very similar procedures. One procedure uses predictive deconvolution which subtracts from the current signal value that portion which can be predicted from its past and thus removes redundancy in the speech by removing sequential correlation. The signal thus requires fewer bits for equivalent quantization error. The second procedure involves identification of a complete mathematical model of the speech producing mechanism. This involves determination of the characteristics of the source that drives this transfer function. Data reduction is again achieved since the rate of change of the parameters of the speech model is much smaller than the rate of change of the speech waveform. This paper develops these data reduction procedures in terms of modern estimation theory, specifically a Kalman filter model, and illustrates the utility of this model as an analysis tool by means of an example based on a uniform tube which provides a qualitative assessment of the potential of the technique for application to real speech signals.

Keywords

This publication has 5 references indexed in Scilit:

Improved quantizer for adaptive predictive coding of speech signals at low bit rates
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
Speech Analysis Synthesis and Perception
Published by Springer Nature ,1972
Speech Analysis and Synthesis by Linear Prediction of the Speech Wave
The Journal of the Acoustical Society of America, 1971
Synthetic voices for computers
IEEE Spectrum, 1970
Quantizing for minimum distortion
IEEE Transactions on Information Theory, 1960

Cited by 13 articles