Abstract
The authors study various problems related to smoothing bigram probabilities for natural language modeling: the type of interpolation, i.e., linear vs. nonlinear, the optimal estimation of interpolation parameters, and the use of word equivalence classes (parts of speech). A nonlinear interpolation method is proposed that yields significant improvements over linear interpolation in the experimental tests. It is shown that the leaving-one-out method, in combination with the maximum likelihood criterion, can be used efficiently for the optimal estimation of interpolation parameters. In addition, an automatic clustering procedure is developed for finding word equivalence classes using a maximum likelihood criterion. Experimental results are presented for two text databases: a German database with 100,000 words and an English database with 1.1 million words.
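To illustrate the kind of estimation the abstract refers to, the following is a minimal sketch of smoothing bigram probabilities by linear interpolation with a unigram distribution, where the single interpolation weight is chosen by maximizing a leaving-one-out log-likelihood on the training text. The toy corpus, the single global weight `lam`, the specific normalizations, and the coarse grid search are all illustrative assumptions, not the authors' exact formulation (which also covers nonlinear interpolation and word classes).

```python
# Illustrative sketch (not the paper's method): linear interpolation of
# bigram and unigram relative frequencies, with the weight estimated by
# leaving-one-out maximum likelihood on the training corpus itself.
from collections import Counter
from math import log

corpus = "the cat sat on the mat the cat ate".split()  # assumed toy data
N = len(corpus)

unigrams = Counter(corpus)
histories = Counter(corpus[:-1])            # counts of v as a bigram history
bigrams = Counter(zip(corpus, corpus[1:]))  # bigram type counts

def loo_log_likelihood(lam):
    """Leaving-one-out log-likelihood of the training bigrams: each
    occurrence is scored with its own count removed from the statistics.
    All occurrences of a type score identically, so we weight by count."""
    ll = 0.0
    for (v, w), c_vw in bigrams.items():
        c_v = histories[v] - 1
        p_bi = (c_vw - 1) / c_v if c_v > 0 else 0.0
        p_uni = (unigrams[w] - 1) / (N - 1)
        p = lam * p_bi + (1.0 - lam) * p_uni
        ll += c_vw * log(p if p > 0.0 else 1e-12)  # floor guards zero-prob events
    return ll

# Coarse grid search for the weight maximizing the leaving-one-out likelihood.
best_lam = max((i / 100 for i in range(1, 100)), key=loo_log_likelihood)
print(f"estimated interpolation weight: {best_lam:.2f}")
```

On this toy corpus the search settles on a weight around 0.33: singleton bigrams whose remaining count drops to zero push probability mass toward the unigram term, which is exactly the effect leaving-one-out estimation is meant to capture.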