Lip modeling for visual speech recognition

17 December 2002

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 1 (10586393), 587-590
https://doi.org/10.1109/acssc.1994.471520

Abstract

In this paper, we describe an algorithm for modeling the shape of the mouth, and extracting meaningful dimensions for use by automatic lipreading systems. One advantage of this technique lies in the ability to normalize the model to compensate for scale and rotation. An error function is defined which relates the model to the image, and minimization of the error yields the best fit model. This is similar to deformable templates, but we attempt to perform the minimization in closed form. Visual only recognition was performed with features extracted from the model, and the recognition system achieved 85% accuracy on a two word discrimination task.

Keywords

This publication has 3 references indexed in Scilit:

Neural network lipreading system for improved speech recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
Feature extraction from faces using deformable templates
International Journal of Computer Vision, 1992
Automatic optically-based recognition of speech
Pattern Recognition Letters, 1988

Cited by 14 articles