Estimating mixture models of images and inferring spatial transformations using the EM algorithm

20 January 2003

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 1, 416-422 Vol. 1
https://doi.org/10.1109/cvpr.1999.786972

Abstract

Mixture modeling and clustering algorithms are effective, simple ways to represent images using a set of data centers. However, in situations where the images include background clutter and transformations such as translation, rotation, shearing and warping, these methods extract data centers that include clutter and represent different transformations of essentially the same data. Taking face images as an example, it would be more useful for the different clusters to represent different poses and expressions, instead of cluttered versions of different translations, scales and rotations. By including clutter and transformation as unobserved, latent variables in a mixture model, we obtain a new "transformed mixture of Gaussians", which is invariant to a specified set of transformations. We show how a linear-time EM algorithm can be used to fit this model by jointly estimating a mixture model for the data and inferring the transformation for each image. We show that this algorithm can jointly align images of a human head and learn different poses. We also find that the algorithm performs better than k-nearest neighbors and mixtures of Gaussians on handwritten digit recognition.

Keywords

This publication has 9 references indexed in Scilit:

Mixture models for optical flow computation
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Rotation invariant neural network-based face detection
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Smoothness in layers: Motion segmentation using nonparametric mixture estimation
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2002
Transformed component analysis: joint estimation of spatial transformations and image components
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1999
Probabilistic visual learning for object representation
IEEE Transactions on Pattern Analysis and Machine Intelligence, 1997
Example Based Learning for View-Based Human Face Detection.
Published by Defense Technical Information Center (DTIC) ,1994
Representing moving images with layers
IEEE Transactions on Image Processing, 1994
Eigenfaces for Recognition
Journal of Cognitive Neuroscience, 1991
EM algorithms for ML factor analysis
Psychometrika, 1982

Cited by 32 articles