Estimating mixture models of images and inferring spatial transformations using the EM algorithm
- 20 January 2003
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 1, 416-422 Vol. 1
- https://doi.org/10.1109/cvpr.1999.786972
Abstract
Mixture modeling and clustering algorithms are effective, simple ways to represent images using a set of data centers. However, in situations where the images include background clutter and transformations such as translation, rotation, shearing and warping, these methods extract data centers that include clutter and represent different transformations of essentially the same data. Taking face images as an example, it would be more useful for the different clusters to represent different poses and expressions, instead of cluttered versions of different translations, scales and rotations. By including clutter and transformation as unobserved, latent variables in a mixture model, we obtain a new "transformed mixture of Gaussians", which is invariant to a specified set of transformations. We show how a linear-time EM algorithm can be used to fit this model by jointly estimating a mixture model for the data and inferring the transformation for each image. We show that this algorithm can jointly align images of a human head and learn different poses. We also find that the algorithm performs better than k-nearest neighbors and mixtures of Gaussians on handwritten digit recognition.Keywords
This publication has 9 references indexed in Scilit:
- Mixture models for optical flow computationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Rotation invariant neural network-based face detectionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Smoothness in layers: Motion segmentation using nonparametric mixture estimationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Transformed component analysis: joint estimation of spatial transformations and image componentsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1999
- Probabilistic visual learning for object representationIEEE Transactions on Pattern Analysis and Machine Intelligence, 1997
- Example Based Learning for View-Based Human Face Detection.Published by Defense Technical Information Center (DTIC) ,1994
- Representing moving images with layersIEEE Transactions on Image Processing, 1994
- Eigenfaces for RecognitionJournal of Cognitive Neuroscience, 1991
- EM algorithms for ML factor analysisPsychometrika, 1982