Unsupervised Learning of Image Transformations
- 1 June 2007
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- No. 10636919,p. 1-8
- https://doi.org/10.1109/cvpr.2007.383036
Abstract
We describe a probabilistic model for learning rich, distributed representations of image transformations. The basic model is defined as a gated conditional random field that is trained to predict transformations of its inputs using a factorial set of latent variables. Inference in the model consists in extracting the transformation, given a pair of images, and can be performed exactly and efficiently. We show that, when trained on natural videos, the model develops domain specific motion features, in the form of fields of locally transformed edge filters. When trained on affine, or more general, transformations of still images, the model develops codes for these transformations, and can subsequently perform recognition tasks that are invariant under these transformations. It can also fantasize new transformations on previously unseen images. We describe several variations of the basic model and provide experimental results that demonstrate its applicability to a variety of tasks.Keywords
This publication has 10 references indexed in Scilit:
- Reducing the Dimensionality of Data with Neural NetworksScience, 2006
- Kernel information embeddingsPublished by Association for Computing Machinery (ACM) ,2006
- Contour detection based on nonclassical receptive field inhibitionIEEE Transactions on Image Processing, 2003
- Learning from one example through shared densities on transformsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Shape matching and object recognition using shape contextsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Training Products of Experts by Minimizing Contrastive DivergenceNeural Computation, 2002
- Image analogiesPublished by Association for Computing Machinery (ACM) ,2001
- Separating Style and Content with Bilinear ModelsNeural Computation, 2000
- Design and Use of Linear Models for Image Motion AnalysisInternational Journal of Computer Vision, 2000
- Hierarchical Mixtures of Experts and the EM AlgorithmNeural Computation, 1994