Unsupervised Learning of Image Transformations

Abstract
We describe a probabilistic model for learning rich, distributed representations of image transformations. The basic model is defined as a gated conditional random field that is trained to predict transformations of its inputs using a factorial set of latent variables. Inference in the model consists of extracting the transformation given a pair of images, and can be performed exactly and efficiently. We show that, when trained on natural videos, the model develops domain-specific motion features in the form of fields of locally transformed edge filters. When trained on affine, or more general, transformations of still images, the model develops codes for these transformations and can subsequently perform recognition tasks that are invariant under them. It can also fantasize new transformations on previously unseen images. We describe several variations of the basic model and provide experimental results that demonstrate its applicability to a variety of tasks.
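The exact, efficient inference claimed above follows from the factorial structure of the latent variables: conditioned on an image pair, the latent units are independent, and each unit's posterior is a logistic function of a three-way product between the two images and that unit's gating weights. The sketch below illustrates this computation in NumPy; the function name, array shapes, and the three-way weight tensor `W` are illustrative assumptions based on the gated-Boltzmann-machine form of such models, not code from the paper.

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def infer_transformation(x, y, W, b):
    """Exact inference of the factorial transformation code.

    Given the image pair (x, y), the latent units decouple, so each
    unit's posterior is a logistic function of the three-way product
    between the two images and its gating weights.

    x : input image,        shape (I,)
    y : transformed image,  shape (J,)
    W : three-way weights,  shape (I, J, K)
    b : latent biases,      shape (K,)
    """
    # Sum over both image indices for every latent unit k.
    activation = np.einsum('i,j,ijk->k', x, y, W) + b
    return sigmoid(activation)  # h[k] = p(h_k = 1 | x, y)

# Illustrative usage with random data (dimensions are arbitrary).
rng = np.random.default_rng(0)
I, J, K = 16, 16, 8
x = rng.standard_normal(I)
y = rng.standard_normal(J)
W = 0.01 * rng.standard_normal((I, J, K))
b = np.zeros(K)
h = infer_transformation(x, y, W, b)
```

Because each latent unit is computed independently by a single pass over the weights, inference costs one tensor contraction and requires no iterative procedure, which is what makes it exact and efficient.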
