Matching Local Self-Similarities across Images and Videos

Top Cited Papers

1 June 2007

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

No. 10636919,p. 1-8
https://doi.org/10.1109/cvpr.2007.383198

Abstract

We present an approach for measuring similarity between visual entities (images or videos) based on matching internal self-similarities. What is correlated across images (or across video sequences) is the internal layout of local self-similarities (up to some distortions), even though the patterns generating those local self-similarities are quite different in each of the images/videos. These internal self-similarities are efficiently captured by a compact local "self-similarity descriptor"', measured densely throughout the image/video, at multiple scales, while accounting for local and global geometric distortions. This gives rise to matching capabilities of complex visual data, including detection of objects in real cluttered images using only rough hand-sketches, handling textured objects with no clear boundaries, and detecting complex actions in cluttered video data with no prior learning. We compare our measure to commonly used image-based and video-based similarity measures, and demonstrate its applicability to object detection, retrieval, and action detection.

Keywords

This publication has 18 references indexed in Scilit:

Robust Object Recognition with Cortex-Like Mechanisms
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2007
Unsupervised Learning of Human Action Categories Using Spatial-Temporal Words
Published by British Machine Vision Association and Society for Pattern Recognition ,2006
Similarity templates for detection and recognition
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
A performance evaluation of local descriptors
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
Space-Time Behavior Based Correlation
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2005
Capturing image structure with probabilistic index maps
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2004
Distinctive Image Features from Scale-Invariant Keypoints
International Journal of Computer Vision, 2004
Space-time interest points
Published by Institute of Electrical and Electronics Engineers (IEEE) ,2003
Recognition without Correspondence using Multidimensional Receptive Field Histograms
International Journal of Computer Vision, 2000
Textons, contours and regions: cue integration in image segmentation
Published by Institute of Electrical and Electronics Engineers (IEEE) ,1999

Cited by 740 articles