Matching Local Self-Similarities across Images and Videos
Top Cited Papers
- 1 June 2007
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- No. 10636919,p. 1-8
- https://doi.org/10.1109/cvpr.2007.383198
Abstract
We present an approach for measuring similarity between visual entities (images or videos) based on matching internal self-similarities. What is correlated across images (or across video sequences) is the internal layout of local self-similarities (up to some distortions), even though the patterns generating those local self-similarities are quite different in each of the images/videos. These internal self-similarities are efficiently captured by a compact local "self-similarity descriptor"', measured densely throughout the image/video, at multiple scales, while accounting for local and global geometric distortions. This gives rise to matching capabilities of complex visual data, including detection of objects in real cluttered images using only rough hand-sketches, handling textured objects with no clear boundaries, and detecting complex actions in cluttered video data with no prior learning. We compare our measure to commonly used image-based and video-based similarity measures, and demonstrate its applicability to object detection, retrieval, and action detection.Keywords
This publication has 18 references indexed in Scilit:
- Robust Object Recognition with Cortex-Like MechanismsIEEE Transactions on Pattern Analysis and Machine Intelligence, 2007
- Unsupervised Learning of Human Action Categories Using Spatial-Temporal WordsPublished by British Machine Vision Association and Society for Pattern Recognition ,2006
- Similarity templates for detection and recognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- A performance evaluation of local descriptorsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Space-Time Behavior Based CorrelationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Capturing image structure with probabilistic index mapsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2004
- Distinctive Image Features from Scale-Invariant KeypointsInternational Journal of Computer Vision, 2004
- Space-time interest pointsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Recognition without Correspondence using Multidimensional Receptive Field HistogramsInternational Journal of Computer Vision, 2000
- Textons, contours and regions: cue integration in image segmentationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1999