Beyond sliding windows: Object localization by efficient subwindow search
Top Cited Papers
- 1 June 2008
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- No. 10636919,p. 1-8
- https://doi.org/10.1109/cvpr.2008.4587586
Abstract
Most successful object recognition systems rely on binary classification, deciding only if an object is present or not, but not providing information on the actual object location. To perform localization, one can take a sliding window approach, but this strongly increases the computational cost, because the classifier function has to be evaluated over a large set of candidate subwindows. In this paper, we propose a simple yet powerful branch-and-bound scheme that allows efficient maximization of a large class of classifier functions over all possible subimages. It converges to a globally optimal solution typically in sublinear time. We show how our method is applicable to different object detection and retrieval scenarios. The achieved speedup allows the use of classifiers for localization that formerly were considered too slow for this task, such as SVMs with a spatial pyramid kernel or nearest neighbor classifiers based on the chi2-distance. We demonstrate state-of-the-art performance of the resulting systems on the UIUC Cars dataset, the PASCAL VOC 2006 dataset and in the PASCAL VOC 2007 competition.Keywords
This publication has 18 references indexed in Scilit:
- Robust Object Detection with Interleaved Categorization and SegmentationInternational Journal of Computer Vision, 2007
- An Exemplar Model for Learning Object ClassesPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2007
- Local Features and Kernels for Classification of Texture and Object Categories: A Comprehensive StudyInternational Journal of Computer Vision, 2006
- Distinctive Image Features from Scale-Invariant KeypointsInternational Journal of Computer Vision, 2004
- Learning to detect objects in images via a sparse, part-based representationPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2004
- Object class recognition by unsupervised scale-invariant learningPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Fast recognition using adaptive subdivisions of transformation spacePublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Video Google: a text retrieval approach to object matching in videosPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Selection of scale-invariant parts for object class recognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Support vector machines for histogram-based image classificationIEEE Transactions on Neural Networks, 1999