Video Google: a text retrieval approach to object matching in videos
Top Cited Papers
- 1 January 2003
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 1470-1477 vol.2
- https://doi.org/10.1109/iccv.2003.1238663
Abstract
We describe an approach to object and scene retrieval which searches for and localizes all the occurrences of a user outlined object in a video. The object is represented by a set of viewpoint invariant region descriptors so that recognition can proceed successfully despite changes in viewpoint, illumination and partial occlusion. The temporal continuity of the video within a shot is used to track the regions in order to reject unstable regions and reduce the effects of noise in the descriptors. The analogy with text retrieval is in the implementation where matches on descriptors are pre-computed (using vector quantization), and inverted file systems and document rankings are used. The result is that retrieved is immediate, returning a ranked list of key frames/shots in the manner of Google. The method is illustrated for matching in two full length feature films.Keywords
This publication has 9 references indexed in Scilit:
- Local feature view clustering for 3D object recognitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- A performance evaluation of local descriptorsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2003
- Reliable feature matching across widely separated viewsPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2002
- Robust Wide Baseline Stereo from Maximally Stable Extremal RegionsPublished by British Machine Vision Association and Society for Pattern Recognition ,2002
- Content-based query of image databases: inspirations from text retrievalPattern Recognition Letters, 2000
- Wide Baseline Stereo Matching based on Local, Affinely Invariant RegionsPublished by British Machine Vision Association and Society for Pattern Recognition ,2000
- Object recognition from local scale-invariant featuresPublished by Institute of Electrical and Electronics Engineers (IEEE) ,1999
- The anatomy of a large-scale hypertextual Web search engineComputer Networks and ISDN Systems, 1998
- Local grayvalue invariants for image retrievalIEEE Transactions on Pattern Analysis and Machine Intelligence, 1997