80 Million Tiny Images: A Large Data Set for Nonparametric Object and Scene Recognition

30 May 2008

journal article
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 30 (11), 1958-1970
https://doi.org/10.1109/tpami.2008.128

Abstract

With the advent of the Internet, billions of images are now freely available online and constitute a dense sampling of the visual world. Using a variety of nonparametric methods, we explore this world with the aid of a large dataset of 79,302,017 images collected from the Internet. Motivated by psychophysical results showing the remarkable tolerance of the human visual system to degradations in image resolution, the images in the dataset are stored as 32 × 32 color images. Each image is loosely labeled with one of the 75,062 non-abstract nouns in English, as listed in the WordNet lexical database. Hence, the image database gives comprehensive coverage of all object categories and scenes. The semantic information from WordNet can be used in conjunction with nearest-neighbor methods to perform object classification over a range of semantic levels, minimizing the effects of labeling noise. For certain classes that are particularly prevalent in the dataset, such as people, we are able to demonstrate recognition performance comparable to class-specific Viola-Jones style detectors.
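
As an illustration only, the idea of combining nearest-neighbor search with a semantic hierarchy can be sketched as follows. This is a minimal sketch, not the authors' implementation: the toy `PARENT` hierarchy, the synthetic images, the sum-of-squared-differences distance, and the function names are all assumptions made for the example. Each of the k nearest neighbors votes for its own label and for every ancestor of that label, so neighbors with differing (possibly noisy) fine-grained labels can still agree at a coarser level.

```python
from collections import Counter

import numpy as np

# Hypothetical toy hierarchy (child -> parent); a stand-in for WordNet, not real WordNet data.
PARENT = {
    "dog": "animal",
    "cat": "animal",
    "car": "vehicle",
    "animal": "entity",
    "vehicle": "entity",
}


def ancestors(label):
    """Return the label followed by all of its ancestors up to the root."""
    chain = [label]
    while chain[-1] in PARENT:
        chain.append(PARENT[chain[-1]])
    return chain


def knn_hierarchy_votes(query, images, labels, k=3):
    """Accumulate votes at every semantic level from the k nearest neighbors."""
    # Flatten each image to a vector and cast to float to avoid uint8 wraparound.
    flat = images.reshape(len(images), -1).astype(np.float64)
    q = query.reshape(-1).astype(np.float64)
    # Sum of squared pixel differences (an assumed, simple distance).
    dists = np.sum((flat - q) ** 2, axis=1)
    nearest = np.argsort(dists, kind="stable")[:k]
    votes = Counter()
    for idx in nearest:
        # Each neighbor votes for its label and all of that label's ancestors.
        votes.update(ancestors(labels[idx]))
    return votes


# Synthetic 32 x 32 x 3 "images": constant intensities stand in for real data.
images = np.stack(
    [np.full((32, 32, 3), v, dtype=np.uint8) for v in (10, 12, 14, 200)]
)
labels = ["dog", "cat", "dog", "car"]
query = np.full((32, 32, 3), 11, dtype=np.uint8)

votes = knn_hierarchy_votes(query, images, labels, k=3)
print(votes)
```

In this toy setup the three nearest neighbors are labeled "dog", "cat", and "dog", so the fine-grained vote is split while all three neighbors agree on "animal". That is the sense in which classifying at a higher semantic level can reduce the effect of labeling noise.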

Cited by 1350 articles