Large-scale object recognition with CUDA-accelerated hierarchical neural networks

1 November 2009

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

Vol. 2766, 536-541
https://doi.org/10.1109/icicisys.2009.5357786

Abstract

Robust recognition of arbitrary object classes in natural visual scenes is an ambitious goal with numerous practical applications, for instance in autonomous robotics and autonomous vehicles. One obstacle on the way towards human-like recognition performance is limited computational power, which restricts the size of the training and test datasets as well as the complexity of the object recognition system. In this work, we present a hierarchical, locally-connected neural network model that is well suited for large-scale, high-performance object recognition. Using the NVIDIA CUDA framework, we create a massively parallel implementation of the model, which is executed on a state-of-the-art graphics card. This implementation is up to 82 times faster than a single-core CPU version of the system. This gain in computational performance allows us to evaluate the model on a very large, realistic, and challenging set of natural images that we extracted from the LabelMe dataset. To compare our model to other approaches, we also evaluate its recognition performance on the well-known MNIST and NORB datasets, achieving test error rates of 0.76% and 2.87%, respectively.
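As an illustration of what "locally-connected" means here, the following is a minimal sketch of the forward pass of a single layer in which each output unit sees only a small patch of its input. It is not the authors' implementation: the function name `local_layer`, the per-position (unshared) weights, the `tanh` activation, and the parameters `k` and `stride` are all assumptions chosen for illustration, since the abstract does not specify them.

```python
import math

def local_layer(image, weights, biases, k, stride):
    """Forward pass of one locally-connected layer on a 2-D input (hypothetical sketch).

    image   : list of rows, H x W floats
    weights : weights[i][j] is the k x k weight patch of output unit (i, j)
              (assumption: weights are not shared between positions)
    biases  : biases[i][j] is the bias of output unit (i, j)
    """
    h, w = len(image), len(image[0])
    out_h = (h - k) // stride + 1
    out_w = (w - k) // stride + 1
    out = [[0.0] * out_w for _ in range(out_h)]
    for i in range(out_h):
        for j in range(out_w):
            # Each output unit reads only its own k x k receptive field.
            acc = biases[i][j]
            for di in range(k):
                for dj in range(k):
                    acc += weights[i][j][di][dj] * image[i * stride + di][j * stride + dj]
            out[i][j] = math.tanh(acc)  # assumed activation function
    return out
```

Because every output unit depends only on its own input patch, the two outer loops carry no dependencies between iterations. That independence is the property a massively parallel implementation can exploit, for example by assigning one GPU thread to each output unit.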

This publication has 8 references indexed in Scilit:

Why is Real-World Visual Object Recognition Hard?
PLoS Computational Biology, 2008
LabelMe: A Database and Web-Based Tool for Image Annotation
International Journal of Computer Vision, 2007
Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categories
Computer Vision and Image Understanding, 2007
Learning methods for generic object recognition with invariance to pose and lighting
Published by Institute of Electrical and Electronics Engineers (IEEE), 2004
Hierarchical Neural Networks for Image Interpretation
Lecture Notes in Computer Science, 2003
Efficient BackProp
Published by Springer Nature, 1998
Gradient-based learning applied to document recognition
Proceedings of the IEEE, 1998
The perceptron: A probabilistic model for information storage and organization in the brain.
Psychological Review, 1958

Cited by 45 articles