Large-scale object recognition with CUDA-accelerated hierarchical neural networks
- 1 November 2009
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- Vol. 2766, 536-541
- https://doi.org/10.1109/icicisys.2009.5357786
Abstract
Robust recognition of arbitrary object classes in natural visual scenes is an aspiring goal with numerous practical applications, for instance, in the area of autonomous robotics and autonomous vehicles. One obstacle on the way towards human-like recognition performance is the limitation of computational power, restricting the size of the training and testing dataset as well as the complexity of the object recognition system. In this work, we present a hierarchical, locally-connected neural network model that is well-suited for large-scale, high-performance object recognition. By using the NVIDIA CUDA framework, we create a massively parallel implementation of the model which is executed on a state-of-the-art graphics card. This implementation is up to 82 times faster than a single-core CPU version of the system. This significant gain in computational performance allows us to evaluate the model on a very large, realistic, and challenging set of natural images which we extracted from the LabelMe dataset. To compare our model to other approaches, we also evaluate the recognition performance using the well-known MNIST and NORB datasets, achieving a testing error rate of 0.76% and 2.87%, respectively.Keywords
This publication has 8 references indexed in Scilit:
- Why is Real-World Visual Object Recognition Hard?PLoS Computational Biology, 2008
- LabelMe: A Database and Web-Based Tool for Image AnnotationInternational Journal of Computer Vision, 2007
- Learning generative visual models from few training examples: An incremental Bayesian approach tested on 101 object categoriesComputer Vision and Image Understanding, 2007
- Learning methods for generic object recognition with invariance to pose and lightingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2004
- Hierarchical Neural Networks for Image InterpretationLecture Notes in Computer Science, 2003
- Efficient BackPropPublished by Springer Nature ,1998
- Gradient-based learning applied to document recognitionProceedings of the IEEE, 1998
- The perceptron: A probabilistic model for information storage and organization in the brain.Psychological Review, 1958