Multi-column deep neural networks for image classification

1 June 2012

conference paper
Published by Institute of Electrical and Electronics Engineers (IEEE)

No. 10636919, p. 3642-3649
https://doi.org/10.1109/cvpr.2012.6248110

Abstract

Traditional methods of computer vision and machine learning cannot match human performance on tasks such as the recognition of handwritten digits or traffic signs. Our biologically plausible, wide and deep artificial neural network architectures can. Small (often minimal) receptive fields of convolutional winner-take-all neurons yield large network depth, resulting in roughly as many sparsely connected neural layers as found in mammals between retina and visual cortex. Only winner neurons are trained. Several deep neural columns become experts on inputs preprocessed in different ways; their predictions are averaged. Graphics cards allow for fast training. On the very competitive MNIST handwriting benchmark, our method is the first to achieve near-human performance. On a traffic sign recognition benchmark it outperforms humans by a factor of two. We also improve the state-of-the-art on a plethora of common image classification benchmarks.
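The averaging step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the names `mcdnn_predict`, `predict_class`, and the `(preprocess, network)` pair layout are hypothetical, and each trained column is assumed to be available as a plain function that returns per-class probabilities.

```python
from typing import Callable, List, Sequence, Tuple

Image = Sequence[float]
Preprocess = Callable[[Image], Image]      # hypothetical per-column preprocessing
Network = Callable[[Image], List[float]]   # hypothetical column: image -> class probabilities


def mcdnn_predict(columns: Sequence[Tuple[Preprocess, Network]],
                  image: Image) -> List[float]:
    """Average the per-class outputs of all columns for one image."""
    if not columns:
        raise ValueError("at least one column is required")
    # Each column sees the image through its own preprocessing.
    outputs = [network(preprocess(image)) for preprocess, network in columns]
    n_classes = len(outputs[0])
    # Unweighted mean over columns, computed class by class.
    return [sum(out[c] for out in outputs) / len(outputs)
            for c in range(n_classes)]


def predict_class(columns: Sequence[Tuple[Preprocess, Network]],
                  image: Image) -> int:
    """Return the index of the class with the highest averaged score."""
    averaged = mcdnn_predict(columns, image)
    return max(range(len(averaged)), key=lambda c: averaged[c])
```

In this sketch a single column can favor the wrong class and still be outvoted, because the decision is taken on the mean of all column outputs rather than on any one of them.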

All Related Versions

Version 1, 2012-02-13, arXiv (unconfirmed version)

This publication has 21 references indexed in Scilit:

The German Traffic Sign Recognition Benchmark: A multi-class classification competition
Published by Institute of Electrical and Electronics Engineers (IEEE), 2011
Performance and Scalability of GPU-Based Convolutional Neural Networks
Published by Institute of Electrical and Electronics Engineers (IEEE), 2010
Large-scale object recognition with CUDA-accelerated hierarchical neural networks
Published by Institute of Electrical and Electronics Engineers (IEEE), 2009
Supervised Learning of Fuzzy ARTMAP Neural Networks Through Particle Swarm Optimisation
Journal of Pattern Recognition Research, 2007
Unconstrained handwritten character recognition using metaclasses of characters
Published by Institute of Electrical and Electronics Engineers (IEEE), 2005
Learning methods for generic object recognition with invariance to pose and lighting
Published by Institute of Electrical and Electronics Engineers (IEEE), 2004
Gradient-based learning applied to document recognition
Proceedings of the IEEE, 1998
Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position
Biological Cybernetics, 1980
Receptive fields, binocular interaction and functional architecture in the cat's visual cortex
The Journal of Physiology, 1962
Receptive fields of single neurones in the cat's striate cortex
The Journal of Physiology, 1959

Cited by 1943 articles