Multi-column deep neural networks for image classification
Top Cited Papers
- 1 June 2012
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- No. 10636919,p. 3642-3649
- https://doi.org/10.1109/cvpr.2012.6248110
Abstract
Traditional methods of computer vision and machine learning cannot match human performance on tasks such as the recognition of handwritten digits or traffic signs. Our biologically plausible, wide and deep artificial neural network architectures can. Small (often minimal) receptive fields of convolutional winner-take-all neurons yield large network depth, resulting in roughly as many sparsely connected neural layers as found in mammals between retina and visual cortex. Only winner neurons are trained. Several deep neural columns become experts on inputs preprocessed in different ways; their predictions are averaged. Graphics cards allow for fast training. On the very competitive MNIST handwriting benchmark, our method is the first to achieve near-human performance. On a traffic sign recognition benchmark it outperforms humans by a factor of two. We also improve the state-of-the-art on a plethora of common image classification benchmarks.Keywords
All Related Versions
This publication has 21 references indexed in Scilit:
- The German Traffic Sign Recognition Benchmark: A multi-class classification competitionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2011
- Performance and Scalability of GPU-Based Convolutional Neural NetworksPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2010
- Large-scale object recognition with CUDA-accelerated hierarchical neural networksPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2009
- Supervised Learning of Fuzzy ARTMAP Neural Networks Through Particle Swarm OptimisationJournal of Pattern Recognition Research, 2007
- Unconstrained handwritten character recognition using metaclasses of charactersPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Learning methods for generic object recognition with invariance to pose and lightingPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2004
- Gradient-based learning applied to document recognitionProceedings of the IEEE, 1998
- Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in positionBiological Cybernetics, 1980
- Receptive fields, binocular interaction and functional architecture in the cat's visual cortexThe Journal of Physiology, 1962
- Receptive fields of single neurones in the cat's striate cortexThe Journal of Physiology, 1959