Performance and Scalability of GPU-Based Convolutional Neural Networks
Top Cited Papers
- 1 February 2010
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- No. 10666192,p. 317-324
- https://doi.org/10.1109/pdp.2010.43
Abstract
In this paper we present the implementation of a framework for accelerating training and classification of arbitrary Convolutional Neural Networks (CNNs) on the GPU. CNNs are a derivative of standard Multilayer Perceptron (MLP) neural networks optimized for two-dimensional pattern recognition problems such as Optical Character Recognition (OCR) or face detection. We describe the basic parts of a CNN and demonstrate the performance and scalability improvement that can be achieved by shifting the computation-intensive tasks of a CNN to the GPU. Depending on the network topology training and classification on the GPU performs 2 to 24 times faster than on the CPU. Furthermore, the GPU version scales much better than the CPU implementation with respect to the network size.Keywords
This publication has 12 references indexed in Scilit:
- Chinese License Plate Recognition Using a Convolutional Neural NetworkPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2008
- Processing Neocognitron of Face Recognition on High Performance Environment Based on GPU with CUDA ArchitecturePublished by Institute of Electrical and Electronics Engineers (IEEE) ,2008
- A unified architecture for natural language processingPublished by Association for Computing Machinery (ACM) ,2008
- Fast support vector machine training and classification on graphics processorsPublished by Association for Computing Machinery (ACM) ,2008
- How GPUs WorkComputer, 2007
- Best practices for convolutional neural networks applied to visual document analysisPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2005
- Convolutional face finder: a neural architecture for fast and robust face detectionPublished by Institute of Electrical and Electronics Engineers (IEEE) ,2004
- GPU implementation of neural networksPattern Recognition, 2004
- Efficient BackPropPublished by Springer Nature ,1998
- Gradient-based learning applied to document recognitionProceedings of the IEEE, 1998