Performance and Scalability of GPU-Based Convolutional Neural Networks

Abstract
In this paper we present the implementation of a framework for accelerating training and classification of arbitrary Convolutional Neural Networks (CNNs) on the GPU. CNNs are a derivative of standard Multilayer Perceptron (MLP) neural networks, optimized for two-dimensional pattern recognition problems such as Optical Character Recognition (OCR) or face detection. We describe the basic parts of a CNN and demonstrate the performance and scalability improvements that can be achieved by shifting the computation-intensive tasks of a CNN to the GPU. Depending on the network topology, training and classification on the GPU perform 2 to 24 times faster than on the CPU. Furthermore, the GPU version scales much better than the CPU implementation with respect to network size.
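The abstract refers to shifting the computation-intensive tasks of a CNN to the GPU; the dominant such task is the 2D convolution performed in each convolutional layer. The following is a minimal CUDA sketch of that idea, not the authors' framework: a naive kernel in which each GPU thread computes one output pixel of a single-channel "valid" convolution. All names, sizes, and the one-thread-per-output-pixel mapping are illustrative assumptions.

```cuda
#include <cuda_runtime.h>
#include <cstdio>

// Naive 2D "valid" convolution: each thread computes one output pixel.
// in:  inH x inW single-channel input feature map
// k:   kH x kW kernel weights
// out: (inH - kH + 1) x (inW - kW + 1) output feature map
__global__ void conv2d_valid(const float *in, const float *k, float *out,
                             int inH, int inW, int kH, int kW)
{
    int outW = inW - kW + 1;
    int outH = inH - kH + 1;
    int x = blockIdx.x * blockDim.x + threadIdx.x;  // output column
    int y = blockIdx.y * blockDim.y + threadIdx.y;  // output row
    if (x >= outW || y >= outH) return;

    float sum = 0.0f;
    for (int i = 0; i < kH; ++i)
        for (int j = 0; j < kW; ++j)
            sum += in[(y + i) * inW + (x + j)] * k[i * kW + j];
    out[y * outW + x] = sum;
}

int main()
{
    // Toy sizes chosen for illustration (e.g. a 28x28 map with a 5x5 kernel).
    const int inH = 28, inW = 28, kH = 5, kW = 5;
    const int outH = inH - kH + 1, outW = inW - kW + 1;

    float hIn[inH * inW], hK[kH * kW], hOut[outH * outW];
    for (int i = 0; i < inH * inW; ++i) hIn[i] = 1.0f;
    for (int i = 0; i < kH * kW; ++i)   hK[i]  = 1.0f / (kH * kW);

    float *dIn, *dK, *dOut;
    cudaMalloc(&dIn,  sizeof(hIn));
    cudaMalloc(&dK,   sizeof(hK));
    cudaMalloc(&dOut, sizeof(hOut));
    cudaMemcpy(dIn, hIn, sizeof(hIn), cudaMemcpyHostToDevice);
    cudaMemcpy(dK,  hK,  sizeof(hK),  cudaMemcpyHostToDevice);

    // One thread per output pixel, 16x16 threads per block.
    dim3 block(16, 16);
    dim3 grid((outW + block.x - 1) / block.x, (outH + block.y - 1) / block.y);
    conv2d_valid<<<grid, block>>>(dIn, dK, dOut, inH, inW, kH, kW);

    cudaMemcpy(hOut, dOut, sizeof(hOut), cudaMemcpyDeviceToHost);
    printf("out[0][0] = %f\n", hOut[0]);  // expect 1.0 for this toy input

    cudaFree(dIn); cudaFree(dK); cudaFree(dOut);
    return 0;
}
```

Because every output pixel is independent, the convolution maps naturally onto thousands of GPU threads, which is why the speedup over a CPU grows with the size of the feature maps and of the network as a whole.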
