Performance Optimization on GPGPU & Multicore CPU using Roofline Model: A Recent Study
This chapter uses the roofline model to determine which platform, a multicore CPU or a general-purpose GPU (GPGPU), is better suited to training a neural network that recognises handwritten digits. The pattern-parallel training approach is applied to the MNIST dataset. On multicore CPU and GPGPU,...