This repo contains GPU code written while learning CUDA, with an intent to demonstrate the value of GPU acceleration while creating a basis of code for accelerating matrix operations. This code was benchmarked on the XSEDE supercomputer for all final results.
To compile, simply run nvcc <filename>
CUDA By Example: an introduction to general-purpose GPU programming, by Jason Sanders and Edward Kandrot
An Even Easier Introduction to CUDA by Mark Harris