Professional Documents
Culture Documents
GPU??
graphics
processing unit
2
Background
In just few years the GPUs had a great evolution
evolution.
Problems:
-the GPU could only be programmed using graphics API;
-the GPU DRAM could be read in a general way but could not be written in a
general way;
-some applications were bottlenecked by the DRAM memory bandwidth;
3
Hardware and software
4
Access memory
CUDA provides general DRAM memory addressing both for scatter and gather
memory operations, just like on a CPU.
Gather
Scatter
5
Shared Memory
In addition it features a parallel data cache/on-chip shared memory with a very fast
general read and write access, used by
g y threads to share data with each other.
6
Programming Model & Hardware Implementation
A kernel
k l is
i a function
f ti that
th t is
i executed
t d
on the graphic device as many different
threads.
7
Programming Model & Hardware Implementation
8
Programming Model & Hardware Implementation
9
Application Programming Interface
10
Optimization guidelines
11
Bibliography
- en.wikipedia.org/wiki/CUDA
- en.wikipedia.org/wiki/GPGPU
- www.gpgpu.org/developer/
www gpgpu org/developer/
12