TY - GEN
T1 - An OpenCL micro-benchmark suite for GPUs and CPUs
AU - Yan, Xin
AU - Shi, Xiaohua
AU - Sun, Qingyue
PY - 2012
Y1 - 2012
N2 - OpenCL (Open Computing Language) is the first open, royalty-free standard for cross-platform, parallel programming of modern processors in personal computers, servers and handheld/embedded devices. OpenCL is vendor-independent and hence not specialized for any particular compute device. In order to develop efficient OpenCL applications for the particular platform, we still need a more profound understanding of the architecture features on the OpenCL model and computing devices. For this purpose, we design and implement an OpenCL micro-benchmark suite for GPUs and CPUs. We introduce the implementations of our OpenCL micro benchmarks and present the performance results of hardware and software features like the bus bandwidth, memory architectures, branch architectures and thread hierarchy, etc., evaluated by our micro benchmarks on multi-core X86 CPU and NVIDIA's GPU.
AB - OpenCL (Open Computing Language) is the first open, royalty-free standard for cross-platform, parallel programming of modern processors in personal computers, servers and handheld/embedded devices. OpenCL is vendor-independent and hence not specialized for any particular compute device. In order to develop efficient OpenCL applications for the particular platform, we still need a more profound understanding of the architecture features on the OpenCL model and computing devices. For this purpose, we design and implement an OpenCL micro-benchmark suite for GPUs and CPUs. We introduce the implementations of our OpenCL micro benchmarks and present the performance results of hardware and software features like the bus bandwidth, memory architectures, branch architectures and thread hierarchy, etc., evaluated by our micro benchmarks on multi-core X86 CPU and NVIDIA's GPU.
KW - GPU
KW - OpenCL
KW - micro benchmark
KW - multi-core CPU
UR - https://www.scopus.com/pages/publications/84884624440
U2 - 10.1109/PDCAT.2012.52
DO - 10.1109/PDCAT.2012.52
M3 - 会议稿件
AN - SCOPUS:84884624440
SN - 9780769548791
T3 - Parallel and Distributed Computing, Applications and Technologies, PDCAT Proceedings
SP - 53
EP - 58
BT - Proceedings - 13th International Conference on Parallel and Distributed Computing, Applications, and Technologies, PDCAT 2012
T2 - 13th International Conference on Parallel and Distributed Computing, Applications, and Technologies, PDCAT 2012
Y2 - 14 December 2012 through 16 December 2012
ER -