TY - JOUR
T1 - An OpenCL micro-benchmark suite for GPUs and CPUs
AU - Yan, Xin
AU - Shi, Xiaohua
AU - Wang, Lina
AU - Yang, Haiyan
N1 - Publisher Copyright:
© 2014, Springer Science+Business Media New York.
PY - 2014/8/1
Y1 - 2014/8/1
N2 - Open computing language (OpenCL) is a new industry standard for task-parallel and data-parallel heterogeneous computing on a variety of modern CPUs, GPUs, DSPs, and other microprocessor designs. OpenCL is vendor independent and hence not specialized for any particular compute device. To develop efficient OpenCL applications for the particular platform, we still need a more profound understanding of architecture features on the OpenCL model and computing devices. For this purpose, we design and implement an OpenCL micro-benchmark suite for GPUs and CPUs. In this paper, we introduce the implementations of our OpenCL micro benchmarks, and present the measuring results of hardware and software features like performance of mathematical operations, bus bandwidths, memory architectures, branch synchronizations and scalability, etc., on two multi-core CPUs, i.e. AMD Athlon II X2 250 and Intel Pentium Dual-Core E5400, and two different GPUs, i.e. NVIDIA GeForce GTX 460se and AMD Radeon HD 6850. We also compared the measuring results with existing benchmarks to demonstrate the reasonableness and correctness of our benchmark suite.
AB - Open computing language (OpenCL) is a new industry standard for task-parallel and data-parallel heterogeneous computing on a variety of modern CPUs, GPUs, DSPs, and other microprocessor designs. OpenCL is vendor independent and hence not specialized for any particular compute device. To develop efficient OpenCL applications for the particular platform, we still need a more profound understanding of architecture features on the OpenCL model and computing devices. For this purpose, we design and implement an OpenCL micro-benchmark suite for GPUs and CPUs. In this paper, we introduce the implementations of our OpenCL micro benchmarks, and present the measuring results of hardware and software features like performance of mathematical operations, bus bandwidths, memory architectures, branch synchronizations and scalability, etc., on two multi-core CPUs, i.e. AMD Athlon II X2 250 and Intel Pentium Dual-Core E5400, and two different GPUs, i.e. NVIDIA GeForce GTX 460se and AMD Radeon HD 6850. We also compared the measuring results with existing benchmarks to demonstrate the reasonableness and correctness of our benchmark suite.
KW - GPU
KW - Micro benchmark
KW - Multi-core CPU
KW - OpenCL
UR - https://www.scopus.com/pages/publications/84958103347
U2 - 10.1007/s11227-014-1112-2
DO - 10.1007/s11227-014-1112-2
M3 - 文章
AN - SCOPUS:84958103347
SN - 0920-8542
VL - 69
SP - 693
EP - 713
JO - Journal of Supercomputing
JF - Journal of Supercomputing
IS - 2
ER -