An OpenCL runtime and scheduler for embedded multicore DSP parallel systems

Research output: Contribution to journal › Article › peer-review

Abstract

We address the lack of OpenCL programming support on multicore DSP systems. We design a compiler and propose a runtime framework for TI multicore DSPs that allow OpenCL parallel programs to take advantage of multicore computing resources. First, we use the LLVM and Clang compiler front end to perform source-to-source translation and then build the translated kernel into a DSP dynamic module. Second, we propose a new RTOS scheduler for kernel tasks that reduces context switching and enables switching between multiple work-item tasks. Finally, we develop a software-managed cache strategy for accessing distributed global memory in multi-DSP systems with SRIO interconnections. The runtime effectively exposes the full computing resources of the distributed multicore DSPs to the user for kernel execution. We evaluate performance using common OpenCL kernels from the NVIDIA, NAS, AMD, and Parboil benchmarks. Experimental results show that OpenCL applications perform well on multicore DSP systems.
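
As a rough illustration of the execution model the abstract refers to, the sketch below simulates how an OpenCL index space might be split into work-groups, assigned to cores, and run with each work-group's work-items executed as a plain loop instead of as separately scheduled tasks. This is a generic, well-known approach and not the paper's scheduler: the function names, the round-robin assignment, and the core count of 4 are all assumptions made for the example, and the "cores" are simulated sequentially in ordinary Python.

```python
from typing import Callable, List


def vec_add_kernel(gid: int, a: List[float], b: List[float], out: List[float]) -> None:
    """Body of a hypothetical kernel: each work-item adds one element."""
    out[gid] = a[gid] + b[gid]


def run_work_group(kernel: Callable, group_id: int, local_size: int,
                   global_size: int, *args) -> None:
    """Run every work-item of one work-group as a simple loop on one core."""
    start = group_id * local_size
    # Clamp the end so a partial last group does not run past the index space.
    for gid in range(start, min(start + local_size, global_size)):
        kernel(gid, *args)


def schedule(kernel: Callable, global_size: int, local_size: int,
             num_cores: int, *args) -> List[List[int]]:
    """Assign work-groups to cores round-robin (an assumed policy) and run them.

    Returns the list of work-group ids handled by each simulated core.
    """
    num_groups = (global_size + local_size - 1) // local_size  # ceiling division
    assignment: List[List[int]] = [[] for _ in range(num_cores)]
    for group_id in range(num_groups):
        assignment[group_id % num_cores].append(group_id)

    # Cores are simulated one after another; real hardware would run them in parallel.
    for core_groups in assignment:
        for group_id in core_groups:
            run_work_group(kernel, group_id, local_size, global_size, *args)
    return assignment


if __name__ == "__main__":
    a = [1.0, 2.0, 3.0, 4.0, 5.0]
    b = [10.0] * 5
    out = [0.0] * 5
    # 5 work-items, work-groups of 2, 4 simulated cores (all hypothetical values).
    print(schedule(vec_add_kernel, 5, 2, 4, a, b, out))
    print(out)
```

Running a work-group as a loop means one task per work-group per core, which is one common way to limit context switches; kernels that use barriers need extra handling that this sketch leaves out.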

Original language: English
Pages (from-to): 4059-4070
Number of pages: 12
Journal: Journal of Computational Information Systems
Volume: 10
Issue number: 10
State: Published - 15 May 2014

Keywords

  • DSP OpenCL
  • Embedded computing platform
  • Kernel scheduler
  • OpenCL runtime
