Abstract
We address the problem that multicore DSP system doesn't support OpenCL programming. We designed compiler and proposed a runtime framework for TI multicore DSP, by which OpenCL parallel program could take advantage of multicore computing resource. Firstly, we make use of the LLVM and Clang compiler front-end to achieve source-to-source translation and in the next stage build translated kernel into the DSP dynamic module. Secondly, a new RTOS scheduler for kernel task is proposed to reduce context switch and enables switching between multiple work-item tasks. Finally we develop a software managed CACHE strategy to access distributed global memory in multiple DSP system with SRIO interconnections. The runtime effectively exposes to the user full computing resources in distributed multicore DSP for kernel execution. We evaluated the performance using some common OpenCL kernels from NVIDIA, NAS, AMD, and Parboil Benchmarks. Experimental results show that OpenCL application performs well in multicore DSP system. 1553-9105/
| Original language | English |
|---|---|
| Pages (from-to) | 4059-4070 |
| Number of pages | 12 |
| Journal | Journal of Computational Information Systems |
| Volume | 10 |
| Issue number | 10 |
| DOIs | |
| State | Published - 15 May 2014 |
Keywords
- DSP OpenCL
- Embedded computing platform
- Kernel scheduler
- OpenCL runtime
Fingerprint
Dive into the research topics of 'An OpenCL runtime and scheduler for embedded multicore DSP parallel systems'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver