The overhead linked to offloading work to an accelerator can be problematic, especially for short-running device kernels. Fusing multiple smaller kernels into one can be a solution to this problem, but manual implementation of fused kernels is tedious work, as it needs to be repeated for each potential combination of kernels. Codeplay have therefore developed an extension for the SYCL standard for user-driven, automatic kernel fusion. If you want to learn how to instruct the SYCL runtime to perform kernel fusion automatically for you, look no further and dive into this blog-post, which explains the extension and demonstrates its use on a simple example.
User-driven Kernel Fusion!
7
April
2023
Similar Updates
Details
Shared By
Rod Burns
Shared Date
Apr 7, 2023, 12:12:48 PM

Your Privacy
We genuinely value your privacy and only store data that you are comfortable with.
You can view and read our storage and privacy policies below. If you have any questions, please feel free to reach out to us via the contact details on the privacy policy.