Article

Simple Optimizations of OpenCL™ Code

Simple Optimizations sample demonstrates simple ways of measuring the performance of OpenCL™ kernels in an application. It describes basics of profiling and important caveats like having dedicated “warming” run. It also demonstrates several simple optimizations, some of optimizations are rather CPU-specific (like mapping buffers), while others are more general (like using relaxed-math). The...
Authored by Last updated on 05/31/2019 - 14:10
Article

GPU-Quicksort in OpenCL 2.0: Nested Parallelism and Work-Group Scan Functions

Introduction A Brief History of Quicksort
Authored by Robert I. (Intel) Last updated on 05/31/2019 - 14:20
Article

GPU-Quicksort в OpenCL 2.0: вложенные параллельные вычисления и сканирование групп обработки

Введение Краткий курс истории алгоритма быстрой сортировки
Authored by Last updated on 05/31/2019 - 14:20
Article
Article

A Runtime Generated FFT for Intel® Processor Graphics

Download the code

Authored by Dan Petre (Intel) Last updated on 07/06/2019 - 20:30