Article

GPU-Quicksort in OpenCL 2.0: Nested Parallelism and Work-Group Scan Functions

Introduction A Brief History of Quicksort
Автор: Robert I. (Intel) Последнее обновление: 31.05.2019 - 14:20
Article

Simple Optimizations of OpenCL™ Code

Simple Optimizations sample demonstrates simple ways of measuring the performance of OpenCL™ kernels in an application. It describes basics of profiling and important caveats like having dedicated “warming” run. It also demonstrates several simple optimizations, some of optimizations are rather CPU-specific (like mapping buffers), while others are more general (like using relaxed-math). The...
Автор: Последнее обновление: 15.10.2019 - 16:50