Filters

Article

Simple Optimizations of OpenCL™ Code

Simple Optimizations sample demonstrates simple ways of measuring the performance of OpenCL™ kernels in an application. It describes basics of profiling and important caveats like having dedicated “warming” run. It also demonstrates several simple optimizations, some of optimizations are rather CPU-specific (like mapping buffers), while others are more general (like using relaxed-math). The...
Authored by Last updated on 05/31/2019 - 14:10
Article

GPU-Quicksort in OpenCL 2.0: Nested Parallelism and Work-Group Scan Functions

Introduction A Brief History of Quicksort
Authored by Robert I. (Intel) Last updated on 05/31/2019 - 14:20
Article
Article

Using SPIR for fun and profit with Intel® OpenCL™ Code Builder

This short tutorial provides a brief introduction to Khronos SPIR. It touches on the differences between a SPIR binary and an Intel proprietary Intermediate Binary, demonstrates ways to create SPIR binaries using tools shipped with Intel® SDK for OpenCL™ Applications , and explains how to use SPIR binaries in your OpenCL program.
Authored by Robert I. (Intel) Last updated on 05/31/2019 - 14:20
Article

SGEMM for Intel® Processor Graphics

Introduction

General Matrix Multiply

cl_intel_subgroups Extension

Authored by Last updated on 05/17/2019 - 12:00
Article

GPU-Quicksort в OpenCL 2.0: вложенные параллельные вычисления и сканирование групп обработки

Введение Краткий курс истории алгоритма быстрой сортировки
Authored by Last updated on 05/31/2019 - 14:20
Article

Cómo compartir superficies entre OpenCL™ y DirectX* 11 en Intel® Processor Graphics

Descargar PDF

Authored by Last updated on 05/31/2019 - 15:10
Article
Article

Introduction to GEN Assembly

Authored by Robert I. (Intel) Last updated on 05/17/2019 - 12:00