Article

Bitonic Sorting

Demonstrates how to implement an efficient sorting routine with the OpenCL™ technology that operates on arbitrary input array of integer values. The sample uses properties of bitonic sequence and principles of sorting networks and enables efficient SIMD-style parallelism through OpenCL vector data types. The code is designed to work well on modern CPUs.
Authored by Last updated on 05/31/2019 - 14:40
Article

Simple Optimizations of OpenCL™ Code

Simple Optimizations sample demonstrates simple ways of measuring the performance of OpenCL™ kernels in an application. It describes basics of profiling and important caveats like having dedicated “warming” run. It also demonstrates several simple optimizations, some of optimizations are rather CPU-specific (like mapping buffers), while others are more general (like using relaxed-math). The...
Authored by Last updated on 05/31/2019 - 14:10
Article

Using Basic Capabilities of Multi-Device Systems with OpenCL™

Download for Windows*

Authored by Last updated on 05/31/2019 - 14:10
Article

GPU-Quicksort in OpenCL 2.0: Nested Parallelism and Work-Group Scan Functions

Introduction A Brief History of Quicksort
Authored by Robert I. (Intel) Last updated on 05/31/2019 - 14:20
Article

Sierpiński Carpet in OpenCL* 2.0

We demonstrate how to create a Sierpinski Carpet in OpenCL* 2.0

Authored by Robert I. (Intel) Last updated on 05/31/2019 - 14:20
Article

The Generic Address Space in OpenCL™ 2.0

Introduction What is the Generic Address Space?
Authored by Adam Lake (Intel) Last updated on 07/03/2019 - 10:34
Article

Using OpenCL™ 2.0 Read-Write Images

While Image convolution is not as effective with the new Read-Write images functionality, any image processing technique that needs be done in place may benefit from the Read-Write images. One example of a process that could be used effectively is image composition. In OpenCL 1.2 and earlier, images were qualified with the “__read_only” and __write_only” qualifiers. In the OpenCL 2.0, images can...
Authored by Last updated on 05/31/2019 - 14:20
Blog post

opencl_node basic interfaces and opencl_buffer

This post continues a series of articles that describes the opencl_node, a new node available in the Intel® Threading Building Blocks (Intel® TBB) library since version 4.4 Update 2.

Authored by Alex (Intel) Last updated on 05/30/2018 - 07:08
Article

A Runtime Generated FFT for Intel® Processor Graphics

Download the code

Authored by Dan Petre (Intel) Last updated on 07/06/2019 - 20:30
Article

SPIR-V is a better SPIR with Intel® OpenCL™ Code Builder

Download the pdf version of the article

Authored by Robert I. (Intel) Last updated on 07/08/2019 - 14:19