Filtros

Article

Workshop: Optimizing OpenCL applications for Intel® Xeon Phi™ Coprocessor

The Intel® Xeon Phi™ Coprocessor is designed for highly parallel, high performance demanding applications.

Autor Arik Narkis (Intel) Última actualización 06/07/2019 - 16:30
Article

Simple Optimizations of OpenCL™ Code

Simple Optimizations sample demonstrates simple ways of measuring the performance of OpenCL™ kernels in an application. It describes basics of profiling and important caveats like having dedicated “warming” run. It also demonstrates several simple optimizations, some of optimizations are rather CPU-specific (like mapping buffers), while others are more general (like using relaxed-math). The...
Autor Última actualización 31/05/2019 - 14:10
Article
Video

Optimizing Simple OpenCL Kernels: Sobel Kernel Optimization

Robert Ioffe describes a consistent series of optimizations that improve OpenCL kernel performance on Intel®

Autor Robert I. (Intel) Última actualización 06/07/2019 - 20:30
Video

Optimizing Simple OpenCL™ Kernels: Modulate Kernel Optimization

Robert Ioffe describes a consistent series of optimizations that improve OpenCL kernel performance on Intel®

Autor Robert I. (Intel) Última actualización 06/07/2019 - 20:30
Article

GPU-Quicksort in OpenCL 2.0: Nested Parallelism and Work-Group Scan Functions

Introduction A Brief History of Quicksort
Autor Robert I. (Intel) Última actualización 31/05/2019 - 14:20
Article

Using Intel® SDK for OpenCL™ Applications 2015 to Accelerate Your Software

Introduction

Autor Robert I. (Intel) Última actualización 31/05/2019 - 14:20
Article