Article
Article

OpenCL™ Platform/Device Capabilities Viewer Sample

Download for Windows*

Autor Última actualización 31/05/2019 - 14:10
Article

General Matrix Multiply Sample

General Matrix Multiply (GEMM) sample demonstrates how to efficiently utilize an OpenCL™ device to perform general matrix multiply operation on two dense square matrices. The primary target devices that are suitable for this sample are the devices with cache memory: Intel® Xeon Phi™ and Intel® Architecture CPU devices.
Autor Última actualización 31/05/2019 - 14:40
Article

Median Filter

The sample demonstrates how to implement efficient median filter with OpenCL™ standard. This implementation relies on auto-vectorization performed by Intel® SDK for OpenCL Applications compiler.
Autor Última actualización 31/05/2019 - 14:40
Article

HDR Rendering with God Rays Using OpenCL™ Technology

This sample demonstrates a CPU-optimized implementation of the God Rays effect, showing how to: Implement calculation kernels using the OpenCL™ technology C99 Parallelize the kernels by running several work-groups in parallel Organize data exchange between the host and the OpenCL device
Autor Última actualización 31/05/2019 - 14:10
Article

Bitonic Sorting

Demonstrates how to implement an efficient sorting routine with the OpenCL™ technology that operates on arbitrary input array of integer values. The sample uses properties of bitonic sequence and principles of sorting networks and enables efficient SIMD-style parallelism through OpenCL vector data types. The code is designed to work well on modern CPUs.
Autor Última actualización 31/05/2019 - 14:40
Article

Simple Optimizations of OpenCL™ Code

Simple Optimizations sample demonstrates simple ways of measuring the performance of OpenCL™ kernels in an application. It describes basics of profiling and important caveats like having dedicated “warming” run. It also demonstrates several simple optimizations, some of optimizations are rather CPU-specific (like mapping buffers), while others are more general (like using relaxed-math). The...
Autor Última actualización 31/05/2019 - 14:10
Mensajes en el blog

Applying Intel® Threading Building Blocks Observers for Thread Affinity on Intel® Xeon Phi™ Coprocessors

In spite of the fact that the Intel® Threading Building Blocks (Intel® TBB) library [1] [2] provides high-level task based parallelism intended to hide sof

Autor Alex (Intel) Última actualización 01/08/2019 - 09:30
Article

Using Basic Capabilities of Multi-Device Systems with OpenCL™

Download for Windows*

Autor Última actualización 31/05/2019 - 14:10
Article

Improve Vectorization Performance with Intel® AVX-512

See how the new Intel® Advanced Vector Extensions 512CD and the Intel AVX512F subsets (available in the Intel® Xeon Phi processor and in future Intel Xeon processors) lets the compiler automatically generate vector code with no changes to the code.
Autor Alberto V. (Intel) Última actualización 08/07/2019 - 19:26