Article

Programming for Multicore and Many-core Products including Intel® Xeon® processors and Intel® Xeon Phi™ X100 Product Family coprocessors

The programming models in use today, used for multicore processors every day, are available for many-core coprocessors as well. Therefore, explaining how to program both Intel Xeon processors and Intel Xeon Phi coprocessor is best done by explaining the options for parallel programming. This paper provides the foundation for understanding how multicore processors and many-core coprocessors are...
Authored by James R. (Blackbelt) Last updated on 06/14/2019 - 12:10
Article

OpenCL™ Platform/Device Capabilities Viewer Sample

Download for Windows*

Authored by Last updated on 05/31/2019 - 14:10
Article

General Matrix Multiply Sample

General Matrix Multiply (GEMM) sample demonstrates how to efficiently utilize an OpenCL™ device to perform general matrix multiply operation on two dense square matrices. The primary target devices that are suitable for this sample are the devices with cache memory: Intel® Xeon Phi™ and Intel® Architecture CPU devices.
Authored by Last updated on 05/31/2019 - 14:40
Article

Median Filter

The sample demonstrates how to implement efficient median filter with OpenCL™ standard. This implementation relies on auto-vectorization performed by Intel® SDK for OpenCL Applications compiler.
Authored by Last updated on 05/31/2019 - 14:40
Article

HDR Rendering with God Rays Using OpenCL™ Technology

This sample demonstrates a CPU-optimized implementation of the God Rays effect, showing how to: Implement calculation kernels using the OpenCL™ technology C99 Parallelize the kernels by running several work-groups in parallel Organize data exchange between the host and the OpenCL device
Authored by Last updated on 05/31/2019 - 14:10
Article

Bitonic Sorting

Demonstrates how to implement an efficient sorting routine with the OpenCL™ technology that operates on arbitrary input array of integer values. The sample uses properties of bitonic sequence and principles of sorting networks and enables efficient SIMD-style parallelism through OpenCL vector data types. The code is designed to work well on modern CPUs.
Authored by Last updated on 05/31/2019 - 14:40
Article

Simple Optimizations of OpenCL™ Code

Simple Optimizations sample demonstrates simple ways of measuring the performance of OpenCL™ kernels in an application. It describes basics of profiling and important caveats like having dedicated “warming” run. It also demonstrates several simple optimizations, some of optimizations are rather CPU-specific (like mapping buffers), while others are more general (like using relaxed-math). The...
Authored by Last updated on 05/31/2019 - 14:10
Article

Using the Intel® SDK for OpenCL™ Applications XE 2013 with the Intel® Xeon Phi™ Coprocessor

Chapter 1 – Introduction

 

Authored by Sumedh N. (Intel) Last updated on 07/06/2019 - 16:30
Article

Using Basic Capabilities of Multi-Device Systems with OpenCL™

Download for Windows*

Authored by Last updated on 05/31/2019 - 14:10