Download OpenCL* Device Fission for CPU Performance [PDF 762KB]
Programming for Multicore and Many-core Products including Intel® Xeon® processors and Intel® Xeon Phi™ X100 Product Family coprocessorsThe programming models in use today, used for multicore processors every day, are available for many-core coprocessors as well. Therefore, explaining how to program both Intel Xeon processors and Intel Xeon Phi coprocessor is best done by explaining the options for parallel programming. This paper provides the foundation for understanding how multicore processors and many-core coprocessors are...
About this document
The Intel® Xeon Phi™ Coprocessor is designed for highly parallel, high performance demanding applications.
Download for Windows*
General Matrix Multiply (GEMM) sample demonstrates how to efficiently utilize an OpenCL™ device to perform general matrix multiply operation on two dense square matrices. The primary target devices that are suitable for this sample are the devices with cache memory: Intel® Xeon Phi™ and Intel® Architecture CPU devices.
Intel® SDK for OpenCL* Applications XE 2013 Release Notes Content
The sample demonstrates how to implement efficient median filter with OpenCL™ standard. This implementation relies on auto-vectorization performed by Intel® SDK for OpenCL Applications compiler.
This sample demonstrates a CPU-optimized implementation of the God Rays effect, showing how to: Implement calculation kernels using the OpenCL™ technology C99 Parallelize the kernels by running several work-groups in parallel Organize data exchange between the host and the OpenCL device
Demonstrates how to implement an efficient sorting routine with the OpenCL™ technology that operates on arbitrary input array of integer values. The sample uses properties of bitonic sequence and principles of sorting networks and enables efficient SIMD-style parallelism through OpenCL vector data types. The code is designed to work well on modern CPUs.