Article

Introduction to Intel® Advanced Vector Extensions

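Intel® Advanced Vector Extensions (Intel® AVX) is an instruction set for performing single instruction, multiple data (SIMD) operations on Intel® architecture CPUs. These instructions extend the earlier SIMD offerings (MMX™ instructions and Intel® SSE). This article gives an overview of the AVX instruction set.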
Authored by admin Last updated on 03/16/2017 - 06:46
Article

OpenCL™ Technology and Intel® Media SDK Interoperability

The Intel® Media SDK Interoperability sample demonstrates how to use Intel® Media SDK and OpenCL™ technology together for efficient video decoding and fast post-processing.
Authored by Vadim Kartoshkin (Intel) Last updated on 03/08/2017 - 10:56
Article

Copying Accelerated Video Decode Frame Buffers

Conventional memory copying methods can yield very poor performance when data is in "uncacheable" memory, as when video decode is being done with hardware acceleration. This paper explains useful data copy optimizations for video decode operations.
Authored by THOMAS C. (Intel) Last updated on 03/07/2017 - 14:50
Article

Courseware - Visualization

Basic viewing and interrogation functions for visualization. Visualization of vector fields, tensors, and flow data.
Authored by admin Last updated on 02/14/2017 - 14:08
Article

General Matrix Multiply Sample

The General Matrix Multiply (GEMM) sample demonstrates how to use an OpenCL™ device efficiently to perform a general matrix multiply operation on two dense square matrices. The primary target devices for this sample are devices with cache memory: Intel® Xeon Phi™ and Intel® Architecture CPU devices.
Authored by Vadim Kartoshkin (Intel) Last updated on 01/26/2017 - 00:49
Article

Cross-Device NBody Simulation Sample

This sample illustrates the basic principles of how to work simultaneously with OpenCL™ devices on both CPU and Intel® Processor Graphics.
Authored by Esther Lederer (Intel) Last updated on 01/26/2017 - 00:49
Article

Median Filter

The sample demonstrates how to implement an efficient median filter with the OpenCL™ standard. This implementation relies on auto-vectorization performed by the Intel® SDK for OpenCL Applications compiler.
Authored by Vadim Kartoshkin (Intel) Last updated on 01/26/2017 - 00:49
Article

Bitonic Sorting

Demonstrates how to implement an efficient sorting routine with the OpenCL™ technology that operates on an arbitrary input array of integer values. The sample uses the properties of bitonic sequences and the principles of sorting networks, and enables efficient SIMD-style parallelism through OpenCL vector data types. The code is designed to work well on modern CPUs.
Authored by Vadim Kartoshkin (Intel) Last updated on 01/26/2017 - 00:49
Article

Simple Optimizations of OpenCL™ Code

The Simple Optimizations sample demonstrates simple ways of measuring the performance of OpenCL™ kernels in an application. It describes the basics of profiling and important caveats, such as having a dedicated "warming" run. It also demonstrates several simple optimizations; some are rather CPU-specific (like mapping buffers), while others are more general (like using relaxed-math). The...
Authored by Esther Lederer (Intel) Last updated on 01/26/2017 - 00:49
Article

Hybrid HDR Tone Mapping for Post Processing Multi-Device Version

The HDR Tone Mapping for Post Processing sample features multi-device support, specifically the simultaneous use of CPU and Intel® Processor Graphics OpenCL™ devices.
Authored by Esther Lederer (Intel) Last updated on 01/26/2017 - 00:49
For more complete information about compiler optimizations, see our Optimization Notice.