Фильтры

Article

Optimizing Correlation Analysis of Financial Market Data Streams Using Intel® Math Kernel Library

Download Source Code
Автор: Zhang, Zhang (Intel) Последнее обновление: 06.07.2019 - 16:30
Article

Fortran vs. C Offload Directives and Functions

This is a "cheatsheet" comparing the Fortran and C++ offload directives and functions in the context of programming for the Intel® Xeon Phi™ coprocessor

Автор: Belinda Liviero (Intel) Последнее обновление: 08.07.2019 - 15:02
Article

Many Faces of Parallelism

Many Faces of Parallelism: Porting Programs to the Intel® Many Integrated Core Architecture

Автор: админ Последнее обновление: 21.03.2019 - 12:00
Article

OpenCL™ Platform/Device Capabilities Viewer Sample

Download for Windows*

Автор: Последнее обновление: 31.05.2019 - 14:10
Article

General Matrix Multiply Sample

General Matrix Multiply (GEMM) sample demonstrates how to efficiently utilize an OpenCL™ device to perform general matrix multiply operation on two dense square matrices. The primary target devices that are suitable for this sample are the devices with cache memory: Intel® Xeon Phi™ and Intel® Architecture CPU devices.
Автор: Последнее обновление: 31.05.2019 - 14:40
Article

Median Filter

The sample demonstrates how to implement efficient median filter with OpenCL™ standard. This implementation relies on auto-vectorization performed by Intel® SDK for OpenCL Applications compiler.
Автор: Последнее обновление: 31.05.2019 - 14:40
Article

HDR Rendering with God Rays Using OpenCL™ Technology

This sample demonstrates a CPU-optimized implementation of the God Rays effect, showing how to: Implement calculation kernels using the OpenCL™ technology C99 Parallelize the kernels by running several work-groups in parallel Organize data exchange between the host and the OpenCL device
Автор: Последнее обновление: 31.05.2019 - 14:10
Article

Bitonic Sorting

Demonstrates how to implement an efficient sorting routine with the OpenCL™ technology that operates on arbitrary input array of integer values. The sample uses properties of bitonic sequence and principles of sorting networks and enables efficient SIMD-style parallelism through OpenCL vector data types. The code is designed to work well on modern CPUs.
Автор: Последнее обновление: 31.05.2019 - 14:40
Article

Simple Optimizations of OpenCL™ Code

Simple Optimizations sample demonstrates simple ways of measuring the performance of OpenCL™ kernels in an application. It describes basics of profiling and important caveats like having dedicated “warming” run. It also demonstrates several simple optimizations, some of optimizations are rather CPU-specific (like mapping buffers), while others are more general (like using relaxed-math). The...
Автор: Последнее обновление: 31.05.2019 - 14:10
Блоги

Applying Intel® Threading Building Blocks Observers for Thread Affinity on Intel® Xeon Phi™ Coprocessors

In spite of the fact that the Intel® Threading Building Blocks (Intel® TBB) library [1] [2] provides high-level task based parallelism intended to hide sof

Автор: Alex (Intel) Последнее обновление: 01.08.2019 - 09:30