We have received many user requests asking how to call AVX code in IPP 6.1 and in all the latest updates of that version.
The main purpose of this short article is to explain how to do this.
Intel® Advanced Vector Extensions
Improving the Compute Performance of Video Processing Software Using AVX (Advanced Vector Extensions) Instructions (by Eli Hernandez and Larry Moore)
This paper describes a case study in which AVX instructions are used to improve the performance of a de-saturation algorithm (a common video filter). The case study takes the algorithm from a non-SIMD implementation to an AVX-based SIMD implementation.
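As a rough illustration of the non-SIMD starting point for such a filter, here is a minimal scalar sketch in C. It is not the code from the paper: the function name `desaturate_planar`, the planar float layout, and the Rec. 601 luma weights are all assumptions made for this example.

```c
#include <assert.h>
#include <stddef.h>

/* Assumed Rec. 601 luma weights; the paper may use different coefficients. */
#define LUMA_R 0.299f
#define LUMA_G 0.587f
#define LUMA_B 0.114f

/*
 * Hypothetical helper: blend every pixel toward its gray (luma) value.
 * amount = 0 leaves the image unchanged, amount = 1 makes it fully gray.
 * The channels are stored as three separate planes, so every iteration is
 * independent and reads contiguous memory.
 */
void desaturate_planar(float *r, float *g, float *b, size_t n, float amount)
{
    assert(amount >= 0.0f && amount <= 1.0f);
    for (size_t i = 0; i < n; ++i) {
        /* Compute the luma once, before any channel is overwritten. */
        float luma = LUMA_R * r[i] + LUMA_G * g[i] + LUMA_B * b[i];
        r[i] += amount * (luma - r[i]);
        g[i] += amount * (luma - g[i]);
        b[i] += amount * (luma - b[i]);
    }
}
```

Because each iteration only touches element `i` of each plane, a loop of this shape is a natural candidate for processing eight pixels per iteration with 256-bit registers.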
Which applications are most likely to benefit from recompilation for Intel® Advanced Vector Extensions (Intel® AVX)?
Applications that spend considerable time in floating-point loops that can be vectorized are likely to benefit the most from the increased vector width of Intel® Advanced Vector Extensions (Intel® AVX).
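A vectorizable floating-point loop of the kind described above might look like the following sketch. The function name `saxpy` and its signature are chosen for this example only; they are not taken from the linked article.

```c
#include <assert.h>
#include <stddef.h>

/*
 * y[i] = a * x[i] + y[i] over single-precision data.
 * `restrict` promises the compiler that x and y do not overlap, which
 * removes one common obstacle to automatic vectorization.
 */
void saxpy(size_t n, float a, const float *restrict x, float *restrict y)
{
    assert(x != NULL && y != NULL);
    for (size_t i = 0; i < n; ++i) {
        y[i] = a * x[i] + y[i];
    }
}
```

Whether the compiler actually emits 256-bit instructions depends on the compiler and its options, for example `-O3 -mavx` with GCC or Clang, or `-xAVX` (`/QxAVX` on Windows) with the Intel compiler.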
SOA Cloth Simulation with 256-bit Intel® Advanced Vector Extensions (Intel® AVX)
This white paper describes a code sample that uses Intel® AVX to compute a mesh-based cloth simulation. A structure of arrays (SOA) implementation is used to maximize data parallelism, enabling 256-bit (8-float) SIMD processing. Code is provided.
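To make the SOA idea concrete, here is a small sketch of a data layout in which each component is stored in its own array of eight floats. The type `VertexBlock`, the function `integrate_block`, and the explicit Euler step are illustrative assumptions; the white paper's own data structures and integrator may differ.

```c
#include <assert.h>
#include <stddef.h>

enum { LANES = 8 };  /* eight 32-bit floats fill one 256-bit register */

/* Hypothetical SOA block: one array per component, eight vertices per block. */
typedef struct {
    float x[LANES], y[LANES], z[LANES];     /* positions  */
    float vx[LANES], vy[LANES], vz[LANES];  /* velocities */
} VertexBlock;

/*
 * Advance the eight vertices of one block by a time step dt
 * (explicit Euler, used here only to keep the example short).
 * Each statement reads and writes eight contiguous floats of the same
 * component, which is the access pattern 256-bit SIMD works best with.
 */
void integrate_block(VertexBlock *v, float dt)
{
    assert(v != NULL);
    for (size_t i = 0; i < LANES; ++i) {
        v->x[i] += dt * v->vx[i];
        v->y[i] += dt * v->vy[i];
        v->z[i] += dt * v->vz[i];
    }
}
```

With an array of structures (`struct { float x, y, z; }` per vertex), the same update would interleave x, y, and z in memory, and the eight x values needed for one vector operation would no longer be contiguous.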
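Using AVX Without Writing AVX Code (PDF 260KB)
Abstract
Intel® Advanced Vector Extensions (Intel® AVX) is a set of 256-bit extensions to Intel® Streaming SIMD Extensions (Intel® SSE), designed for floating-point-intensive applications. Intel® SSE and Intel® AVX are both examples of single instruction, multiple data (SIMD) instruction sets. Intel® AVX was released as part of the 2nd generation Intel® Core™ processor family. With wider 256-bit vectors, a new extensible instruction format (Vector Extension, or VEX), and rich functionality, Intel® AVX delivers a significant improvement in system performance.
The instruction set architecture supports three operands, which makes instruction programming more flexible and allows non-destructive source operands. Legacy 128-bit SIMD instructions have also been extended to support three operands and the new instruction encoding format (VEX). An instruction encoding format describes how opcodes and prefixes are used to express higher-level instructions in a form the processor understands. This enables better management of data and of general-purpose applications such as image, audio, and video processing, scientific simulation, financial analytics, and 3D modeling and analysis.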
Benefits of Intel® Advanced Vector Extensions For Quaternion Spherical Linear Interpolation (Slerp)
Intel® AVX is a 256-bit instruction set extension to Intel® SSE and is designed for floating-point-intensive applications. This article examines how Intel® AVX microarchitecture features benefit Spherical Linear Interpolation (SLERP) operations.
Which applications are most likely to benefit from recompilation for Intel® Advanced Vector Extensions (Intel® AVX)?
Applications containing vectorizable floating-point loops or calls to performance libraries are the most likely to see significant performance gains from rebuilding for Intel® Advanced Vector Extensions (Intel® AVX).
Image Processing Acceleration Techniques using Intel® Streaming SIMD Extensions and Intel® Advanced Vector Extensions
This article details optimized implementations of data transformations and algorithms, together with a performance analysis that gives measured speedups for Intel® SSE optimized code and estimates for Intel® AVX optimized code.
IIR Gaussian Blur Filter Implementation using Intel® Advanced Vector Extensions
This white paper proposes an implementation for the Infinite Impulse Response (IIR) Gaussian blur filter using Intel® Advanced Vector Extensions (Intel® AVX) instructions. For a 2048x2048 image size, the AVX implementation is ~2X faster than the SSE code.
