How to Vectorize Code Using Intrinsics on 32-Bit Intel® Architecture

OpenMP* and the Intel® IPP Library

Explains how to configure OpenMP in the Intel IPP library to maximize the multi-threaded performance of the Intel IPP primitives.
IIR Gaussian Blur Filter Implementation using Intel® Advanced Vector Extensions

This white paper proposes an implementation of the Infinite Impulse Response (IIR) Gaussian blur filter using Intel® Advanced Vector Extensions (Intel® AVX) instructions. For a 2048x2048 image, the Intel AVX implementation is roughly 2x faster than the SSE code.
3D Vector Normalization Using 256-Bit Intel® Advanced Vector Extensions (Intel® AVX)

This article shows how to use 256-bit Intel® Advanced Vector Extensions (Intel® AVX) to normalize an array of 3D vectors. It describes a shuffle approach that converts between array-of-structures (AoS) and structure-of-arrays (SoA) layouts on the fly, so that the data is ready for up to 8-wide SIMD processing.
Auto vectorization of OpenCL* code with the Intel® SDK for OpenCL* Applications

The Intel® SDK for OpenCL* Applications features an implicit vectorization module that boosts application performance. The module uses state-of-the-art vectorization algorithms based on up-to-date compiler research.
Vectorization Toolkit

A toolkit that presents six steps to increase performance through vectorization in your application.
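Intel Vectorization Toolkit: 5. Implementing GAP Recommendations

Intel Vectorization Toolkit: 5. Implementing GAP recommendations (such as pragma simd and/or array notations)
Intel Vectorization Toolkit: 3. Using the Intel Compiler's Vectorization Report to Identify Candidate Loops

Intel Vectorization Toolkit: 3. Using the Intel compiler's vectorization report to identify candidate loops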
Fortran Array Data and Arguments and Vectorization

Examples of vectorizing Fortran applications
Outer Loop Vectorization

Vectorization Essentials: Vectorizing the outer loop can be profitable
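One situation where outer loop vectorization tends to pay off is when the inner loop carries a dependence but the outer iterations are independent. The sketch below is a hypothetical example, not code from the article: `row_prefix_sums` is a name invented here, and `#pragma omp simd` is the OpenMP 4.0 SIMD directive, used as a hint that only takes effect when the compiler has OpenMP SIMD support enabled (it is otherwise ignored).

```c
#include <stddef.h>

/* Replace each row of a rows x cols row-major matrix with its running sum.
   The inner loop over j cannot be vectorized directly because element j
   depends on element j - 1. Different rows never touch the same data,
   so the outer loop over i is safe to vectorize. */
void row_prefix_sums(float *a, size_t rows, size_t cols)
{
    /* Hint: treat iterations of the OUTER loop as SIMD lanes. */
    #pragma omp simd
    for (size_t i = 0; i < rows; ++i) {
        for (size_t j = 1; j < cols; ++j) {
            a[i * cols + j] += a[i * cols + j - 1];
        }
    }
}
```

With this row-major layout, adjacent lanes access memory `cols` elements apart, so whether the outer loop vectorization is actually profitable depends on the target and the data layout. Storing the matrix transposed would make the lane accesses contiguous.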
