Improve Vectorization Performance using Intel® Advanced Vector Extensions 512

This article shows a simple example of a loop that was not vectorized by the Intel® C++ Compiler due to possible data dependencies, but which has now been vectorized using the Intel® Advanced Vector Extensions 512 instruction set on an Intel® Xeon Phi™ processor. We will explore why the compiler using this instruction set automatically recognizes the loop as vectorizable and will discuss some...
Authored by Alberto V. (Intel) Last updated on 12/15/2016 - 14:24
For more complete information about compiler optimizations, see our Optimization Notice.