Parallelism delivers the capability High Performance Computing (HPC) requires.
We optimized a version of Dijkstra’s shortest path graph algorithm using a combination of Intel® Cilk™ Plus array notation and OpenMP* parallel for.
The previous webinar gave examples of simply-structured loops that could be auto-vectorized using the Intel® Compiler.
In this episode, we will talk about some of the problems you might get while using automatic vectorization feature of Intel® Professional Edition Compilers.
We will discuss automatic vectorization feature of the compilers, where it can be used, and how to diagnose it. But this discussion will cover only basic principles of automatic vectorization.
Intel® Composer XE 2015 has dramatically overhauled the reporting features for such crucial optimizations as inlining, vectorization, parallelization, and memory access and cache usage optimization