A simple, widely known and well-studied problem was posed to the students in the class: matrix multiplication. We ran an internal contest to obtain the fastest serial code, in which the students learned a lot about compiler optimizations and, even more, about the effect of caches on code performance. The objective of the contest was to extrapolate this exercise to a massive multicore architecture. Students were given kickstart code with a naive C implementation of the problem using OpenMP, and a series of rules.
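For readers unfamiliar with the starting point, a naive OpenMP matrix multiplication in C might look roughly like the sketch below. This is not the kickstart code handed to the students; the function name `matmul_naive`, the row-major layout, and the use of square `double` matrices are all assumptions made for illustration.

```c
#include <stddef.h>

/* Illustrative sketch only: the name matmul_naive, the row-major layout
 * and the square n x n shape are assumptions, not the contest's code.
 * Computes C = A * B with the textbook triple loop. */
void matmul_naive(size_t n, const double *a, const double *b, double *c)
{
    /* Each row of C is independent of the others, so the outer loop can
     * be divided among threads. Without OpenMP enabled at compile time
     * the pragma is ignored and the loop simply runs serially. */
    #pragma omp parallel for
    for (long i = 0; i < (long)n; i++) {
        for (size_t j = 0; j < n; j++) {
            double sum = 0.0;
            /* Walking down column j of B strides through memory by n
             * elements per step, which is what makes the naive loop
             * order unfriendly to caches. */
            for (size_t k = 0; k < n; k++)
                sum += a[(size_t)i * n + k] * b[k * n + j];
            c[(size_t)i * n + j] = sum;
        }
    }
}
```

The strided access to `b` in the innermost loop is the kind of cache effect the serial part of the contest exposes; reordering or blocking the loops are common ways to reduce it.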
Use the Intel Compiler 11.1 or 12.0 with the switch /QxAVX (Windows*) or -xavx (Linux*) to compile applications for Intel® Advanced Vector Extensions (Intel® AVX).
This article provides an overview of the methods available in Intel® Parallel Composer, along with a comparison of their key benefits.
Tips and techniques on using the Intel Compilers to maximize your application performance.
How to use the automatic vectorizer of the Intel® C/C++ Compiler to optimize your application using Intel Streaming SIMD Extensions (Intel SSE) or Intel Advanced Vector Extensions (Intel AVX).
Installing Intel® Compilers for Windows* on computers without internet access by using license files or remote activation.
This article details optimized implementations of data transformations and algorithms, together with an analysis that compares their performance, giving measured speedups for Intel® SSE optimized code and estimates for Intel® AVX optimized code.