With automatic parallelization, the compiler detects loops that can be safely and efficiently executed in parallel and generates multithreaded code.
A step-by-step introduction to application performance tuning using the Intel® Compilers version 13 for IA-32 and Intel® 64 processors that are included with Intel® Parallel Studio XE 2013
This article describes a method to compile and run a distributed memory coarray program using Intel® Parallel Studio XE Cluster Edition for Linux . An example using Linux* is presented.
The Intel C++ and Fortran compilers for Windows* and Linux* provide 'legacy' and 'compatibility' implementations of the OpenMP THREADPRIVATE directive. The 'compatibility' option should not be used when everything is compiled by Intel compilers.
Intel MKL in Microsoft Visual Studio
The specific optimization and general support for the latest Intel® AVX2 instructions have been added in the Intel MKL v11.0. This article lists the specific functions that are optimized for Intel AVX2.
Late-breaking updates to the Intel(R) C++ and Fortran Composer XE 2013 documentation specific to the Intel(R) Many Integrated Core (Intel(R) MIC) architecture will be shown here.
The article describes how to profile Windows* services by launching them from Intel® VTune™ Amplifier. This trick is useful for cases when attaching to process is not applicable.
Sparse BLAS routines can be useful to implement iterative methods for solving large sparse systems of equations or eigenvalue problems