Tips and techniques on using the Intel® Compilers to maximize your application performance.
Intel® Compiler - How can I generate optimized code to run on any IA-32 or Intel®64 architecture processor?Some frequently used optimization switches of the Intel Compiler are described
Performance Tools for Software Developers - SSE generation and processor-specific optimizations continuedCan I combine the processor values and target more than one processor? How to generate optimized code for both Intel and AMD* architecture? Where can I find more information on processor-specific optimizations?
The purpose of this document is to help developers determine which FFT, Intel® MKL or Intel® IPP is best suited for their application.
Vectorization is one of many optimizations that are enabled by default in the latest Intel compilers. In order to be vectorized, loops must obey certain conditions, listed below. Some additional ways to help the compiler to vectorize loops are described.
The compiler supports many options that tune or optimize an application for different Intel and non-Intel processors. Differences are explained, and the switches /arch, /Qx..., /Qax... (Windows*) and -m, -x..., -ax... (Linux*, Mac OS* X) are recommended.
SIGSEGV on Linux and SIGBUS on MacOS Root Causes
Advice and background information is given on typical issues that may arise when threading an application using the Intel Fortran Compiler and other software tools, whether using OpenMP, automatic parallelization or threaded libraries.
The Intel® C++ Compiler 11.1 Professional Edition now allows you to merge .dyn files with customized weighting.
The multi-core performance of a legacy Fortran benchmark unsuited to data parallelism is enhanced by threading using the TASK construct of OpenMP and the Intel Fortran Compiler. The necessary source code changes are explained in detail.