Article

Don't Use the OpenMP* THREADPRIVATE 'Compatibility' Option when Everything is Compiled by Intel

The Intel C++ and Fortran compilers for Windows* and Linux* provide 'legacy' and 'compatibility' implementations of the OpenMP THREADPRIVATE directive. The 'compatibility' option should not be used when everything is compiled by Intel compilers.
Authored by Kenneth Craft (Intel) Last updated on 07/08/2019 - 15:12
Article

OpenMP* 4.0 Combined Offload Constructs Support for the Intel® Xeon Phi™ Coprocessor

The Intel® Parallel Studio XE 2015 Composer Editions for Windows* and Linux* have feature enhancements that provide near full support of the OpenMP* 4.0 API (July 2013) specification.

Authored by Last updated on 10/15/2019 - 15:30
Article

OMP_PROC_BIND is Now Supported on Compatible Non-Intel Processors

The newest versions of the Intel® C++ and Fortran compilers now support OpenMP* environment variable OMP_PROC_BIND on compatible non-Intel processors for Linux* and Windows* platfo

Authored by Kenneth Craft (Intel) Last updated on 10/08/2019 - 18:20
Article

New Features for Intel® MIC Architecture in the Intel Compiler

The list below summarizes new features and changes specific to programming for Intel® MIC Architecture with Intel Compiler 15.0:

Authored by AmandaS (Intel) Last updated on 10/15/2019 - 16:40
Article

FAT Binary Created for Windows* Static (.lib) and Dynamic-Link (.dll) Library Containing Intel® Xeon Phi™ Coprocessor Offload Code

The Intel® Parallel Studio XE 2015 Composer Editions for Windows* have an internal implementation feature enhancement enabling the Intel® 15.0 compilers to create a FAT binary file for the Windows*

Authored by Last updated on 10/15/2019 - 15:30
Article

Run-to-Run Reproducibility of Floating-Point Calculations for Applications on Intel® Xeon Phi™ Coprocessors (and Intel® Xeon® Processors)

The Issue

If I rerun the identical program on the identical input data on an identical processor, will I get an identical result?

Authored by Martyn Corden (Intel) Last updated on 10/15/2019 - 15:30
Article

ELF Executable Binary File Created for the Intel® Xeon Phi™ Coprocessor Offload Image

The Intel® Parallel Studio XE 2015 Composer Editions for Windows* and Linux* have an internal implementation feature enhancement to the Intel® Xeon Phi™ coprocessor binary image created when using

Authored by Last updated on 10/15/2019 - 15:30
Article

Explicit Vector Programming in Fortran

No longer does Moore’s Law result in higher frequencies and improved scalar application performance; instead, higher transistor counts lead to increased parallelism, both through more cores and thr

Authored by Martyn Corden (Intel) Last updated on 10/15/2019 - 15:30
Article

Loop Modifications to Enhance Data-Parallel Performance

When confronted with nested loops, the granularity of the computations that are assigned to threads will directly affect performance. Loop transformations such as splitting and merging nested loops can make parallelization easier and more productive.
Authored by admin Last updated on 07/05/2019 - 14:47
Article

Granularity and Parallel Performance

One key to attaining good parallel performance is choosing the right granularity for the application. Granularity is the amount of real work in the parallel task. If granularity is too fine, then performance can suffer from communication overhead.
Authored by admin Last updated on 07/05/2019 - 19:52