Article

OMP_PROC_BIND is Now Supported on Compatible Non-Intel Processors

The newest versions of the Intel® C++ and Fortran compilers now support OpenMP* environment variable OMP_PROC_BIND on compatible non-Intel processors for Linux* and Windows* platfo

Authored by Kenneth Craft (Intel) Last updated on 07/12/2019 - 15:35
Article

MKL VSL example for grouppooledcovariance (vsldgrouppooledcovariance.f) failed with PGI threading layer.

Problem Description:

Grouppooledcovariance example crashs with segmentation fault with PGI threading layer on all platforms due to error in PGI support of OpenMP order clause.

Authored by Gennady F. (Blackbelt) Last updated on 03/27/2019 - 10:00
Article

Updated Support for OpenMP* 4.0 features added in Composer XE 2013 SP1

[Updated based on the version of update 2 of 2013 SP1]

Authored by Last updated on 03/09/2019 - 12:30
Article

Diagnostic 15527: loop was not vectorized: function call to xxx cannot be vectorized

Product Version: Intel(R) Visual Fortran Compiler XE 15.0 or a later version

Authored by Devorah H. (Intel) Last updated on 05/25/2018 - 15:30
Article

Using Intel® MKL and Intel® TBB in the same application

Intel MKL 11.3 has introduced Intel TBB support.

Authored by Gennady F. (Blackbelt) Last updated on 08/01/2019 - 09:22
Article

Books - High Performance Parallelism Pearls

A look into the contents of the two "Pearls" books, edited by James Reinders and Jim Jeffers. These books contain a collection of examples of code modernization.
Authored by Mike P. (Intel) Last updated on 03/21/2019 - 12:00
Article

Putting Your Data and Code in Order: Data and layout - Part 2

Apply the concepts of parallelism and distributed memory computing to your code to improve software performance. This paper expands on concepts discussed in Part 1, to consider parallelism, both vectorization (single instruction multiple data SIMD) as well as shared memory parallelism (threading), and distributed memory computing.
Authored by David M. Last updated on 07/06/2019 - 16:40
Blog post

What is Thread Parallelism, and How Do I Put It to Use?

An Intro to Multi-Level Parallelism for High-Performance Computing by Clay Breshears | Life Sciences Software Architect, Intel
Authored by Clay B. (Blackbelt) Last updated on 12/12/2018 - 18:08
Blog post

Vectorized Reduction 2: Let the Compiler do that Voodoo that it do so well

As I mentioned in my previous post about writing a vectorized reduction code from Intel vector intrinsics, that part of the code was just the finishing touch on a loop computing squared difference of complex values.
Authored by Clay B. (Blackbelt) Last updated on 12/12/2018 - 18:08
Article

Weather Research and Forecasting Model Optimized for Knights Landing

The Weather Research and Forecasting (WRF) Model is a numerical weather prediction (NWP) system designed for both atmospheric research and operational forecasting needs. It is made up of about a half million lines of code, predominantly in Fortran*.
Authored by Last updated on 03/21/2019 - 12:00