Filters

Article

Threading Fortran Applications for Parallel Performance on Multi-Core Systems

Advice and background information is given on typical issues that may arise when threading an application using the Intel Fortran Compiler and other software tools, whether using OpenMP, automatic parallelization or threaded libraries.
Authored by Martyn Corden (Intel) Last updated on 12/12/2018 - 18:00
Article

OpenMP* and the Intel® IPP Library

How to configure OpenMP in the Intel IPP library to maximize multi-threaded performance of the Intel IPP primitives.
Authored by Last updated on 07/31/2019 - 14:30
Blog post

Optimization of Data Read/Write in a Parallel Application

(This work was done by Vivek Lingegowda during his internship at Intel.)

Authored by Last updated on 07/04/2019 - 17:40
Article

The Importance of Vectorization for Intel Microarchitectures (Fortran Example)

Reference Link and Download

Intel Vectorization Tools

Authored by Martyn Corden (Intel) Last updated on 07/03/2019 - 20:00
Article

Explicit Vector Programming – Best Known Methods

Vectorizing improves performance, and achieving high performance can save power. Introduction to tools for vectorizing compute-intensive processing.
Authored by Last updated on 04/24/2019 - 11:25
Video

Optimize a Pythagorean Prime Number Finder Using OpenMP* with the Intel® Fortran Compiler

Video tutorial explaining how to parallelize a Pythagorean prime number finder using the Intel® Visual Fortran Compiler with OpenMP*.

Authored by admin Last updated on 12/20/2018 - 15:36
Blog post

Introduction to OpenMP* on YouTube*

Tim Mattson (Intel) has authored an extensive series of excellent videos as in introduction to OpenMP*.

Authored by Mike P. (Intel) Last updated on 07/04/2019 - 19:51
Article

OpenMP 4.0 New features Supported in Intel® Compiler 16.0

This article is to introduce two new OpenMP 4.0 features supported by Intel® Compiler 16.0. They are User-defined reductions for POD types in C/C++ program and array reductions in Fortran program.
Authored by Chen, Yuan (Intel) Last updated on 03/09/2019 - 12:30
Article

Peel the Onion (Optimization Techniques)

This paper is a more formal response to an Intel® Developer Zone forum posting. See: (https://software.intel.com/en-us/forums/intel-moderncode-for-parallel-architectures/topic/590710).
Authored by jimdempseyatthecove (Blackbelt) Last updated on 12/12/2018 - 18:00
Blog post

Vectorized Reduction 2: Let the Compiler do that Voodoo that it do so well

As I mentioned in my previous post about writing a vectorized reduction code from Intel vector intrinsics, that part of the code was just the finishing touch on a loop computing squared difference of complex values.
Authored by Clay B. (Blackbelt) Last updated on 12/12/2018 - 18:08