Article

Weird OpenMP Reduction

Typical reductions in OpenMP* involve using a associative operator op to do local reductions, and then using a

Authored by Last updated on 06/07/2017 - 09:21
Article

OpenMP* 5.0 support in Intel® Compiler 18.0

OpenMP 5.0 is the next version of the OpenMP specification which should be officially released in 2018.

Authored by Igor V. (Intel) Last updated on 03/09/2019 - 12:30
Article
Article

Updated Support for OpenMP* 4.0 features added in Composer XE 2013 SP1

[Updated based on the version of update 2 of 2013 SP1]

Authored by Last updated on 03/09/2019 - 12:30
Article

A Parallel Stable Sort Using C++11 for TBB, Cilk Plus, and OpenMP

This article describes a parallel merge sort code, and why it is more scalable than parallel quicksort or parallel samplesort. The code relies on the C++11 “move” semantics.

Authored by Last updated on 08/01/2019 - 09:30
Article

Quick start to performance analysis and code optimization with Intel® Cilk™ Plus or OpenMP* and Intel® System Studio or Intel® Parallel Studio XE

This article demonstrates how to start using the optimization methodology with Intel VTune Amplifier & Intel Cilk Plus language extensions quickly and easily
Authored by Last updated on 05/30/2018 - 07:00
Article

Code Sample: Exploring MPI for Python* on Intel® Xeon Phi™ Processor

Learn how to write an MPI program in Python*, and take advantage of Intel® multicore architectures using OpenMP threads and Intel® AVX512 instructions.
Authored by Nguyen, Loc Q (Intel) Last updated on 10/15/2019 - 15:30
Article

Process and Thread Affinity for Intel® Xeon Phi™ Processors

The Intel® MPI Library and OpenMP* runtime libraries can create affinities between processes or threads, and hardware resources. This affinity keeps an MPI process or OpenMP thread from migrating to a different hardware resource, which can have a dramatic effect on the execution speed of a program.
Authored by Gregg S. (Intel) Last updated on 10/15/2019 - 15:30
Article

Scale-Up Implementation of a Transportation Network Using Ant Colony Optimization (ACO)

In this article an OpenMP* based implementation of the Ant Colony Optimization algorithm was analyzed for bottlenecks with Intel® VTune™ Amplifier XE 2016 together with improvements using hybrid MPI-OpenMP and Intel® Threading Building Blocks were introduced to achieve efficient scaling across a four-socket Intel® Xeon® processor E7-8890 v4 processor-based system.
Authored by Sunny G. (Intel) Last updated on 10/15/2019 - 16:40
Article

Fine-Tuning Optimization for a Numerical Method for Hyperbolic Equations Applied to a Porous Media Flow Problem with Intel® Tools

This paper presents an analysis for potential optimization for a Godunov-type semi-discrete central scheme, for a particular hyperbolic problem implicated in porous media flow, using OpenMP* and Intel® Advanced Vector Extensions 2.
Authored by Last updated on 07/03/2019 - 20:00