Article

A Parallel Stable Sort Using C++11 for TBB, Cilk Plus, and OpenMP

This article describes a parallel merge sort and explains why it is more scalable than parallel quicksort or parallel samplesort. The code relies on C++11 “move” semantics.

Authored by Last updated on 06/07/2017 - 10:37
Article

Quick start to performance analysis and code optimization with Intel® Cilk™ Plus or OpenMP* and Intel® System Studio or Intel® Parallel Studio XE

This article demonstrates how to get started quickly and easily with the optimization methodology using Intel VTune Amplifier and the Intel Cilk Plus language extensions.
Authored by Last updated on 05/30/2018 - 07:00
Article

Books - Message Passing Interface (MPI)

This article looks at several books that introduce developers to the topics of Message Passing Interface (MPI), parallel programming, and OpenMP*.
Authored by Mike P. (Intel) Last updated on 12/12/2018 - 18:00
Article

Code Sample: Exploring MPI for Python* on Intel® Xeon Phi™ Processor

Learn how to write an MPI program in Python* and take advantage of Intel® multicore architectures using OpenMP threads and Intel® AVX-512 instructions.
Authored by Nguyen, Loc Q (Intel) Last updated on 07/06/2019 - 16:30
Article

Optimization Techniques for the Intel® MIC Architecture: Part 1 of 3

Part one of this three-part series focuses on thread parallelism and race conditions, and discusses using mutexes in OpenMP* to resolve race conditions.
Authored by Mike P. (Intel) Last updated on 03/21/2019 - 12:00
Article

Process and Thread Affinity for Intel® Xeon Phi™ Processors

The Intel® MPI Library and OpenMP* runtime libraries can create affinities between processes or threads and hardware resources. This affinity keeps an MPI process or OpenMP thread from migrating to a different hardware resource, which can have a dramatic effect on the execution speed of a program.
Authored by Gregg S. (Intel) Last updated on 03/21/2019 - 12:00
Article

Scale-Up Implementation of a Transportation Network Using Ant Colony Optimization (ACO)

This article analyzes an OpenMP*-based implementation of the Ant Colony Optimization algorithm for bottlenecks using Intel® VTune™ Amplifier XE 2016. It then introduces improvements using hybrid MPI-OpenMP and Intel® Threading Building Blocks to achieve efficient scaling across a four-socket system based on the Intel® Xeon® processor E7-8890 v4.
Authored by Sunny G. (Intel) Last updated on 07/05/2019 - 19:10
Article

Fine-Tuning Optimization for a Numerical Method for Hyperbolic Equations Applied to a Porous Media Flow Problem with Intel® Tools

This paper analyzes potential optimizations for a Godunov-type semi-discrete central scheme applied to a hyperbolic problem arising in porous media flow, using OpenMP* and Intel® Advanced Vector Extensions 2.
Authored by Last updated on 07/03/2019 - 20:00
Blog post

The New Parallel Universe Magazine is Out: All About Vectorization

Parallel Universe is Intel's quarterly magazine that explores inroads and innovations in software development. The new issue takes a deep dive into the subject of vectorization and what it can do for you. Our first feature article looks at the SIMD directives for explicit vector programming now available in OpenMP. The second article walks you through Vectorization Advisor, a new tool in the...
Authored by Sally Sams (Intel) Last updated on 12/31/2018 - 15:00
Video

Have a Heart: Love your Hybrid Programs

Are you working with a hybrid program that just isn't performing? Do you feel like your application is on life support?

Authored by admin Last updated on 02/12/2018 - 15:21