Article

Scale-Up Implementation of a Transportation Network Using Ant Colony Optimization (ACO)

In this article an OpenMP* based implementation of the Ant Colony Optimization algorithm was analyzed for bottlenecks with Intel® VTune™ Amplifier XE 2016 together with improvements using hybrid MPI-OpenMP and Intel® Threading Building Blocks were introduced to achieve efficient scaling across a four-socket Intel® Xeon® processor E7-8890 v4 processor-based system.
Authored by Sunny G. (Intel) Last updated on 07/05/2019 - 19:10
File Wrapper

Parallel Universe Magazine - Issue 16, November 2013

Authored by admin Last updated on 12/12/2018 - 18:08
File Wrapper

Parallel Universe Magazine - Issue 24, March 2016

Authored by admin Last updated on 12/12/2018 - 18:08
Article

Understanding MPI Load Imbalance with Intel® Trace Analyzer and Collector

Download Article
Authored by Last updated on 08/28/2019 - 11:04
Article

An Introduction to MPI-3 Shared Memory Programming

In this article, we present a tutorial on how to start using MPI SHM on multinode systems using Intel® Xeon® and Intel® Xeon Phi™ processors. The article uses a 1-D ring application as an example and includes code snippets to describe how to transform common MPI send/receive patterns to utilize the MPI SHM interface. The MPI functions that are necessary for internode and intranode communications...
Authored by Last updated on 07/27/2018 - 08:58
Article

HPL Application Note

This guide is intended to help current HPL users get better benchmark performance by utilizing BLAS from the Intel® Math Kernel Library (Intel® MKL).
Authored by Vipin Kumar E K (Intel) Last updated on 03/11/2019 - 12:04
Article

GROMACS recipe for symmetric Intel® MPI using PME workloads

Objectives
Authored by Heinrich Bockhorst (Intel) Last updated on 07/06/2019 - 16:40