Article

Parallelization Using OpenMP*

 

Compiler Methodology for Intel® MIC Architecture

Authored by admin Last updated on 03/21/2019 - 12:00
Article

OpenMP* 4.0 Combined Offload Constructs Support for the Intel® Xeon Phi™ Coprocessor

The Intel® Parallel Studio XE 2015 Composer Editions for Windows* and Linux* have feature enhancements that provide near full support of the OpenMP* 4.0 API (July 2013) specification.

Authored by Last updated on 06/14/2019 - 14:35
Event

3 Tuning Secrets for better OpenMP* performance using Intel® VTune™ Amplifier XE

Parallelism delivers the capability High Performance Computing (HPC) requires.
Authored by admin Last updated on 12/18/2017 - 14:08
File Wrapper

Parallel Universe Magazine - Issue 16, November 2013

Authored by admin Last updated on 12/12/2018 - 18:08
File Wrapper

Parallel Universe Magazine - Issue 24, March 2016

Authored by admin Last updated on 12/12/2018 - 18:08
Article

Efficient Parallelization

This article is part of the Intel® Modern Code Developer Community documentation which supports developers in leveraging application performance in code through a systematic step-by-step optimization framework methodology. This article addresses: Thread level parallelization.
Authored by Ronald W Green (Blackbelt) Last updated on 03/21/2019 - 12:00
Event

Fast, light weight, scalable MPI performance analysis

Developers of modern HPC applications face a challenge when scaling out their hybrid (MPI/OpenMP) applications.
Authored by admin Last updated on 09/01/2016 - 09:53
Article

Understanding MPI Load Imbalance with Intel®Trace Analyzer and Collector

Download Article
Authored by Last updated on 07/06/2019 - 16:40
Article

FAT Binary Created for Windows* Static (.lib) and Dynamic-Link (.dll) Library Containing Intel® Xeon Phi™ Coprocessor Offload Code

The Intel® Parallel Studio XE 2015 Composer Editions for Windows* have an internal implementation feature enhancement enabling the Intel® 15.0 compilers to create a FAT binary file for the Windows*

Authored by Last updated on 06/14/2019 - 14:36
Article

Run-to-Run Reproducibility of Floating-Point Calculations for Applications on Intel® Xeon Phi™ Coprocessors (and Intel® Xeon® Processors)

The Issue

If I rerun the identical program on the identical input data on an identical processor, will I get an identical result?

Authored by Martyn Corden (Intel) Last updated on 06/14/2019 - 12:10