Article

Weird OpenMP Reduction

Typical reductions in OpenMP* involve using a associative operator op to do local reductions, and then using a

Authored by Last updated on 06/07/2017 - 09:21
Article

Choosing the right threading framework

This is the second article in a series of articles about High Performance Computing with the Intel Xeon Phi.

Authored by Last updated on 07/06/2019 - 16:30
Article

Programação Vetorial e Paralela com amplificador Intel® VTune™

Eduardo H. M. Cruz, Matheus S. Serpa, Arthur M. Krause, Philippe O. A. Navaux

Authored by Last updated on 12/12/2018 - 18:00
Blog post

BKMs on the use of the SIMD directive

We had an ask from one of the various "Birds of a Feather" meetings Intel® holds at venues such as at the Super Computing* (SC) and International Super Computing* (ISC) conferences.

Authored by Last updated on 07/06/2019 - 17:00
Blog post

Applying Intel® Threading Building Blocks Observers for Thread Affinity on Intel® Xeon Phi™ Coprocessors

In spite of the fact that the Intel® Threading Building Blocks (Intel® TBB) library [1] [2] provides high-level task based parallelism intended to hide sof

Authored by Alex (Intel) Last updated on 08/01/2019 - 09:30
Article

Optimizing Memory Bandwidth on Stream Triad

Download Article
Authored by Karthik Raman (Intel) Last updated on 07/06/2019 - 16:40
Article

Resource Guide for Intel® Xeon Phi™ Coprocessor Developers

This article makes recommendations for how a developer can get up to speed quickly on the Intel® Many Integrated Core (Intel® MIC) Architecture. This is one of three articles: For the Administrator, for the Developer, and for the Investigator. Who is a Developer? Someone who will be programming on an Intel Many Integrated Core (Intel MIC) architecture. The assumption is that they are most...
Authored by Last updated on 07/06/2019 - 16:30
Article

Compiling and Linking Intel® Math Kernel Library with Microsoft* Visual C++*

The article provides hints for linking your program with Intel® MKL from the Microsoft* Visual Studio Environment: Microsoft* Visual Studio 2017/2015/2013/2012/2010 -- Automatically Microsoft* Visual Studio 2017/2015/2013/2012/2010 -- Manually
Authored by Fiona Z. (Intel) Last updated on 05/25/2018 - 15:30
Article

Putting Your Data and Code in Order: Data and layout - Part 2

Apply the concepts of parallelism and distributed memory computing to your code to improve software performance. This paper expands on concepts discussed in Part 1, to consider parallelism, both vectorization (single instruction multiple data SIMD) as well as shared memory parallelism (threading), and distributed memory computing.
Authored by David M. Last updated on 07/06/2019 - 16:40
Article

Why is Cilk™ Plus not speeding up my program? (Part 1)

In this article, I discuss some common performance pitfalls in Cilk™ Plus programs that prevent users from seeing speedups in their code, and describe some techniques for avoiding these pitfalls.
Authored by Last updated on 02/04/2019 - 10:40