Filters

Article

Intel® MKL and Intel® IPP: Choosing a High Performance FFT

The purpose of this document is to help developers determine which FFT, Intel® MKL or Intel® IPP is best suited for their application.
Authored by Last updated on 07/31/2019 - 14:23
Article

Loop Modifications to Enhance Data-Parallel Performance

When confronted with nested loops, the granularity of the computations that are assigned to threads will directly affect performance. Loop transformations such as splitting and merging nested loops can make parallelization easier and more productive.
Authored by admin Last updated on 07/05/2019 - 14:47
Article

Granularity and Parallel Performance

One key to attaining good parallel performance is choosing the right granularity for the application. Granularity is the amount of real work in the parallel task. If granularity is too fine, then performance can suffer from communication overhead.
Authored by admin Last updated on 07/05/2019 - 19:52
Article

OpenMP* and the Intel® IPP Library

How to configure OpenMP in the Intel IPP library to maximize multi-threaded performance of the Intel IPP primitives.
Authored by Last updated on 07/31/2019 - 14:30
Article

Avoiding Heap Contention Among Threads

Avoiding Heap Contention Among Threads (PDF 256KB)

Abstract

Authored by admin Last updated on 07/05/2019 - 19:58
Article

Detecting Memory Bandwidth Saturation in Threaded Applications

Detecting Memory Bandwidth Saturation in Threaded Applications (PDF 23

Authored by admin Last updated on 07/05/2019 - 19:57
Article

Superscalar Programming 101 (Matrix Multiply) Part 1 of 5

Part one of a five-part series, this article teaches a methodology to interpret statistics gathered during test runs and use those interpretations to improve parallel code.
Authored by jimdempseyatthecove (Blackbelt) Last updated on 07/04/2019 - 22:00
Article

Superscalar programming 101 (Matrix Multiply) Part 2 of 5

By Jim DempseyIn my last article we left off with

Authored by jimdempseyatthecove (Blackbelt) Last updated on 07/04/2019 - 22:00
Article

Superscalar programming 101 (Matrix Multiply) Part 3 of 5

By Jim Dempsey

Authored by jimdempseyatthecove (Blackbelt) Last updated on 07/04/2019 - 22:00
Article

Superscalar programming 101 (Matrix Multiply) Part 4 of 5

In the last installment (Part 3) we saw the effects of the QuickThread Parallel Tag Team method of Matrix Multiplica

Authored by jimdempseyatthecove (Blackbelt) Last updated on 07/04/2019 - 22:00