Фильтры

Article

OpenMP und inkrementelle Parallelisierung - (article in german)

In diesem Artikel wird der inkrementelle OpenMP Ansatz zur Parallelisierung von sequentiellen Programmen vorgestellt. Der Schwerpunkt liegt auf der praktischen Darstellung von einfachen Programmbeispielen und nicht auf der Vollständigkeit der Beschreibung
Автор: админ Последнее обновление: 12.12.2018 - 18:00
Article

Requirements for Vectorizable Loops

Vectorization is one of many optimizations that are enabled by default in the latest Intel compilers. In order to be vectorized, loops must obey certain conditions, listed below. Some additional ways to help the compiler to vectorize loops are described.
Автор: Martyn Corden (Intel) Последнее обновление: 27.03.2019 - 14:36
Article

Loop Modifications to Enhance Data-Parallel Performance

When confronted with nested loops, the granularity of the computations that are assigned to threads will directly affect performance. Loop transformations such as splitting and merging nested loops can make parallelization easier and more productive.
Автор: админ Последнее обновление: 05.07.2019 - 14:47
Article

Granularity and Parallel Performance

One key to attaining good parallel performance is choosing the right granularity for the application. Granularity is the amount of real work in the parallel task. If granularity is too fine, then performance can suffer from communication overhead.
Автор: админ Последнее обновление: 05.07.2019 - 19:52
Article

Avoiding Heap Contention Among Threads

Avoiding Heap Contention Among Threads (PDF 256KB)

Abstract

Автор: админ Последнее обновление: 05.07.2019 - 19:58
Article

Superscalar Programming 101 (Matrix Multiply) Part 1 of 5

Part one of a five-part series, this article teaches a methodology to interpret statistics gathered during test runs and use those interpretations to improve parallel code.
Автор: jimdempseyatthecove (Blackbelt) Последнее обновление: 04.07.2019 - 22:00
Article

Superscalar programming 101 (Matrix Multiply) Part 2 of 5

By Jim DempseyIn my last article we left off with

Автор: jimdempseyatthecove (Blackbelt) Последнее обновление: 04.07.2019 - 22:00
Article

Superscalar programming 101 (Matrix Multiply) Part 3 of 5

By Jim Dempsey

Автор: jimdempseyatthecove (Blackbelt) Последнее обновление: 04.07.2019 - 22:00
Article

Superscalar programming 101 (Matrix Multiply) Part 4 of 5

In the last installment (Part 3) we saw the effects of the QuickThread Parallel Tag Team method of Matrix Multiplica

Автор: jimdempseyatthecove (Blackbelt) Последнее обновление: 04.07.2019 - 22:00
Article

Superscalar programming 101 (Matrix Multiply) Part 5 of 5

In part 4 we saw the effects of the QuickThread Parallel Tag Team Transpose method of Matrix Multiplication performe

Автор: jimdempseyatthecove (Blackbelt) Последнее обновление: 04.07.2019 - 22:00