Article

A Library Based Approach to Threading for Performance

Download "A Library Based Approach to Threading for Performance" [PDF 78KB]

Last updated: 20.03.2019 - 13:20
Article

OpenMP und inkrementelle Parallelisierung (OpenMP and Incremental Parallelization; article in German)

This article presents the incremental OpenMP approach to parallelizing sequential programs. The focus is on the practical presentation of simple program examples rather than on the completeness of the description.
Author: admin. Last updated: 12.12.2018 - 18:00
Article

Parallel Lint

The article describes a new direction in the development of static code analyzers: verification of parallel programs. It reviews several static analyzers that can claim to be called "Parallel Lint".
Author: Andrey Karpov (Blackbelt). Last updated: 25.05.2018 - 15:30
Article

Consistency of Floating-Point Results using the Intel® Compiler

Tradeoffs between floating-point accuracy, reproducibility and performance. Updated for Intel® Compiler version 19.
Author: Martyn Corden (Intel). Last updated: 19.12.2018 - 11:49
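
The tradeoff named in the entry above exists because floating-point addition is not associative: when a sum is regrouped or reordered (as an optimizer may do when vectorizing or threading a reduction), the rounding can change. The following is a minimal Python sketch of that effect only; it does not use or model any Intel compiler option, and the helper `naive_sum` and the sample values are illustrative assumptions.

```python
import math

def naive_sum(values):
    """Accumulate strictly left to right, one rounding per addition."""
    total = 0.0
    for v in values:
        total += v
    return total

# Same three operands, different grouping, different rounding.
left_grouped = (0.1 + 0.2) + 0.3
right_grouped = 0.1 + (0.2 + 0.3)

# Same four operands, different order of accumulation.
values = [1e16, 1.0, -1e16, 1.0]
in_given_order = naive_sum(values)
in_sorted_order = naive_sum(sorted(values))

# math.fsum tracks the lost low-order bits and returns the correctly
# rounded sum, which serves as a reference here.
reference = math.fsum(values)

if __name__ == "__main__":
    print(left_grouped == right_grouped)
    print(in_given_order, in_sorted_order, reference)
```

Neither ordering is "wrong" in the sense of violating the arithmetic rules; they simply round at different points, which is why bit-for-bit reproducibility usually costs some freedom to reorder.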
Article

Image Processing Acceleration Techniques using Intel® Streaming SIMD Extensions and Intel® Advanced Vector Extensions

This article details optimized implementations of data transformations and algorithms together with analysis comparing performance and providing speedup measurements for Intel® SSE optimized code and estimates for Intel® AVX optimized code.
Author: Larsson, Petter (Blackbelt). Last updated: 25.05.2018 - 15:30
Article

Loop Modifications to Enhance Data-Parallel Performance

When confronted with nested loops, the granularity of the computations that are assigned to threads will directly affect performance. Loop transformations such as splitting and merging nested loops can make parallelization easier and more productive.
Author: admin. Last updated: 05.07.2019 - 14:47
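
One of the transformations named in the entry above, merging nested loops, can be sketched as follows. This is an illustrative Python sketch under stated assumptions, not code from the article: `ROWS`, `COLS`, `cell`, and the worker count are hypothetical, and Python threads are used only to show the structure (they will not speed up CPU-bound work in CPython). The point is that the outer loop alone offers only `ROWS` units of work to distribute, while the merged loop offers `ROWS * COLS`.

```python
from concurrent.futures import ThreadPoolExecutor

ROWS, COLS = 3, 8  # hypothetical sizes: few outer iterations, more inner ones

def cell(i, j):
    """Stand-in for the real loop body; each (i, j) is independent."""
    return i * COLS + j * j

def nested():
    """Original form: parallelizing over i would give only ROWS tasks."""
    out = [[0] * COLS for _ in range(ROWS)]
    for i in range(ROWS):
        for j in range(COLS):
            out[i][j] = cell(i, j)
    return out

def merged(workers=4):
    """Merged form: one flat index k covers all ROWS * COLS iterations."""
    def body(k):
        i, j = divmod(k, COLS)  # recover the original indices from k
        return cell(i, j)

    with ThreadPoolExecutor(max_workers=workers) as pool:
        flat = list(pool.map(body, range(ROWS * COLS)))

    # Reshape the flat result back into rows.
    return [flat[i * COLS:(i + 1) * COLS] for i in range(ROWS)]

if __name__ == "__main__":
    print(nested() == merged())
```

Merging is only valid when the iterations are independent, as they are in this sketch; a loop body that reads results from a previous `i` or `j` would need a different transformation.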
Article

Granularity and Parallel Performance

One key to attaining good parallel performance is choosing the right granularity for the application. Granularity is the amount of real work in the parallel task. If granularity is too fine, then performance can suffer from communication overhead.
Author: admin. Last updated: 05.07.2019 - 19:52
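
The granularity idea in the entry above can be made concrete by splitting the same job into tasks of different sizes. The following Python sketch is an assumption-laden illustration, not the article's code: `work`, `parallel_sum`, the data size, and the chunk sizes are hypothetical. Both runs compute the same total; what differs is how many tasks are created, and therefore how much scheduling and communication overhead is paid per unit of real work.

```python
from concurrent.futures import ThreadPoolExecutor

def work(x):
    """Stand-in for the real per-element computation."""
    return x * x

def sum_chunk(chunk):
    """One parallel task: its granularity is the length of the chunk."""
    return sum(work(x) for x in chunk)

def parallel_sum(data, chunk_size, workers=4):
    """Split data into chunks, process each chunk as a task, combine results.

    Returns the total and the number of tasks that were submitted.
    """
    chunks = [data[i:i + chunk_size] for i in range(0, len(data), chunk_size)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        partials = list(pool.map(sum_chunk, chunks))
    return sum(partials), len(chunks)

data = list(range(10_000))

# Fine-grained: one element per task, so overhead dominates the real work.
fine_total, fine_tasks = parallel_sum(data, chunk_size=1)

# Coarse-grained: one large chunk per worker, so overhead is paid only a few times.
coarse_total, coarse_tasks = parallel_sum(data, chunk_size=2_500)

if __name__ == "__main__":
    print(fine_total == coarse_total, fine_tasks, coarse_tasks)
```

Going too coarse has its own cost: with fewer tasks than workers, or tasks of uneven cost, some workers sit idle, so the chunk size is usually tuned rather than maximized.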
Article

OpenMP* and the Intel® IPP Library

How to configure OpenMP in the Intel IPP library to maximize multi-threaded performance of the Intel IPP primitives.
Last updated: 31.07.2019 - 14:30
Article

Avoiding Relocation Errors when Building Applications with Large Global or Static Data on Intel64 Linux

Applications with more than 2 GB of static or global data should be built with -mcmodel=medium -shared-intel on Intel64 Linux*. If they are linked with static libraries, those libraries should also be built with -mcmodel=medium. Otherwise, "relocation truncated to fit" errors may occur.
Author: Martyn Corden (Intel). Last updated: 07.06.2017 - 10:10
Article

Detecting Memory Bandwidth Saturation in Threaded Applications

Detecting Memory Bandwidth Saturation in Threaded Applications (PDF 23

Author: admin. Last updated: 05.07.2019 - 19:57