Article

The Three Stages of Preparation for Optimizing Parallel Software

Improving the performance of parallel software requires a structured approach that makes good use of development resources and obtains good results quickly.

Authored by aaron-tersteeg (Intel) Last updated on 07/05/2019 - 10:15
Article

OpenMP* and the Intel® IPP Library

How to configure OpenMP in the Intel IPP library to maximize multi-threaded performance of the Intel IPP primitives.
Authored by Last updated on 07/31/2019 - 14:30
Article

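Three Steps to Optimizing Parallel Software

Improving the performance of parallel software requires a structured approach that makes full use of development resources and delivers excellent results quickly. This article divides that approach into three distinct stages: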

Authored by aaron-tersteeg (Intel) Last updated on 07/05/2019 - 10:18
Article

Three Stages of Preparation for Optimizing Parallel Programs

Improving the performance of parallel programs requires a well-structured approach that makes active use of developer tools and quickly yields positive results. This article presents the three stages of this approach.
Authored by aaron-tersteeg (Intel) Last updated on 07/05/2019 - 10:13
Article

The Three Stages of Preparation for Optimizing Parallel Software

Improving the performance of parallel software requires a structured approach that makes good use of development resources and obtains good results quickly.

Authored by aaron-tersteeg (Intel) Last updated on 07/05/2019 - 10:15
Article

Improving Averaging Filter Performance Using Intel® Cilk™ Plus

Intel® Cilk™ Plus is an extension to the C and C++ languages to support data and task parallelism.  It provides three new keywords to i

Authored by Anoop M. (Intel) Last updated on 12/12/2018 - 18:00
Article

Eight Optimizations for 3-Dimensional Finite Difference (3DFD) Code with an Isotropic (ISO)

This article describes how to implement and optimize a three-dimensional isotropic finite-difference kernel to run on the Intel® Xeon® Processor and Intel® Xeon Phi™.
Authored by Cédric ANDREOLLI (Intel) Last updated on 07/06/2019 - 16:40
Article

Code Sample: Allocate Memory Efficiently on an Intel® Xeon Phi™ Processor

How to efficiently use Multi-Channel DRAM (MCDRAM) and synchronous dynamic random-access memory.
Authored by Mike P. (Intel) Last updated on 07/06/2019 - 16:40
Article

Recognize and Measure Vectorization Performance

Get a background on vectorization and learn different techniques to evaluate its effectiveness.
Authored by David M. Last updated on 07/06/2019 - 16:40
Article

Optimize Data Layout with SIMD Templates

Compare the results of manually tuning financial data with those of using data layout templates in the Intel® C++ Compiler.
Authored by Nimisha R. (Intel) Last updated on 12/12/2018 - 18:00