Article

Requirements for Vectorizable Loops

Vectorization is one of many optimizations that are enabled by default in the latest Intel compilers. In order to be vectorized, loops must obey certain conditions, listed below. Some additional ways to help the compiler to vectorize loops are described.
作者: Martyn Corden (Intel) 最后更新时间: 2019/03/27 - 14:36
Article

Loop Modifications to Enhance Data-Parallel Performance

When confronted with nested loops, the granularity of the computations that are assigned to threads will directly affect performance. Loop transformations such as splitting and merging nested loops can make parallelization easier and more productive.
作者: 管理 最后更新时间: 2019/07/05 - 14:47
Article

Granularity and Parallel Performance

One key to attaining good parallel performance is choosing the right granularity for the application. Granularity is the amount of real work in the parallel task. If granularity is too fine, then performance can suffer from communication overhead.
作者: 管理 最后更新时间: 2019/07/05 - 19:52
Article

Improving Averaging Filter Performance Using Intel® Cilk™ Plus

Intel® Cilk™ Plus is an extension to the C and C++ languages to support data and task parallelism.  It provides three new keywords to i

作者: Anoop M. (Intel) 最后更新时间: 2018/12/12 - 18:00
Article

Vectorizing Loops with Calls to User-Defined External Functions

Introduction

作者: Anoop M. (Intel) 最后更新时间: 2018/12/12 - 18:00
Article

Explicit Vector Programming – Best Known Methods

Vectorizing improves performance, and achieving high performance can save power. Introduction to tools for vectorizing compute-intensive processing.
作者: 最后更新时间: 2019/04/24 - 11:25
Article

Alignment of Allocatable Arrays & Pointers in Intel Fortran Compiler

The Intel® Parallel Studio XE 2017 or later for Fortran Windows* and Linux* have a feature enhancement supporting ASSUME_ALIGNED directive at point of use for allocatable arrays or pointers.

作者: Duan, Xiaoping (Intel) 最后更新时间: 2019/03/22 - 12:39
File Wrapper

Parallel Universe Magazine - Issue 9, February 2012

作者: 管理 最后更新时间: 2018/12/12 - 18:08
Article

Programming and Compiling for Intel® Many Integrated Core Architecture

This article discussions parallelization and provides links that will help you understand your programming environment and evaluate the suitability of your app.
作者: AmandaS (Intel) 最后更新时间: 2019/09/30 - 17:28
Article

Efficient Parallelization

This article is part of the Intel® Modern Code Developer Community documentation which supports developers in leveraging application performance in code through a systematic step-by-step optimization framework methodology. This article addresses: Thread level parallelization.
作者: Ronald W Green (Blackbelt) 最后更新时间: 2019/09/30 - 17:28