Article

Intel® MKL Threaded 1D FFTs

This document describes the cases for which the Intel MKL 10.2 and later 1D complex-to-complex FFTs are threaded.
作者: 最后更新时间: 2019/03/27 - 10:00
Article

Threading Fortran Applications for Parallel Performance on Multi-Core Systems

Advice and background information is given on typical issues that may arise when threading an application using the Intel Fortran Compiler and other software tools, whether using OpenMP, automatic parallelization or threaded libraries.
作者: Martyn Corden (Intel) 最后更新时间: 2018/12/12 - 18:00
Article

OpenMP* and the Intel® IPP Library

How to configure OpenMP in the Intel IPP library to maximize multi-threaded performance of the Intel IPP primitives.
作者: 最后更新时间: 2019/07/31 - 14:30
Article

The Importance of Vectorization for Intel Microarchitectures (Fortran Example)

Reference Link and Download

Intel Vectorization Tools

作者: Martyn Corden (Intel) 最后更新时间: 2019/07/03 - 20:00
Article

Improving Averaging Filter Performance Using Intel® Cilk™ Plus

Intel® Cilk™ Plus is an extension to the C and C++ languages to support data and task parallelism.  It provides three new keywords to i

作者: Anoop M. (Intel) 最后更新时间: 2018/12/12 - 18:00
Article

Vectorizing Loops with Calls to User-Defined External Functions

Introduction

作者: Anoop M. (Intel) 最后更新时间: 2018/12/12 - 18:00
Article

游戏行业的人工智能设计(第一部分)

The gaming industry has seen great strides in game complexity recently. Game developers are challenged to create increasingly compelling games. This series explores important Artificial Intelligence (AI) concepts and how to optimize them for multi-core.
作者: 管理 最后更新时间: 2018/12/12 - 18:00
Article

OpenMP 4.0 New features Supported in Intel® Compiler 16.0

This article is to introduce two new OpenMP 4.0 features supported by Intel® Compiler 16.0. They are User-defined reductions for POD types in C/C++ program and array reductions in Fortran program.
作者: Chen, Yuan (Intel) 最后更新时间: 2019/03/09 - 12:30
Article

Fast Gathering-based SpMxV for Linear Feature Extraction

This algorithm can be used to improve sparse matrix-vector and matrix-matrix multiplication in any numerical computation. As we know, there are lots of applications involving semi-sparse matrix computation in High Performance Computing. Additionally, in popular perceptual computing low-level engines, especially speech and facial recognition, semi-sparse matrices are found to be very common....
作者: 最后更新时间: 2018/12/12 - 18:00
Article

Peel the Onion (Optimization Techniques)

This paper is a more formal response to an Intel® Developer Zone forum posting. See: (https://software.intel.com/en-us/forums/intel-moderncode-for-parallel-architectures/topic/590710).
作者: jimdempseyatthecove (Blackbelt) 最后更新时间: 2018/12/12 - 18:00