Article

Improve Intel® MKL Performance for Small Problems: The Use of MKL_DIRECT_CALL

One of the big new features introduced in the Intel® Math Kernel Library (Intel® MKL) 11.2 is the greatly improved performance for small problem sizes.

Authored by Zhang, Zhang (Intel) Last updated on 07/07/2019 - 10:35
Article

Further Vectorization Features of the Intel® Compiler - Webinar Code Samples

The code samples for the webinar "Further Vectorization Features of the Intel® Compiler" given on 4/7/2015 are attached below.

Authored by Martyn Corden (Intel) Last updated on 07/11/2018 - 19:21
Article

Peel the Onion (Optimization Techniques)

This paper is a more formal response to an Intel® Developer Zone forum posting. See: (https://software.intel.com/en-us/forums/intel-moderncode-for-parallel-architectures/topic/590710).
Authored by jimdempseyatthecove (Blackbelt) Last updated on 12/12/2018 - 18:00
Article

Distributed Memory Coarray Programs with Process Pinning

This article describes a method to compile and run a distributed memory coarray program using Intel® Parallel Studio XE Cluster Edition for Linux . An example using Linux* is presented.
Authored by Kenneth Craft (Intel) Last updated on 10/15/2019 - 21:20