Article

Step-by-Step Application Performance Tuning with Intel Compilers

A step-by-step introduction to application performance tuning using version 13 of the Intel® Compilers for IA-32 and Intel® 64 processors, as included with Intel® Parallel Studio XE 2013.
Authored by Martyn Corden (Intel) Last updated on 05/10/2019 - 08:30
Article

Intel® VTune™ Amplifier XE - Profiling Windows* Services

Steps to profile Windows* services with Intel® VTune™ Amplifier XE
Authored by Kirill R. (Intel) Last updated on 06/23/2019 - 18:50
Blog post

Parallel Universe Magazine #12: Advanced Vectorization

This blog contains additional content for the article "Advanced Vectorization" from Parallel Universe #12:

Last updated on 07/03/2019 - 20:08
Article

A Matrix Multiplication Routine that Updates Only the Upper or Lower Triangular Part of the Result Matrix

Background

Intel® MKL provides the general-purpose BLAS* matrix multiply routines ?GEMM, defined as follows:

Authored by Zhang, Zhang (Intel) Last updated on 07/12/2019 - 14:46
Article

Coding for Performance: Data alignment and structures

This article collects general knowledge and Best-Known-Methods (BKMs) for aligning data within structures in order to achieve optimal performance.

Authored by Sumedh N. (Intel) Last updated on 06/07/2017 - 09:07
Blog post

BKMs on the use of the SIMD directive

We had a request from one of the various "Birds of a Feather" meetings that Intel® holds at venues such as the Super Computing* (SC) and International Super Computing* (ISC) conferences.

Last updated on 07/06/2019 - 17:00
Article

Resource Guide for People Investigating the Intel® Xeon Phi™ Coprocessor

This article identifies resources for anyone investigating the value to their organization of the Intel® Xeon Phi™ coprocessor, which is based on the Intel® Many Integrated Core (Intel® MIC) archit

Last updated on 06/14/2019 - 12:10
Blog post

Performance BKMs: There’s more than one hammer

I don’t know if any of you have noticed, but Intel® has a tendency to emphasize its own homegrown tools. This isn’t bad, as Intel has some of the best.

Last updated on 07/06/2019 - 17:10
Article

Improve Intel® MKL Performance for Small Problems: The Use of MKL_DIRECT_CALL

One of the big new features introduced in the Intel® Math Kernel Library (Intel® MKL) 11.2 is the greatly improved performance for small problem sizes.

Authored by Zhang, Zhang (Intel) Last updated on 07/07/2019 - 10:35
Article

Diagnostic 15319: loop was not vectorized: novector directive used

Product Version: Intel® Fortran Compiler 15.0 and above

Authored by Devorah H. (Intel) Last updated on 05/25/2018 - 15:30