This blog contains additional content for the article "Advanced Vectorization" from Parallel Universe #12:
A Matrix Multiplication Routine that Updates Only the Upper or Lower Triangular Part of the Result MatrixBackground
Intel® MKL provides the general purpose BLAS* matrix multiply routines ?GEMM defined as follows:
This article collects the general knowledge and Best-Known-Methods (BKMs) for aligning of data within structures in order to achieve optimal performance.
We had an ask from one of the various "Birds of a Feather" meetings Intel® holds at venues such as at the Super Computing* (SC) and International Super Computing* (ISC) conferences.
This article identifies resources for anyone investigating the value to their organization of the Intel® Xeon Phi™ coprocessor, which is based on the Intel® Many Integrated Core (Intel® MIC) archit
I don’t know if any of you have noticed but Intel® has a tendency to emphasize its own homegrown tools. This isn’t bad as Intel has some of the best.
One of the big new features introduced in the Intel® Math Kernel Library (Intel® MKL) 11.2 is the greatly improved performance for small problem sizes.
Product Version: Intel® Fortran Compiler 15.0 and above