Blog

Parallel Universe Magazine #12: Advanced Vectorization

This blog contains additional content for the article "Advanced Vectorization" from Parallel Universe #12:

Last updated: 2019/07/03 - 20:08
Article

Diagnostic 15523: Loop was not vectorized: cannot compute loop iteration count before executing the loop.

Product Version: Intel® Visual Fortran Compiler XE 15.0 or a later version

Author: Devorah H. (Intel) Last updated: 2018/05/25 - 15:30
Article

Diagnostic 15532: Loop was not vectorized: compile time constraints prevent loop optimization

Product Version: Intel® Visual Fortran Compiler XE 15.0 or a later version

Author: Devorah H. (Intel) Last updated: 2019/07/05 - 14:23
Article

Diagnostic 15537: Loop was not vectorized: implied FP exception model prevents usage of SVML library.

Product Version: Intel® Visual Fortran Compiler XE 15.0 or a later version

Author: Devorah H. (Intel) Last updated: 2018/05/25 - 15:30
Article

Putting Your Data and Code in Order: Optimization and Memory – Part 1

This two-part series discusses how data and memory layout affect performance and suggests specific steps to improve it. The basic steps shown in the two articles can yield significant performance gains. The articles are written at an intermediate level and assume the reader wants to optimize software performance using common C, C++ and Fortran* programming...
Author: David M. Last updated: 2018/12/12 - 18:00
Article

Fine-Tuning Optimization for a Numerical Method for Hyperbolic Equations Applied to a Porous Media Flow Problem with Intel® Tools

This paper analyzes potential optimizations of a Godunov-type semi-discrete central scheme for a particular hyperbolic problem arising in porous media flow, using OpenMP* and Intel® Advanced Vector Extensions 2.
Last updated: 2019/07/03 - 20:00
Article

Improve Vectorization Performance with Intel® AVX-512

See how the new Intel® AVX-512CD and Intel® AVX-512F subsets of Intel® Advanced Vector Extensions 512 (available in the Intel® Xeon Phi processor and in future Intel Xeon processors) let the compiler automatically generate vector code with no changes to the source code.
Author: Alberto V. (Intel) Last updated: 2019/07/08 - 19:26
Article

Implementing a Masked SVML-like Function Explicitly in User-Defined Way

The Intel® Compiler provides SIMD intrinsic APIs for the short vector math library (SVML) and starting with Intel® Advanced Vector Extensions...

Last updated: 2019/07/16 - 08:37