Article

OpenMP* and the Intel® IPP Library

How to configure OpenMP in the Intel IPP library to maximize multi-threaded performance of the Intel IPP primitives.
Last updated: 2019/07/31 - 14:30
Article

Implementing a Masked SVML-like Function Explicitly in User-Defined Way

The Intel® Compiler provides SIMD intrinsics APIs for short vector math library (SVML) and starting with Intel® Advanced Vector Extensions

Last updated: 2019/07/16 - 08:37
Article

Code Sample: Intel® AVX512-Deep Learning Boost: Intrinsic Functions

How developers can take advantage of the new Intel® AVX512-Deep Learning Boost (Intel® AVX512-DL Boost) instructions.
Author: Alberto V. (Intel) Last updated: 2019/04/02 - 10:04
Article

Quick Analysis of Vectorization Using Intel® Advisor

Find out how to use the command-line interface in Intel® Advisor 2017 for a quick, initial analysis of loop performance that gives an overview of the hotspots in your code.
Author: Alberto V. (Intel) Last updated: 2019/09/30 - 17:28
Article

Performance of Classic Matrix Multiplication Algorithm on Intel® Xeon Phi™ Processor System

Matrix multiplication (MM) of two matrices is one of the most fundamental operations in linear algebra. The algorithm for MM is very simple and can be easily implemented in any programming language. This paper shows that performance improves significantly when different optimization techniques are applied.
Last updated: 2019/10/15 - 15:30
Article

Scale-Up Implementation of a Transportation Network Using Ant Colony Optimization (ACO)

In this article, an OpenMP*-based implementation of the Ant Colony Optimization algorithm is analyzed for bottlenecks with Intel® VTune™ Amplifier XE 2016, and improvements using hybrid MPI-OpenMP and Intel® Threading Building Blocks are introduced to achieve efficient scaling across a four-socket system based on the Intel® Xeon® processor E7-8890 v4.
Author: Sunny G. (Intel) Last updated: 2019/10/15 - 16:40