How to configure OpenMP in the Intel IPP library to maximize multi-threaded performance of the Intel IPP primitives.
By now, many of you have heard of Intel® Transactional Synchronization Extensions (Intel® TSX).
Intel compiler optimization reports guide the developer to performance improvements
Intel® Parallel Studio XE is a very popular product from Intel that includes the Intel® Compilers, Intel® Performance Libraries, tools for analysis, debugging and tuning, tools for MPI and the Intel® MPI Library. Did you know that some of these are available for free? Here is a guide to “what is available free” from the Intel Parallel Studio XE suites.
This article demonstrates techniques that software developers can use to identify and fix NUMA-related performance issues in their applications.
本文将介绍一些技巧，帮助软件开发人员识别并修复使用最新英特尔软件开发工具时遇到的与 NUMA 相关的应用性能问题。
Sparse BLAS routines can be useful to implement iterative methods for solving large sparse systems of equations or eigenvalue problems
This page contains common questions and answers on multi-threading in the Intel IPP.
In this article an OpenMP* based implementation of the Ant Colony Optimization algorithm was analyzed for bottlenecks with Intel® VTune™ Amplifier XE 2016 together with improvements using hybrid MPI-OpenMP and Intel® Threading Building Blocks were introduced to achieve efficient scaling across a four-socket Intel® Xeon® processor E7-8890 v4 processor-based system.