Filters

Article

OpenMP* and the Intel® IPP Library

How to configure OpenMP in the Intel IPP library to maximize multi-threaded performance of the Intel IPP primitives.
Authored by Last updated on 07/31/2019 - 14:30
Article
Blog post

Fun with Intel® Transactional Synchronization Extensions

By now, many of you have heard of Intel® Transactional Synchronization Extensions (Intel® TSX).

Authored by Last updated on 07/04/2019 - 17:00
Article

Getting the Most out of your Intel® Compiler with the New Optimization Reports

Intel compiler optimization reports guide the developer to performance improvements
Authored by Martyn Corden (Intel) Last updated on 09/30/2019 - 17:28
Article

Free access to Intel® Compilers, Performance libraries, Analysis tools and more...

Intel® Parallel Studio XE is a very popular product from Intel that includes the Intel® Compilers, Intel® Performance Libraries, tools for analysis, debugging and tuning, tools for MPI and the Intel® MPI Library. Did you know that some of these are available for free? Here is a guide to “what is available free” from the Intel Parallel Studio XE suites.
Authored by admin Last updated on 09/30/2019 - 17:28
Article

Understanding NUMA for 3D Isotropic Finite Difference (3DFD) Wave Equation Code

This article demonstrates techniques that software developers can use to identify and fix NUMA-related performance issues in their applications.
Authored by Sunny G. (Intel) Last updated on 10/02/2019 - 16:18
Article

了解面向三维同性有限差分 (3DFD) 波动方程代码的 NUMA

本文将介绍一些技巧,帮助软件开发人员识别并修复使用最新英特尔软件开发工具时遇到的与 NUMA 相关的应用性能问题。

Authored by Sunny G. (Intel) Last updated on 10/02/2019 - 16:18
Article

Intel® MKL Sparse BLAS Overview

Sparse BLAS routines can be useful to implement iterative methods for solving large sparse systems of equations or eigenvalue problems
Authored by Last updated on 10/08/2019 - 18:20
Article

Intel® IPP - Threading / OpenMP* FAQ

This page contains common questions and answers on multi-threading in the Intel IPP.
Authored by Last updated on 10/10/2019 - 10:48
Article

应用蚁群优化算法 (ACO) 实施交通网络扩展

In this article an OpenMP* based implementation of the Ant Colony Optimization algorithm was analyzed for bottlenecks with Intel® VTune™ Amplifier XE 2016 together with improvements using hybrid MPI-OpenMP and Intel® Threading Building Blocks were introduced to achieve efficient scaling across a four-socket Intel® Xeon® processor E7-8890 v4 processor-based system.
Authored by Sunny G. (Intel) Last updated on 10/15/2019 - 16:40