Article

Choosing between OpenMP* and Explicit Threading Methods

OpenMP provides a powerful, portable, and simple means of threading applications. In some cases, however, developers should choose the flexibility of native threading APIs. The guidelines in this article help to identify whether OpenMP is an appropriate choice for a given situation.
Last updated: 2017/06/01 - 11:19
Article

Getting Started with OpenMP*

Abstract
Last updated: 2019/07/08 - 15:10
Article

Intel® 64 Architecture Processor Topology Enumeration

Download Code Package: 20160519-cpuid_topo.tar.gz
Last updated: 2019/07/05 - 20:39
Article

PowerTOP Primer

Note: All information in this article is taken from https://01.org/powertop/
Author: Mike P. (Intel). Last updated: 2019/07/12 - 14:44
Blog

Why P scales as C*V^2*f is so obvious (pt 2)

THE GORY DETAILS

Let’s continue from where we left off last time. Let’s figure out the why of the equation,

P = C * V^2 * (a * f)

Last updated: 2019/05/09 - 22:55
Article

Troubleshooting InfiniBand connection issues using OFED tools

This article describes how to troubleshoot some common InfiniBand issues using the tools provided by the Open Fabrics Enterprise Distribution (OFED).
Last updated: 2019/03/09 - 13:08
Article

Automatic Parallelization with Intel® Compilers

With automatic parallelization, the compiler detects loops that can be safely and efficiently executed in parallel and generates multithreaded code.
Author: Admin. Last updated: 2019/07/04 - 21:33
Article

Predicting and Measuring Parallel Performance

The success of parallelization is typically quantified by measuring the speedup of the parallel version relative to the serial version. It is also useful to compare that speedup with the upper limit of the potential speedup.
Author: Admin. Last updated: 2019/07/05 - 10:33
Article

Loop Modifications to Enhance Data-Parallel Performance

When confronted with nested loops, the granularity of the computations that are assigned to threads will directly affect performance. Loop transformations such as splitting and merging nested loops can make parallelization easier and more productive.
Author: Admin. Last updated: 2019/07/05 - 14:47
Article

Granularity and Parallel Performance

One key to attaining good parallel performance is choosing the right granularity for the application. Granularity is the amount of real work in the parallel task. If granularity is too fine, then performance can suffer from communication overhead.
Author: Admin. Last updated: 2019/07/05 - 19:52