Article

Loop Modifications to Enhance Data-Parallel Performance

When confronted with nested loops, the granularity of the computations that are assigned to threads will directly affect performance. Loop transformations such as splitting and merging nested loops can make parallelization easier and more productive.
Author: Admin Last updated: 2019/07/05 - 14:48
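As a rough illustration of merging nested loops, the sketch below flattens a two-level loop into a single index space so the work can be split into evenly sized ranges even when the outer loop has few iterations. The names `scale_nested`, `scale_collapsed`, and `workers` are hypothetical, and a Python thread pool stands in for whichever threading API the article actually uses. In CPython the global interpreter lock limits CPU-bound speedup, so this shows the index transformation only, not a performance result.

```python
from concurrent.futures import ThreadPoolExecutor


def scale_nested(matrix, factor):
    """Serial reference: an outer loop over rows, an inner loop over columns."""
    return [[value * factor for value in row] for row in matrix]


def scale_collapsed(matrix, factor, workers=4):
    """Merge both loops into one index range and split that range into chunks."""
    rows = len(matrix)
    cols = len(matrix[0]) if matrix else 0
    total = rows * cols
    out = [[0] * cols for _ in range(rows)]

    def work(start, stop):
        # Recover the (row, column) pair from the flattened index k.
        for k in range(start, stop):
            i, j = divmod(k, cols)
            out[i][j] = matrix[i][j] * factor

    # Ceiling division gives each task roughly total / workers iterations.
    chunk = max(1, -(-total // workers))
    with ThreadPoolExecutor(max_workers=workers) as pool:
        futures = [
            pool.submit(work, start, min(start + chunk, total))
            for start in range(0, total, chunk)
        ]
        for future in futures:
            future.result()  # re-raise any exception from a worker
    return out
```

With a 2 x 3 matrix and four workers, parallelizing only the outer loop would leave two workers idle, whereas the merged loop yields three tasks of two elements each.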
Article

Managing Lock Contention: Large and Small Critical Sections

This topic introduces the concept of critical section size, defined as the length of time a thread spends inside a critical section, and its effect on performance.
Author: Admin Last updated: 2019/07/05 - 20:05
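One way to picture critical section size is the minimal sketch below, which keeps the expensive computation outside the lock and holds the lock only for the final shared update. The function name `sum_of_squares` and the `chunks` parameter are assumptions made for illustration and do not come from the article.

```python
import threading


def sum_of_squares(chunks):
    """Sum the squares of all values, one thread per chunk."""
    lock = threading.Lock()
    result = {"total": 0}

    def worker(values):
        # The long-running part runs without holding the lock.
        partial = sum(v * v for v in values)
        # Small critical section: a single addition to shared state.
        with lock:
            result["total"] += partial

    threads = [threading.Thread(target=worker, args=(c,)) for c in chunks]
    for thread in threads:
        thread.start()
    for thread in threads:
        thread.join()
    return result["total"]
```

Moving the `sum(...)` call inside the `with lock:` block would still give the right answer, but each thread would then hold the lock for the whole computation and the workers would effectively run one at a time.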
Article

Use Synchronization Routines Provided by the Threading API Rather Than Hand-Coded Synchronization

Application programmers sometimes write hand-coded synchronization routines rather than using constructs provided by a threading API in order to reduce synchronization overhead or provide different functionality than existing constructs offer.
Author: Admin Last updated: 2019/07/05 - 20:03
Article

Choosing Appropriate Synchronization Primitives to Minimize Overhead

Currently, there are a number of synchronization mechanisms available, and it is left to the application developer to choose an appropriate one to minimize overall synchronization overhead.
Author: Admin Last updated: 2019/07/05 - 20:02
Article

Use Non-blocking Locks When Possible

Non-blocking system calls let a competing thread return immediately when its attempt to acquire the lock fails, so the thread can do useful work in the meantime instead of wasting execution resources while it waits.
Author: Admin Last updated: 2019/07/05 - 20:01
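A minimal sketch of the non-blocking pattern, using Python's `threading.Lock.acquire(blocking=False)` as a stand-in for whichever try-lock call the article discusses. The helper name `try_update` and its parameters are hypothetical.

```python
import threading


def try_update(lock, shared, value):
    """Append value to shared if the lock is free; never wait for it.

    Returns True when the update happened and False when the lock was busy,
    so the caller can do other work and retry later.
    """
    if lock.acquire(blocking=False):  # returns at once, True or False
        try:
            shared.append(value)
            return True
        finally:
            lock.release()
    return False
```

A caller might loop over its own independent work and call `try_update` between work items, retrying the shared update only when the call returns `False`.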
Article

Avoiding Heap Contention Among Threads

Avoiding Heap Contention Among Threads (PDF 256KB)

Abstract

Author: Admin Last updated: 2019/07/05 - 19:59
Article

Detecting Memory Bandwidth Saturation in Threaded Applications

Detecting Memory Bandwidth Saturation in Threaded Applications (PDF 231KB)

Author: Admin Last updated: 2019/07/05 - 19:58
Article

Predicting and Measuring Parallel Performance

The success of parallelization is typically quantified by measuring the speedup of the parallel version relative to the serial version. It is also useful to compare that speedup with the upper limit of the potential speedup.
Author: Admin Last updated: 2019/07/05 - 10:35
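The two quantities in this summary can be sketched as follows. The summary does not say how the upper limit is derived, so the sketch assumes Amdahl's law, a common choice. The function names and the `parallel_fraction` parameter are illustrative and do not come from the article.

```python
def measured_speedup(serial_seconds, parallel_seconds):
    """Speedup of the parallel version relative to the serial version."""
    return serial_seconds / parallel_seconds


def amdahl_limit(parallel_fraction, n_threads):
    """Upper bound on speedup under Amdahl's law (an assumption here).

    parallel_fraction is the share of serial run time that can be
    parallelized, between 0 and 1; the remainder stays serial.
    """
    serial_fraction = 1.0 - parallel_fraction
    return 1.0 / (serial_fraction + parallel_fraction / n_threads)
```

For example, a program that drops from 10 seconds to 4 seconds has a measured speedup of 2.5. If 90% of its serial run time is parallelizable and it runs on 4 threads, the Amdahl bound is about 3.08, so the measured result reaches roughly four fifths of what this model allows.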
Article

Granularity and Parallel Performance

One key to attaining good parallel performance is choosing the right granularity for the application. Granularity is the amount of real work in the parallel task. If granularity is too fine, then performance can suffer from communication overhead.
Author: Admin Last updated: 2019/07/05 - 19:53
Article

Exposing Parallelism by Avoiding or Removing Artificial Dependencies

Many applications and algorithms contain serial optimizations that inadvertently introduce data dependencies and inhibit parallelism. One can often remove such dependencies through simple transforms, or even avoid them altogether through…
Author: Admin Last updated: 2019/07/05 - 19:50
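As one hedged example of such a transform (not necessarily one the article covers), the sketch below removes a running-value dependency by computing each element directly from its loop index. The names `fill_serial` and `fill_independent` are hypothetical.

```python
def fill_serial(n, start, step):
    """Serial optimization: reuse the previous value instead of multiplying."""
    out = []
    value = start
    for _ in range(n):
        out.append(value)
        value += step  # each iteration depends on the one before it
    return out


def fill_independent(n, start, step):
    """Same result, but every element depends only on its own index."""
    return [start + i * step for i in range(n)]
```

Because `fill_independent` has no loop-carried state, its iterations could be assigned to threads in any order. With floating-point `step` values the two versions may differ in the last bits, since repeated addition rounds differently from a single multiplication.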