Technical article

Pipeline and the Efficient Chef (Part 2)

Advanced computer concepts for the (not so) common Chef

In Pipeline and the Efficient Chef (Part 1), we showed how the basic pipeline is equivalent to what our Chef does when following one step in his recipe. To say it differently, the execution of one machine language instruction is equivalent to our Chef performing one step of a complicated recipe.

Finding Data Races in Multithreaded Code with Intel® Inspector XE 2011

Intel Inspector XE automatically finds memory errors, deadlocks and conditions that could lead to deadlocks, and data races. This article discusses some specific issues associated with debugging multithreaded applications.
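As a rough illustration of the kind of defect described above, here is a minimal Python sketch of a shared counter protected by a critical section. It does not show Intel Inspector XE usage; the function name `count_with_lock` and its parameters are hypothetical and chosen only for this example. The unprotected read-modify-write on `counter` is the operation that a data race would corrupt, and the lock is what serializes it.

```python
import threading


def count_with_lock(n_threads: int, n_iters: int) -> int:
    """Increment a shared counter from several threads inside a critical section.

    Hypothetical helper for illustration only.
    """
    counter = 0
    lock = threading.Lock()

    def worker() -> None:
        nonlocal counter
        for _ in range(n_iters):
            # Critical section: only one thread at a time performs the
            # read-modify-write on the shared counter.
            with lock:
                counter += 1

    threads = [threading.Thread(target=worker) for _ in range(n_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return counter


if __name__ == "__main__":
    print(count_with_lock(4, 10_000))
```

With the lock in place the result always equals `n_threads * n_iters`. If the `with lock:` line were removed, increments from different threads could interleave and some updates could be lost, which is the pattern a race detector is meant to flag.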
  • Linux*
  • Microsoft Windows* (XP, Vista, 7)
  • Beginner
  • Intel® Parallel Studio XE
  • Intel® Parallel Studio XE Composer Edition
  • Intel® Inspector
  • Intel® Parallel Inspector
  • critical section
  • data races
  • Learning Lab
  • OpenMP*
  • Parallel computing
  • Thread
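  • Optimizing Data Structures and Memory Access Patterns to Improve Data Locality

    Optimizing Data Structures and Memory Access Patterns to Improve Data Locality (PDF 782KB)

    Abstract

    The cache is one of the most important resources of a modern CPU: it is a smaller, faster part of the memory subsystem that holds copies of the most frequently used memory locations. When the data an instruction needs resides in the cache, the instruction executes immediately. Otherwise, execution may stall until the required data is fetched from memory. Because copying data from memory is a long-latency operation, we want to design algorithms and data structures that exploit data locality and so minimize cache misses.

    This article describes the symptoms of poor data locality, techniques for detecting the related performance bottlenecks, and optimizations that can resolve the problem.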
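    The effect of access pattern on locality can be sketched with two traversals of the same flat, row-major array. This is an illustrative Python example, not code from the article; the function names are hypothetical. Both loops compute the same sum, but the first walks memory with stride 1 while the second jumps `cols` elements between consecutive accesses. In CPython the timing difference is muted because list elements are object references; the pattern matters far more for contiguous arrays in C, C++ or Fortran.

```python
def make_matrix(rows: int, cols: int) -> list:
    """Store a rows x cols matrix in one flat list, row after row (row-major)."""
    return [r * cols + c for r in range(rows) for c in range(cols)]


def sum_row_major(flat: list, rows: int, cols: int) -> int:
    """Visit elements in storage order: consecutive accesses are adjacent."""
    total = 0
    for r in range(rows):
        for c in range(cols):
            total += flat[r * cols + c]  # stride of 1 element
    return total


def sum_column_major(flat: list, rows: int, cols: int) -> int:
    """Visit elements column by column: consecutive accesses are cols apart."""
    total = 0
    for c in range(cols):
        for r in range(rows):
            total += flat[r * cols + c]  # stride of cols elements
    return total


if __name__ == "__main__":
    m = make_matrix(3, 4)
    print(sum_row_major(m, 3, 4), sum_column_major(m, 3, 4))
```

    Swapping the loop order so that the innermost loop follows the storage order is one of the simplest locality optimizations, since each fetched cache line is then fully used before it is evicted.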

  • Server
  • Parallel computing
  • Granularity and Parallel Performance

    One key to attaining good parallel performance is choosing the right granularity for the application. Granularity is the amount of real work in the parallel task. If granularity is too fine, then performance can suffer from communication overhead.
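    To make the idea of granularity concrete, the following Python sketch splits a summation into tasks of a chosen size. It is an illustration under stated assumptions, not code from the article: `split_into_chunks`, `parallel_sum` and the `chunk_size` parameter are hypothetical names. The function returns both the result and the number of tasks created, so the scheduling overhead implied by a fine granularity is visible. CPython threads do not speed up CPU-bound work, so this shows task counts rather than real speedup.

```python
from concurrent.futures import ThreadPoolExecutor


def split_into_chunks(data: list, chunk_size: int) -> list:
    """Cut data into consecutive pieces of at most chunk_size elements."""
    return [data[i:i + chunk_size] for i in range(0, len(data), chunk_size)]


def parallel_sum(data: list, chunk_size: int, max_workers: int = 4) -> tuple:
    """Sum data using one task per chunk; return (total, number_of_tasks)."""
    chunks = split_into_chunks(data, chunk_size)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # Each chunk is one parallel task; its size is the task's granularity.
        partial_sums = list(pool.map(sum, chunks))
    return sum(partial_sums), len(chunks)


if __name__ == "__main__":
    data = list(range(1000))
    print(parallel_sum(data, 1))    # fine-grained: one task per element
    print(parallel_sum(data, 250))  # coarser: one task per worker
```

    With `chunk_size=1` the pool must schedule one task per element, so the cost of creating and dispatching tasks can dominate the single addition each task performs. With `chunk_size=250` the same work is covered by four tasks.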
  • Linux*
  • Microsoft Windows* (XP, Vista, 7)
  • Server
  • Intermediate
  • Intel® Parallel Studio XE
  • Intel® Parallel Studio XE Composer Edition
  • Intel® C++ Compiler
  • Intel® Fortran Compiler
  • Parallel computing