Intel® Parallel Studio

Using Intel® Inspector XE 2011 to Find Data Races in Multithreaded Code

Intel Inspector XE 2011 automatically finds memory errors, deadlocks and other conditions that could lead to deadlocks, data races, thread . Some specific issues associated with debugging multithreaded applications will be discussed in this article.
  • Développeurs
  • Linux*
  • Microsoft Windows* (XP, Vista, 7)
  • Débutant
  • Intel® Parallel Studio
  • Intel® Parallel Studio XE
  • Intel® Inspector XE
  • Intel® Parallel Inspector
  • critical section
  • data races
  • Learning Lab
  • OpenMP*
  • Informatique parallèle
  • Parallélisation
  • Loop Modifications to Enhance Data-Parallel Performance

    When confronted with nested loops, the granularity of the computations that are assigned to threads will directly affect performance. Loop transformations such as splitting and merging nested loops can make parallelization easier and more productive.
  • Développeurs
  • Linux*
  • Microsoft Windows* (XP, Vista, 7)
  • Serveur
  • Intermédiaire
  • Compilateur Intel® C++
  • Compilateur Intel® Fortran
  • Intel® Parallel Composer
  • Intel® Parallel Studio
  • Intel® Parallel Studio XE
  • Optimisation
  • Informatique parallèle
  • Granularity and Parallel Performance

    One key to attaining good parallel performance is choosing the right granularity for the application. Granularity is the amount of real work in the parallel task. If granularity is too fine, then performance can suffer from communication overhead.
  • Développeurs
  • Linux*
  • Microsoft Windows* (XP, Vista, 7)
  • Serveur
  • Intermédiaire
  • Compilateur Intel® C++
  • Compilateur Intel® Fortran
  • Intel® Parallel Composer
  • Intel® Parallel Studio
  • Intel® Parallel Studio XE
  • Informatique parallèle
  • Avoiding Heap Contention Among Threads

    Avoiding Heap Contention Among Threads (PDF 256KB)

    Abstract

    Allocating memory from the system heap can be an expensive operation due to a lock used by system runtime libraries to synchronize access to the heap. Contention on this lock can limit the performance benefits from multithreading. To solve this problem, apply an allocation strategy that avoids using shared locks, or use third party heap managers.

  • Développeurs
  • Microsoft Windows* 8.x
  • Serveur
  • Intermédiaire
  • Intel® Parallel Studio
  • Intel® Parallel Amplifier
  • synchronization
  • heap contention
  • dynamic memory allocation
  • lock contention
  • stack allocation
  • Informatique parallèle
  • Parallélisation
  • Using Intel® AVX without Writing AVX

    Intel® AVX is a new 256-bit instruction set extension to Intel® Streaming SIMD Extensions and is designed for applications that are floating point intensive. This paper discusses options to integrate Intel® AVX into an application via use of intrinsics.
  • Linux*
  • Microsoft Windows* (XP, Vista, 7)
  • Apple OS X*
  • Avancé
  • Compilateur Intel® C++
  • Compilateur Intel® Fortran
  • Intel® Parallel Composer
  • Intel® Parallel Studio
  • Intel® Parallel Studio XE
  • Emberson beta program
  • Intel® Advanced Vector Extensions
  • Informatique parallèle
  • S’abonner à Intel® Parallel Studio