Forum topic

Intel OpenMP Runtime: TASK_ID in __kmp_task_alloc(...) routine call

Hi ..

I am trying to retrieve the identity of an (explicit) task at runtime. I found the routine __kmp_task_alloc: Allocate the taskdata and task data structures for a task, Here 

Author Rezaul R. Last updated 11/02/2016 - 18:19
Blog post

An example to quickly solve performance issue in OpenMP* program by using VTune Amplifier’s results

If you compile and run your OpenMP* code with Intel Compiler 13.1 Update 2 or later, you can use the advanced-hotspots analysis in VTune(TM) Amplifier XE 2015 Update 4 to get important metrics, which are categorized into "Serial Time" and "Parallel Region Time". An “OpenMP Potential Gain” metric is also provided to show whether there is more room to optimize the code. Meanwhile, VTune Amplifier highlights:
Author Peter Wang Last updated 10/02/2016 - 15:24
Forum topic

Is there actually a difference between two tight loop parallelization OpenMP pragmas ?

Dear colleagues,

Is there actually a difference between :

#pragma omp parallel for
for (int i = 0; i < 10; i++) { /* **** */ }

AND

Author Arthur V. Ratz Last updated 05/02/2016 - 22:31
Forum topic

Is there any difference between using different synchronization objects supported by OpenMP and Win32api ?

Dear colleagues,

Is there any difference between using :

omp_set_lock(...), omp_unset_lock(...),

Win32api EnterCriticalSection(&cs), LeaveCriticalSection(&cs)

Author Arthur V. Ratz Last updated 05/02/2016 - 22:17
Forum topic

How to properly pin each h/w thread to a particular CPU core using KMP_AFFINITY macro ?

Can you post an example of how to properly pin each h/w thread to a particular CPU core using KMP_AFFINITY macro ?

Thanks in advance.

Cheers, Arthur.

Author Arthur V. Ratz Last updated 05/02/2016 - 22:14
Article

Putting Your Data and Code in Order: Data and layout - Part 2

This pair of articles on performance and memory covers basic concepts to provide guidance to developers seeking to improve software performance. This paper expands on concepts discussed in Part 1 to consider parallelism: vectorization (single instruction, multiple data, or SIMD), shared-memory parallelism (threading), and distributed-memory computing.
Author David M. Last updated 05/02/2016 - 16:49
Article

Using Intel® Inspector XE to Find Data Races in Multithreaded Code

Intel Inspector XE automatically finds memory errors, deadlocks and other conditions that could lead to deadlocks, data races, thread . Some specific issues associated with debugging multithreaded applications will be discussed in this article.
Author admin Last updated 05/02/2016 - 12:07
Article

Explicit Vector Programming in Fortran

No longer does Moore’s Law result in higher frequencies and improved scalar application performance; instead, higher transistor counts lead to increased parallelism, both through more cores and thr

Author Martyn Corden (Intel) Last updated 04/02/2016 - 13:31
Blog post

Slides from the talk on Parallel Computing at FISL14

The talk "Como domar uma fera de 1 TFlop que cabe na palma da sua mão" ("How to tame a 1 TFlop beast that fits in the palm of your hand") was presented on 3/7/13, at FISL14, by Luciano Palma - Intel Community Manager for Servers and High

Author Luciano Palma (Intel) Last updated 03/02/2016 - 10:18
Article

Intel® Parallel Studio XE 2015 Update 2 Cluster Edition Readme

The Intel® Parallel Studio XE 2015 Update 2 Cluster Edition for Linux* and Windows* combines all Intel® Parallel Studio XE and Intel® Cluster Tools into a single package.

Author Gergana S. (Intel) Last updated 02/02/2016 - 11:21
For more complete information about compiler optimizations, see our Optimization Notice.