Improving the performance of parallel software requires a structured approach that makes good use of development resources and delivers good results quickly.
(This work was done by Vivek Lingegowda during his internship at Intel.)
Most multi-threaded software uses locking. Lock optimization has traditionally aimed to reduce lock contention, typically by making the critical regions smaller.
Intel® TSX exposes a speculative execution mode to the programmer to improve locking performance. Tuning speculation relies heavily on a PMU profiler.
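As a rough illustration of what shrinking a critical region means, the sketch below counts words from several threads in two ways. It is not taken from the article: the function names (`count_words_coarse`, `count_words_fine`, `_run_workers`) and the word-counting task are hypothetical, chosen only to show the pattern. The first version does all of its work while holding the lock; the second does the expensive part on thread-local data and holds the lock only for a short merge.

```python
import threading
from collections import Counter


def _run_workers(worker, chunks, num_threads):
    """Start num_threads threads, each handling an interleaved slice of chunks."""
    threads = [
        threading.Thread(target=worker, args=(chunks[i::num_threads],))
        for i in range(num_threads)
    ]
    for t in threads:
        t.start()
    for t in threads:
        t.join()


def count_words_coarse(chunks, num_threads=4):
    """Large critical region: splitting and counting both happen under the lock."""
    totals = Counter()
    lock = threading.Lock()

    def worker(my_chunks):
        for text in my_chunks:
            with lock:  # lock is held for the whole per-chunk computation
                for word in text.split():
                    totals[word] += 1

    _run_workers(worker, chunks, num_threads)
    return dict(totals)


def count_words_fine(chunks, num_threads=4):
    """Small critical region: count locally, hold the lock only to merge."""
    totals = Counter()
    lock = threading.Lock()

    def worker(my_chunks):
        local = Counter()
        for text in my_chunks:
            local.update(text.split())  # no lock needed for thread-local data
        with lock:  # one short critical section per thread
            totals.update(local)

    _run_workers(worker, chunks, num_threads)
    return dict(totals)
```

Both functions return the same counts; the difference is how long each thread keeps the other threads waiting. In the second version the lock is acquired once per thread rather than once per chunk, so there is far less opportunity for contention.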
We demonstrate how to create a Sierpinski Carpet in OpenCL* 2.0.
Tim Mattson (Intel) has authored an extensive series of excellent videos as an introduction to OpenMP*.