Parallel Universe Magazine #12: Advanced Vectorization

This blog contains additional content for the article "Advanced Vectorization" from Parallel Universe #12:

作者: 最后更新时间: 2019/07/03 - 20:08

Exploring Intel® Transactional Synchronization Extensions with Intel® Software Development Emulator

Intel® Transactional Synchronization Extensions (Intel® TSX) is perhaps one of the most non-trivial extensions of instruction set architecture introduced in the 4th generation Intel® Cor

作者: Roman Dementiev (Intel) 最后更新时间: 2019/07/06 - 17:00

Windows* 8 Tutorial: Writing a Multithreaded Application for the Windows Store* using Intel® Threading Building Blocks.

It is known that the Windows Store apps API is missing some commonly used thread functions, such as function CreateThread() and those that

作者: Vladimir Polin (Intel) 最后更新时间: 2018/12/20 - 09:40

Intel® System Studio - Multicore Programming with Intel® Cilk™ Plus

Intel System Studio not only provides a variety of signal processing primitives via Intel® Integrated Performance Primitives (Intel® IPP), and Intel® Math Kernel Library (Intel® MKL), but also allows developing high-performance low-latency custom code (Intel C++ Compiler with Intel Cilk Plus). Since Intel Cilk Plus is built into the compiler, it can be used where it demands an efficient threading...
作者: Hans P. (Intel) 最后更新时间: 2017/12/11 - 10:48

Combiner/Aggregator Synchronization Primitive

Combiner/Aggregator synchronization primitive provides mutual exclusion like a mutex, but can be significantly faster in some situations due to better cache performance.
作者: Dmitry Vyukov 最后更新时间: 2017/06/14 - 16:09

Optimization of Data Read/Write in a Parallel Application

(This work was done by Vivek Lingegowda during his internship at Intel.)

作者: 最后更新时间: 2019/07/04 - 17:40

Tutorial Windows* 8: Escrevendo uma Aplicação Multithreaded para a Windows Store* usando a biblioteca Intel® Threading Building Blocks.

É sabido que a API das aplicações Windows Store não disponibiliza algumas funções comuns para trabalhar com Threads, como a CreateThread e

作者: 最后更新时间: 2018/12/20 - 09:40

The switch() statement isn't really evil, right?

In my current position, I work to optimize and parallelize codes that deal with genomic data, e.g., DNA, RNA, proteins, etc.

作者: Clay B. (Blackbelt) 最后更新时间: 2019/07/04 - 10:46

Scale-Up Implementation of a Transportation Network Using Ant Colony Optimization (ACO)

In this article an OpenMP* based implementation of the Ant Colony Optimization algorithm was analyzed for bottlenecks with Intel® VTune™ Amplifier XE 2016 together with improvements using hybrid MPI-OpenMP and Intel® Threading Building Blocks were introduced to achieve efficient scaling across a four-socket Intel® Xeon® processor E7-8890 v4 processor-based system.
作者: Sunny G. (Intel) 最后更新时间: 2019/07/05 - 19:10