Vectorizing Loops with Calls to User-Defined External Functions


Автор: Anoop M. (Intel) Последнее обновление: 12.12.2018 - 18:00

Putting Your Data and Code in Order: Data and layout - Part 2

Apply the concepts of parallelism and distributed memory computing to your code to improve software performance. This paper expands on concepts discussed in Part 1, to consider parallelism, both vectorization (single instruction multiple data SIMD) as well as shared memory parallelism (threading), and distributed memory computing.
Автор: David M. Последнее обновление: 15.10.2019 - 16:40