Article

英特尔® MKL 提供针对 2D/3D FFT 的 Split Complex (real real) 支持

Split complex (real real) support for 2D/3D FFTs has been added from Intel® MKL 10.3 onwards.
Autor Vipin Kumar E K (Intel) Última actualización 27/03/2019 - 12:20
Article

英特尔® 线程构建模块:面向多核的可扩展编程

Intel’s new parallel programming model is a new set of Libraries developed by Intel Software and Solutions Group in order to help developers write scalable code without worrying about managing threads.
Autor Última actualización 25/05/2018 - 15:30
Article

OpenCL™ Device Fission 助力 CPU 性能

下载 PDF

Autor Última actualización 31/05/2019 - 14:20
Article
Article

循环修改增强数据并行性能

When confronted with nested loops, the granularity of the computations that are assigned to threads will directly affect performance. Loop transformations such as splitting and merging nested loops can make parallelization easier and more productive.
Autor admin Última actualización 05/07/2019 - 14:48
Article

检测线程应用中的内存带宽饱和度

检测线程应用中的内存带宽饱和度 (PDF 231KB)

Autor admin Última actualización 05/07/2019 - 19:58
Article

避免并发现线程之间的假共享

避免并发现线程之间的假共享 (PDF 218KB)

摘要

Autor admin Última actualización 25/05/2018 - 15:30
Article

粒度与并行性能

One key to attaining good parallel performance is choosing the right granularity for the application. Granularity is the amount of real work in the parallel task. If granularity is too fine, then performance can suffer from communication overhead.
Autor admin Última actualización 05/07/2019 - 19:53
Article

选择性地使用 gatherhint/scatterhint 指令

面向英特尔® MIC 架构的编译器方法

选择性地使用 gatherhint/scatterhint 指令

Autor AmandaS (Intel) Última actualización 30/09/2019 - 17:30
Article

高效并行化

高效并行化文档

面向英特尔® 集成众核架构的编译器方法

高效并行化

Autor Ronald W Green (Blackbelt) Última actualización 30/09/2019 - 17:30