Article

使用 OpenMP* 实现并行化

面向英特尔® MIC 架构进行应用的适用性分析

文档

Authored by Ronald W Green (Blackbelt) Last updated on 03/21/2019 - 12:08
Article

在英特尔® 集成众核 (英特尔® MIC) 架构上使用 OpenMP* 的最佳设计方案

本文是对“面向 Linux* 的英特尔® Composer XE”文档的补充。 本文对在为英特尔集成众核 (Intel MIC) 架构编写卸载和本地程序时使用 C/C++ 和 Fortran 的 OpenMP* 扩展的最佳方法进行了概括。
Authored by Last updated on 03/21/2019 - 12:00
Article

应用蚁群优化算法 (ACO) 实施交通网络扩展

In this article an OpenMP* based implementation of the Ant Colony Optimization algorithm was analyzed for bottlenecks with Intel® VTune™ Amplifier XE 2016 together with improvements using hybrid MPI-OpenMP and Intel® Threading Building Blocks were introduced to achieve efficient scaling across a four-socket Intel® Xeon® processor E7-8890 v4 processor-based system.
Authored by Sunny G. (Intel) Last updated on 07/05/2019 - 19:13
Article

高效并行化

高效并行化文档

面向英特尔® 集成众核架构的编译器方法

高效并行化

Authored by Ronald W Green (Blackbelt) Last updated on 03/21/2019 - 12:00
Article

面向英特尔® 至强融核™ 处理器(代号“Knights Landing”)的开发人员访问计划

Intel is bringing to market, in anticipation of general availability of the Intel® Xeon Phi™ Processor (codenamed Knights Landing), the Developer Access Program (DAP). DAP is an early access program for developers worldwide to purchase an Intel Xeon Phi Processor based system.
Authored by Mike P. (Intel) Last updated on 03/21/2019 - 12:00
Article

循环修改增强数据并行性能

When confronted with nested loops, the granularity of the computations that are assigned to threads will directly affect performance. Loop transformations such as splitting and merging nested loops can make parallelization easier and more productive.
Authored by admin Last updated on 07/05/2019 - 14:48
Article

面向英特尔® 至强融核™ 协处理器(和英特尔® 至强® 处理器)架构应用的浮点计算 R2R 再现性

 

问题

如果在相同处理器上针对相同输入数据重新运行相同的程序,得到的结果相同吗?

Authored by Last updated on 03/21/2019 - 12:08
Article

粒度与并行性能

One key to attaining good parallel performance is choosing the right granularity for the application. Granularity is the amount of real work in the parallel task. If granularity is too fine, then performance can suffer from communication overhead.
Authored by admin Last updated on 07/05/2019 - 19:53
Article

解读Intel编译器的offload报告

英特尔编译器在对代码进行编译优化的过程中用户可以通过使用”-opt-report-phase=phase”选项让编译器输出某些特定优化阶段的相关信息。针对至强融核™ 协处理器提供的offload编译模式英特尔编译器提供了”offload”关键字。

Authored by Duan, Xiaoping (Intel) Last updated on 06/07/2017 - 10:36
Article

避免线程之间发生堆冲突

避免线程之间发生堆冲突 (PDF 256KB)

摘要

Authored by admin Last updated on 07/05/2019 - 19:59