Article

应用蚁群优化算法 (ACO) 实施交通网络扩展

In this article an OpenMP* based implementation of the Ant Colony Optimization algorithm was analyzed for bottlenecks with Intel® VTune™ Amplifier XE 2016 together with improvements using hybrid MPI-OpenMP and Intel® Threading Building Blocks were introduced to achieve efficient scaling across a four-socket Intel® Xeon® processor E7-8890 v4 processor-based system.
Authored by Sunny G. (Intel) Last updated on 07/05/2019 - 19:13
Video

第 9 集:分布式内存并行化和 MPI

在本章上一集中,我们学习了如何使用矢量在每个内核的矢量平面间并行化计算。 然后,我们讨论了如何使用 OpenMP 在每颗处理器或协处理器的内核间扩展应用。 接下来,在本章最后一集 4.9 集中,我们将研究下一级别的并行化:在多台计算设备和集群环境的多个计算节点间扩展。

Authored by Last updated on 04/26/2019 - 04:06
Article

著作 - High Performance Parallelism Pearls

A look into the contents of the two "Pearls" books, edited by James Reinders and Jim Jeffers. These books contain a collection of examples of code modernization.
Authored by Mike P. (Intel) Last updated on 03/21/2019 - 12:00
Article

什么是代码现代化?

现代高性能计算机由下列资源组合构建而成:多核处理器、

Authored by Mike P. (Intel) Last updated on 07/06/2019 - 16:40
Article

异构分布式系统上的有限差分

Our building block is the FD compute kernels that are typically used for RTM (reverse time migration) algorithms for seismic imaging. The computations performed by the ISO-3DFD (Isotropic 3-dimensional finite difference) stencils play a major role in accurate imaging of complex subsurface structures in oil and gas surveys and exploration. Here we leverage the ISO-3DFD discussed in [1] and [2] and...
Authored by Leonardo B. (Intel) Last updated on 07/06/2019 - 16:40
Article

如何在英特尔® 至强融核™ 处理器中使用 MPI-3 共享内存

学习如何在英特尔® 至强融核™ 处理器中使用 MPI-3 共享内存
Authored by Nguyen, Loc Q (Intel) Last updated on 07/06/2019 - 16:30
Article

面向使用 PME 工作负载的对称英特尔® MPI 的 GROMACS 方案

目标

该文件包(脚本及其说明)提供了针对对称英特尔运行的构建和运行环境。 该文件实际上是自述 (README) 文件包。 对称指采用至强™ 可执行文件和至强融核™ 可执行文件,两者通过英特尔 MPI 同时运行以传输 MPI 消息和集体数据。

Authored by Heinrich Bockhorst (Intel) Last updated on 07/06/2019 - 16:40