Video

第 4 集:线程并行化和 OpenMP*

我们将讨论软件线程,尤其是使用 OpenMP 库的多线程实施。

Authored by Last updated on 04/26/2019 - 04:06
Video

第 6 集:Fork-Join 模型 OpenMP* 任务

现在我们来介绍 Fork-Join 并行化。

Authored by Last updated on 04/26/2019 - 04:06
Video

第 5 集:并行循环、私有和共享变量、调度

我们将介绍私有和共享变量、并行循环及其调度。

Authored by Last updated on 04/26/2019 - 04:06
Video

第 8 集:并行规约

我们将讨论 OpenMP for 循环中的并行规约。

Authored by Last updated on 04/26/2019 - 04:06
Article

著作 - High Performance Parallelism Pearls

A look into the contents of the two "Pearls" books, edited by James Reinders and Jim Jeffers. These books contain a collection of examples of code modernization.
Authored by Mike P. (Intel) Last updated on 03/21/2019 - 12:00
Article

面向英特尔® 架构优化的 Caffe*:使用现代代码技巧

This paper demonstrates a special version of Caffe* — a deep learning framework originally developed by the Berkeley Vision and Learning Center (BVLC) — that is optimized for Intel® architecture.
Authored by Last updated on 07/06/2019 - 16:40
Article

准确预报各种天气:英特尔五步框架帮助实现代码现代化

天气预报是现代生活的一个重要方面,它可在出现恶劣天气状况时即时发出警报,从而帮助有效制定计划和安排物流,并可保护生命财产安全。 但是,准确预测长期的天气情况非常复杂,通常涉及到大量数据集,并且要求对代码进行优化以利用最高级的计算机硬件功能。

Authored by Last updated on 03/21/2019 - 12:00
Article

面向英特尔® 至强融核™ 处理器(代号“Knights Landing”)的开发人员访问计划

Intel is bringing to market, in anticipation of general availability of the Intel® Xeon Phi™ Processor (codenamed Knights Landing), the Developer Access Program (DAP). DAP is an early access program for developers worldwide to purchase an Intel Xeon Phi Processor based system.
Authored by Mike P. (Intel) Last updated on 03/21/2019 - 12:00
Article

整理您的数据和代码: 数据和布局 - 第 2 部分

Apply the concepts of parallelism and distributed memory computing to your code to improve software performance. This paper expands on concepts discussed in Part 1, to consider parallelism, both vectorization (single instruction multiple data SIMD) as well as shared memory parallelism (threading), and distributed memory computing.
Authored by David M. Last updated on 07/06/2019 - 16:40
Article

异构分布式系统上的有限差分

Our building block is the FD compute kernels that are typically used for RTM (reverse time migration) algorithms for seismic imaging. The computations performed by the ISO-3DFD (Isotropic 3-dimensional finite difference) stencils play a major role in accurate imaging of complex subsurface structures in oil and gas surveys and exploration. Here we leverage the ISO-3DFD discussed in [1] and [2] and...
Authored by Leonardo B. (Intel) Last updated on 07/06/2019 - 16:40