Video

第 4 集:线程并行化和 OpenMP*

我们将讨论软件线程,尤其是使用 OpenMP 库的多线程实施。

Authored by Last updated on 04/26/2019 - 04:06
Video

第 5 集:并行循环、私有和共享变量、调度

我们将介绍私有和共享变量、并行循环及其调度。

Authored by Last updated on 04/26/2019 - 04:06
Video

第 8 集:并行规约

我们将讨论 OpenMP for 循环中的并行规约。

Authored by Last updated on 04/26/2019 - 04:06
Video

第 7 集:竞态条件和互斥体

我们将讨论使用关键和原子编译时 OpenMP 线程之间的竞态条件和同步。

Authored by Last updated on 04/26/2019 - 04:06
Article

面向英特尔® 架构优化的 Caffe*:使用现代代码技巧

This paper demonstrates a special version of Caffe* — a deep learning framework originally developed by the Berkeley Vision and Learning Center (BVLC) — that is optimized for Intel® architecture.
Authored by Last updated on 07/06/2019 - 16:40
Article

面向英特尔® 至强融核™ 处理器(代号“Knights Landing”)的开发人员访问计划

Intel is bringing to market, in anticipation of general availability of the Intel® Xeon Phi™ Processor (codenamed Knights Landing), the Developer Access Program (DAP). DAP is an early access program for developers worldwide to purchase an Intel Xeon Phi Processor based system.
Authored by Mike P. (Intel) Last updated on 03/21/2019 - 12:00
Article

整理您的数据和代码: 数据和布局 - 第 2 部分

Apply the concepts of parallelism and distributed memory computing to your code to improve software performance. This paper expands on concepts discussed in Part 1, to consider parallelism, both vectorization (single instruction multiple data SIMD) as well as shared memory parallelism (threading), and distributed memory computing.
Authored by David M. Last updated on 07/06/2019 - 16:40
Article

什么是代码现代化?

现代高性能计算机由下列资源组合构建而成:多核处理器、

Authored by Mike P. (Intel) Last updated on 07/06/2019 - 16:40
Article

异构分布式系统上的有限差分

Our building block is the FD compute kernels that are typically used for RTM (reverse time migration) algorithms for seismic imaging. The computations performed by the ISO-3DFD (Isotropic 3-dimensional finite difference) stencils play a major role in accurate imaging of complex subsurface structures in oil and gas surveys and exploration. Here we leverage the ISO-3DFD discussed in [1] and [2] and...
Authored by Leonardo B. (Intel) Last updated on 07/06/2019 - 16:40
Article

案例研究: 面向神经细胞模拟优化代码

Intel held the Intel® Modern Code Developer Challenge that had about 2,000 students from 130 universities in 19 countries registered to participate in the Challenge. They were provided access to Intel® Xeon Phi™ coprocessors to optimize code used in a CERN openlab brain simulation research project. In this article Daniel Vea Falguera (Modern Code Developer Challenge winner) shares how he...
Authored by Last updated on 07/06/2019 - 16:40