Filters

Video

Part 5: Parallel Loops, Private and Shared Variables, Scheduling

We will introduce private and shared variables, parallel loops, and their scheduling.

Videos Within This Chapter:

Authored by admin Last updated on 03/21/2019 - 12:00
Video

Part 6: Fork-Join Model OpenMP* Tasks

Let's talk about Fork-Join parallelism.

Videos Within This Chapter:

Authored by admin Last updated on 03/21/2019 - 12:00
Video

Part 8: Parallel Reduction

We will talk about parallel reduction in OpenMP* for-loops.

Videos Within This Chapter:

Authored by admin Last updated on 03/21/2019 - 12:00
Video

Part 17: Optimization of Communication: MPI

In this episode, we will mention the optimization opportunities for distributed-memory applications that use Intel® MPI Library and Intel® Xeon Phi™ coprocessors.

Authored by admin Last updated on 03/21/2019 - 12:08
Article

著作 - High Performance Parallelism Pearls

A look into the contents of the two "Pearls" books, edited by James Reinders and Jim Jeffers. These books contain a collection of examples of code modernization.
Authored by Mike P. (Intel) Last updated on 03/21/2019 - 12:00
Article

Finite Differences on Heterogeneous Distributed Systems

Learn about a technique that deals with the load imbalance of heterogeneous distributed systems, plus get sample source code.
Authored by Leonardo B. (Intel) Last updated on 07/06/2019 - 16:40
Article

Get a Helping Hand from the Vectorization Advisor

Learn practical tips for using the vectorization advisor, which is part of Intel® Advisor.
Authored by Last updated on 07/06/2019 - 16:40
Article

High-Performance, Modern Code Optimizations for Computational Fluid Dynamics

Modern server farms consist of a large number of heterogeneous, energy-efficient, and very high-performance computing nodes connected with each other through a high-bandwidth network interconnect. Such systems pose one of the biggest challenges for engineers and scientists today: how to solve complex, real-world problems by efficiently using the enormous computational horsepower available from...
Authored by Last updated on 07/06/2019 - 16:40
Article

异构分布式系统上的有限差分

Our building block is the FD compute kernels that are typically used for RTM (reverse time migration) algorithms for seismic imaging. The computations performed by the ISO-3DFD (Isotropic 3-dimensional finite difference) stencils play a major role in accurate imaging of complex subsurface structures in oil and gas surveys and exploration. Here we leverage the ISO-3DFD discussed in [1] and [2] and...
Authored by Leonardo B. (Intel) Last updated on 07/06/2019 - 16:40
Video

第 4 集:线程并行化和 OpenMP*

我们将讨论软件线程,尤其是使用 OpenMP 库的多线程实施。

Authored by Last updated on 04/26/2019 - 04:06