过滤器

视频

第 4.8 集并行规约

我们将讨论 OpenMP for 循环中的并行规约。

作者: tianhui s. 最后更新时间: 2017/06/14 - 11:30
Article

2012 Cloud Computing and Its Core Technology,CCF YOCSEF-Intel Workshop

回到英特尔学术社区首页>>

作者: 管理 最后更新时间: 2017/06/14 - 13:05
博客

线程并行化的概念及其用法

An Intro to Multi-Level Parallelism for High-Performance Computing by Clay Breshears | Life Sciences Software Architect, Intel
作者: Clay B. 最后更新时间: 2017/06/14 - 16:03
Article

整理您的数据和代码: 数据和布局 - 第 2 部分

Apply the concepts of parallelism and distributed memory computing to your code to improve software performance. This paper expands on concepts discussed in Part 1, to consider parallelism, both vectorization (single instruction multiple data SIMD) as well as shared memory parallelism (threading), and distributed memory computing.
作者: David M. 最后更新时间: 2017/06/07 - 12:22
Article

Fletcher 校验和的快速计算能力

Checksums are widely used for checking the integrity of data in applications such as storage and networking. We present fast methods of computing checksums on Intel® processors. Instead of computing the checksum of the input with a traditional linear method, we describe a faster method to split the data into a number of interleaved parallel streams, compute the checksum on these segments in...
作者: James Guilford (Intel) 最后更新时间: 2017/06/07 - 10:51
Article

循环修改增强数据并行性能

When confronted with nested loops, the granularity of the computations that are assigned to threads will directly affect performance. Loop transformations such as splitting and merging nested loops can make parallelization easier and more productive.
作者: 管理 最后更新时间: 2017/06/01 - 11:27
Article

通过避免或消除人工相关性实现并行性

Many applications and algorithms contain serial optimizations that inadvertently introduce data dependencies and inhibit parallelism. One can often remove such dependences through simple transforms, or even avoid them altogether through.
作者: 管理 最后更新时间: 2017/06/07 - 12:25
Article

异构分布式系统上的有限差分

Our building block is the FD compute kernels that are typically used for RTM (reverse time migration) algorithms for seismic imaging. The computations performed by the ISO-3DFD (Isotropic 3-dimensional finite difference) stencils play a major role in accurate imaging of complex subsurface structures in oil and gas surveys and exploration. Here we leverage the ISO-3DFD discussed in [1] and [2] and...
作者: Leonardo B. (Intel) 最后更新时间: 2017/06/07 - 12:03
Article

方案:基于英特尔® 至强融核™ 处理器 x 200 的面向深度学习优化的 Caffe*

The computer learning code Caffe* has been optimized for Intel® Xeon Phi™ processors. This article provides detailed instructions on how to compile and run this Caffe* optimized for Intel® architecture to obtain the best performance on Intel Xeon Phi processors.
作者: Vamsi Sripathi (Intel) 最后更新时间: 2017/08/10 - 03:33
Article

PARSEC* 3.0 中的多线程代码优化: BlackScholes

The Black-Scholes benchmark is a one of the 13 benchmarks in the PARSEC. This benchmark does option pricing with Black-Scholes Partial Differential Equation (PDE). The Black-Scholes equation is a differential equation that describes how, under a certain set of assumptions, the value of an option changes as the price of the underlying asset changes. Based on this formula, one can compute the...
作者: Artem G. (Intel) 最后更新时间: 2017/06/07 - 12:15
有关编译器优化的更完整信息,请参阅优化通知