Article

Single-Producer/Single-Consumer Queue

An unbounded single-producer/single-consumer queue. It keeps an internal cache of nodes that never shrinks. The dequeue operation is always wait-free; the enqueue operation is wait-free in the common case. Neither atomic RMW operations nor heavy memory fences are used.
Author: Dmitry Vyukov Last updated: 2018/12/12 - 18:00
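
The properties listed in the summary above can be illustrated with a minimal sketch. It is not the article's code: the class name `SpscQueue`, the member names, the 64-byte alignment, and the requirement that `T` be default-constructible are all assumptions made for illustration. It uses only atomic loads and stores (no RMW, no sequentially consistent fences), and the producer recycles nodes the consumer has already passed, so the node cache grows but is never released before destruction.

```cpp
#include <atomic>
#include <utility>

// Illustrative sketch only; names and layout are assumptions, not the article's code.
// Exactly one thread may call enqueue() and exactly one thread may call dequeue().
template <typename T>
class SpscQueue {
public:
    SpscQueue() {
        Node* stub = new Node;  // dummy node so head_ and tail_ are never null
        tail_.store(stub, std::memory_order_relaxed);
        head_ = first_ = tail_copy_ = stub;
    }

    ~SpscQueue() {
        // Every node, cached or live, is reachable from first_ via next.
        Node* n = first_;
        while (n != nullptr) {
            Node* next = n->next.load(std::memory_order_relaxed);
            delete n;
            n = next;
        }
    }

    SpscQueue(const SpscQueue&) = delete;
    SpscQueue& operator=(const SpscQueue&) = delete;

    // Producer side. Allocates only when the node cache is empty.
    void enqueue(T value) {
        Node* n = alloc_node();
        n->next.store(nullptr, std::memory_order_relaxed);
        n->value = std::move(value);
        head_->next.store(n, std::memory_order_release);  // publish to consumer
        head_ = n;
    }

    // Consumer side. Returns false if the queue is empty.
    bool dequeue(T& out) {
        Node* tail = tail_.load(std::memory_order_relaxed);
        Node* next = tail->next.load(std::memory_order_acquire);
        if (next == nullptr) return false;
        out = std::move(next->value);
        tail_.store(next, std::memory_order_release);  // old tail becomes reusable
        return true;
    }

private:
    struct Node {
        std::atomic<Node*> next{nullptr};
        T value{};
    };

    // Take a node from the cache (nodes between first_ and the consumer's tail),
    // falling back to the heap when none is available.
    Node* alloc_node() {
        if (first_ == tail_copy_) {
            tail_copy_ = tail_.load(std::memory_order_acquire);  // refresh snapshot
        }
        if (first_ != tail_copy_) {
            Node* n = first_;
            first_ = n->next.load(std::memory_order_relaxed);
            return n;
        }
        return new Node;
    }

    // Consumer-owned data, kept apart from producer data to limit false sharing.
    alignas(64) std::atomic<Node*> tail_;

    // Producer-owned data.
    alignas(64) Node* head_;  // most recently enqueued node
    Node* first_;             // oldest node still owned by the queue
    Node* tail_copy_;         // producer's possibly stale view of tail_
};
```

In this sketch the fallback to `new Node` is the only step that is not wait-free, which is why enqueue is described as wait-free only in the common case; dequeue never allocates or loops.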
Article

Improving Averaging Filter Performance Using Intel® Cilk™ Plus

Intel® Cilk™ Plus is an extension to the C and C++ languages to support data and task parallelism.  It provides three new keywords to i

Author: Anoop M. (Intel) Last updated: 2018/12/12 - 18:00
Article

Putting Your Data and Code in Order: Optimization and Memory – Part 1

This series of two articles discusses how data and memory layout affect performance and suggests specific steps to improve software performance. The basic steps shown in these two articles can yield significant performance gains. These two articles are designed at an intermediate level. It is assumed the reader desires to optimize software performance using common C, C++ and Fortran* programming...
Author: David M. Last updated: 2018/12/12 - 18:00
Article

Putting Your Data and Code in Order: Optimization and Memory, Part 1 (Chinese edition)

This series of two articles discusses how data and memory layout affect performance and suggests specific steps to improve software performance. The basic steps shown in these two articles can yield significant performance gains. These two articles are designed at an intermediate level. It is assumed the reader desires to optimize software performance using common C, C++ and Fortran* programming...
Author: David M. Last updated: 2018/12/12 - 18:00
Article

Monte Carlo Simulation for Asian Option Pricing (Chinese edition)

This is an exercise in performance optimization on heterogeneous Intel architecture systems based on multi-core processors and manycore (MIC) coprocessors.
Author: Mike P. (Intel) Last updated: 2019/09/30 - 17:30
Article

Monte-Carlo simulation on Asian Options Pricing

This is an exercise in performance optimization on heterogeneous Intel architecture systems based on multi-core processors and manycore (MIC) coprocessors.
Author: Mike P. (Intel) Last updated: 2019/09/30 - 17:28
Article

Intel® Xeon Phi™ Processor Optimization Tutorial (Chinese edition)

In this tutorial, we demonstrate some possible ways to optimize an application to run on the Intel® Xeon Phi™ processor.
Author: Nguyen, Loc Q (Intel) Last updated: 2019/09/30 - 17:28
Article

Direct N-body Simulation

Exercise in performance optimization on Intel Architecture, including Intel® Xeon Phi™ processors.
Author: Mike P. (Intel) Last updated: 2019/09/30 - 17:28
Article

Code Sample: Allocate Memory Efficiently on an Intel® Xeon Phi™ Processor

How to efficiently use Multi-Channel DRAM (MCDRAM) and synchronous dynamic random-access memory.
Author: Mike P. (Intel) Last updated: 2019/10/15 - 15:30
Article

How to use the MPI-3 Shared Memory in Intel® Xeon Phi™ Processors

Code sample included: learn how to use the MPI-3 shared memory feature through the corresponding APIs on the Intel® Xeon Phi™ processor.
Author: Nguyen, Loc Q (Intel) Last updated: 2019/10/15 - 15:30