Blog post

最快线程间数据交换算法,有效避免锁竞争 -- TwoQueues

处理多线程数据共享问题注意的几个要点:

1、锁竞争:尽量减少锁竞争的时间和次数。

2、内存:尽量是使用已分配内存,减少内存分配和释放的次数。尽量是用连续内存,减少共享占用的内存量。

多线程数据交换简单方案A:

定义一个list,再所有操作list的地方进行加锁和解锁。

简单模拟代码:

Authored by Last updated on 07/04/2019 - 21:30
Article

使用 OpenCL™ 2.0 读写图片

While Image convolution is not as effective with the new Read-Write images functionality, any image processing technique that needs be done in place may benefit from the Read-Write images. One example of a process that could be used effectively is image composition. In OpenCL 1.2 and earlier, images were qualified with the “__read_only” and __write_only” qualifiers. In the OpenCL 2.0, images can...
Authored by Last updated on 05/31/2019 - 14:20
Article

使用任务(而非线程)

Tasks are a lightweight alternative to threads that provide faster startup and shutdown times, better load balancing, an efficient use of available resources, and a higher level of abstraction.
Authored by admin Last updated on 07/05/2019 - 09:51
Article

利用有序数据流中的数据并行性

This article identifies some of these challenges and illustrates strategies for addressing them while maintaining parallel performance.
Authored by admin Last updated on 07/05/2019 - 14:50
Article

SPDK 助力加速 NVMe 硬盘

The Storage Performance Development Kit (SPDK) is an open source set of tools and libraries hosted on GitHub that helps developers create high-performance and scalable storage applications. This tutorial will focus on the userspace NVMe driver provided by SPDK and will step you through a Hello World example.
Authored by Steven B. (Intel) Last updated on 07/05/2019 - 19:40
Article

自动矢量化失败后应该怎么办?

This article completes an analysis of a problem erroneously reported on the Intel® Developer Zone forum: Vectorization failed because of unsigned integer? It provides a more detailed examination showing that unsigned integer is not impacting compiler vectorization but what methodology to use when a modern C/C++ compiler fails to auto-vectorize for-loops.
Authored by Last updated on 07/05/2019 - 14:46
Blog post

弃用 PCOMMIT 指令

PCOMMIT 指令已被弃用。
Authored by Rudoff, Andy M (Intel) Last updated on 07/04/2019 - 19:20
Article

如何在英特尔® 架构服务器上仿真持久性内存

了解如何在采用 4.3 或更高版本的 Linux* 内核的英特尔® 处理器上使用常规动态随即访问内存 (DRAM) 设置持久性内存仿真。本文将介绍其硬件配置和初始设置。
Authored by Thai Le (Intel) Last updated on 07/05/2019 - 20:07
Article

英特尔® 至强融核™ 处理器优化教程

In this tutorial, we demonstrate some possible ways to optimize an application to run on the Intel® Xeon Phi™ processor
Authored by Nguyen, Loc Q (Intel) Last updated on 09/30/2019 - 17:28
Article

面向英特尔® 至强融核™ 处理器的 Offload over Fabric教程

This tutorial shows how to install Offload over Fabric (OoF) software on 2nd generation Intel® Xeon Phi™ processor, configure the hardware, test the basic configuration, and enable OoF
Authored by Nguyen, Loc Q (Intel) Last updated on 09/30/2019 - 17:28