Фильтры

Article

Using Tasks Instead of Threads

Tasks are a lightweight alternative to threads that provide faster startup and shutdown times, better load balancing, an efficient use of available resources, and a higher level of abstraction.
Автор: админ Последнее обновление: 05.07.2019 - 09:41
Article

Exploiting Data Parallelism in Ordered Data Streams

This article identifies some of these challenges and illustrates strategies for addressing them while maintaining parallel performance.
Автор: админ Последнее обновление: 05.07.2019 - 14:50
Article

Performance Benefits of Half Precision Floats

Half precision floats are 16-bit floating-point numbers, which are half the size of traditional 32-bit single precision floats, and have lower precision and smaller range.

Автор: Patrick Konsor (Intel) Последнее обновление: 10.07.2019 - 17:05
Блоги

最快线程间数据交换算法,有效避免锁竞争 -- TwoQueues

处理多线程数据共享问题注意的几个要点:

1、锁竞争:尽量减少锁竞争的时间和次数。

2、内存:尽量是使用已分配内存,减少内存分配和释放的次数。尽量是用连续内存,减少共享占用的内存量。

多线程数据交换简单方案A:

定义一个list,再所有操作list的地方进行加锁和解锁。

简单模拟代码:

Автор: Последнее обновление: 04.07.2019 - 21:30
Article

Vectorizing Loops with Calls to User-Defined External Functions

Introduction

Автор: Anoop M. (Intel) Последнее обновление: 12.12.2018 - 18:00
Блоги

The switch() statement isn't really evil, right?

In my current position, I work to optimize and parallelize codes that deal with genomic data, e.g., DNA, RNA, proteins, etc.

Автор: Clay B. (Blackbelt) Последнее обновление: 04.07.2019 - 10:46
Article

Improve Intel® MKL Performance for Small Problems: The Use of MKL_DIRECT_CALL

One of the big new features introduced in the Intel® Math Kernel Library (Intel® MKL) 11.2 is the greatly improved performance for small problem sizes.

Автор: Zhang, Zhang (Intel) Последнее обновление: 07.07.2019 - 10:35
Блоги

opencl_node overview

Introduction
Автор: Alex (Intel) Последнее обновление: 30.05.2018 - 07:08
Article

使用 OpenCL™ 2.0 读写图片

While Image convolution is not as effective with the new Read-Write images functionality, any image processing technique that needs be done in place may benefit from the Read-Write images. One example of a process that could be used effectively is image composition. In OpenCL 1.2 and earlier, images were qualified with the “__read_only” and __write_only” qualifiers. In the OpenCL 2.0, images can...
Автор: Последнее обновление: 31.05.2019 - 14:20
Article

Using Intel® MPI Library 5.1 on Microsoft* Windows* with Microsoft* MPI based applications

Why it is needed?
Автор: Dmitry S. (Intel) Последнее обновление: 12.12.2018 - 20:11