Article

Single-Producer/Single-Consumer Queue

Unbounded single-producer/single-consumer queue. Internal non-reducible cache of nodes is used. Dequeue operation is always wait-free. Enqueue operation is wait-free in common case. No atomic RMW operations nor heavy memory fences are used.
Authored by Dmitry Vyukov Last updated on 12/12/2018 - 18:00
Article

Optimize embarrassingly parallel loops

Algorithms that display data parallelism with iteration independence lend themselves to loops that exhibit ‘embarrassingly parallel’ code. We look at examples to maximize the performance of such loops with minimal effort.
Authored by Ganesh R. (Intel) Last updated on 12/31/2018 - 15:00
Article

Namespace-scope using-declarations for class member types

In Microsoft compatibility, the namespace-scope using-declarations for class member types are no longer accepted by Intel C++ compiler.
Authored by admin Last updated on 05/25/2018 - 15:30
Article

Parallel Lint

The article describes a new direction in development of static code analyzers - verification of parallel programs. The article reviews several static analyzers which can claim to be called "Parallel Lint".
Authored by Andrey Karpov (Blackbelt) Last updated on 05/25/2018 - 15:30
Article

About cross compiling

Summary related information how to cross compile projects using -32 and -m64
Authored by Last updated on 05/25/2018 - 15:30
Article

Variables with "__declspec(dllimport)" Should Not Be Initialized Where Used

initializing dllimport variable in user code error: variable may not be initialized
Authored by Last updated on 12/27/2018 - 11:07
Article

enums are treated as unsigned ints in ISO C standard

/Za equivalent in Linux is -strict-ansi strict Ansi-compliance
Authored by admin Last updated on 06/01/2017 - 11:32
Article

Intel® AVX C/C++ Intrinsics Emulation

This C/C++ header file can be used for AVX emulation on Intel CPUs/processors without h/w AVX support
Authored by Max Locktyukhin (Intel) Last updated on 02/12/2019 - 13:16
Article

IIR Gaussian Blur Filter Implementation using Intel® Advanced Vector Extensions

This white paper proposes an implementation for the Infinite Impulse Response (IIR) Gaussian blur filter using Intel® Advanced Vector Extensions (Intel® AVX) instructions. For a 2048x2048 image size, the AVX implementation is ~2X faster than the SSE code.
Authored by Brijender Bharti (Intel) Last updated on 06/07/2017 - 12:19
Article

FFT Length and Layout Advisor

Multidimensional Fast Fourier Transform (FFT) - selecting optimal sizes and data layout
Authored by Last updated on 07/07/2019 - 10:48