Article

OpenMP* and the Intel® IPP Library

How to configure OpenMP in the Intel IPP library to maximize multi-threaded performance of the Intel IPP primitives.
Last updated on 07/31/2019 - 14:30
Article

Improving Averaging Filter Performance Using Intel® Cilk™ Plus

Intel® Cilk™ Plus is an extension to the C and C++ languages to support data and task parallelism. It provides three new keywords to i...

Authored by Anoop M. (Intel) Last updated on 12/12/2018 - 18:00
Article

Vectorizing Loops with Calls to User-Defined External Functions

Authored by Anoop M. (Intel) Last updated on 12/12/2018 - 18:00
Article

Fast Gathering-based SpMxV for Linear Feature Extraction

This algorithm can be used to improve sparse matrix-vector and matrix-matrix multiplication in any numerical computation. Many High Performance Computing applications involve semi-sparse matrix computation. Semi-sparse matrices are also very common in popular low-level perceptual computing engines, especially those for speech and facial recognition....
Last updated on 12/12/2018 - 18:00
Article

Fast Computation of Huffman Codes

Huffman code generation is used in many applications, among them the DEFLATE compression algorithm. The classical way to compute these codes uses a heap data structure. This approach is fairly efficient, but traditional software implementations contain many data-dependent branches, which are hard for general-purpose CPU hardware to predict. On modern processors with deep...
Authored by James Guilford (Intel) Last updated on 07/09/2019 - 16:09
Article

Implementing a Masked SVML-like Function Explicitly in User-Defined Way

The Intel® Compiler provides SIMD intrinsic APIs for the Short Vector Math Library (SVML) and, starting with Intel® Advanced Vector Extensions...

Last updated on 07/16/2019 - 08:37
Article

Vector and Parallel Programming with Intel® VTune™ Amplifier

Eduardo H. M. Cruz, Matheus S. Serpa, Arthur M. Krause, Philippe O. A. Navaux

Last updated on 12/12/2018 - 18:00
Blog post

Big Datasets from Small Experiments

Authored by Andrey Vladimirov Last updated on 07/04/2019 - 18:46
Blog post

Brain Development Simulation, 300x Faster

Authored by Andrey Vladimirov Last updated on 07/04/2019 - 17:45
Article

Caffe* Optimized for Intel® Architecture: Applying Modern Code Techniques

This paper demonstrates a special version of Caffe*, a deep learning framework originally developed by the Berkeley Vision and Learning Center (BVLC), that is optimized for Intel® architecture.
Last updated on 10/15/2019 - 15:30