Article

An Introduction to the Cilk Screen Race Detector

One of the pitfalls of parallel programming is the need to consider whether your code modifies a memory location in two parallel strands.

Criado por Barry Tannenbaum (Intel) Última atualização em 25/05/2018 - 15:30
Mensagem de blog

The switch() statement isn't really evil, right?

In my current position, I work to optimize and parallelize codes that deal with genomic data, e.g., DNA, RNA, proteins, etc.

Criado por Clay B. (Blackbelt) Última atualização em 04/07/2019 - 10:46
Article

Intel® Platform Analysis Library Metrics Framework User Guide

Click "Download" below to obtain and view Intel® Platform Analysis Library Metrics Framework User Guide

Criado por Última atualização em 23/06/2019 - 18:50
Mensagem de blog

Null Pointer Dereferencing Causes Undefined Behavior

I have unintentionally raised a large debate recently concerning the question if it is legal in C/C++ to use the &P->m_foo expression with P being a null pointer.

Criado por Andrey Karpov (Blackbelt) Última atualização em 30/05/2018 - 07:08
Article

Using Intel® MKL and Intel® TBB in the same application

Intel MKL 11.3 has introduced Intel TBB support.

Criado por Gennady F. (Blackbelt) Última atualização em 01/08/2019 - 09:22
Article

Intel® Math Kernel Library - Introducing Vectorized Compact Routines

Introduction     
Criado por Gennady F. (Blackbelt) Última atualização em 04/07/2019 - 21:35
Article

Intel® Data Analytics Acceleration Library - Decision Trees

Decision trees method is one of most popular approaches in machine learning. They can easily be used to solve different classification and regression tasks.
Criado por Gennady F. (Blackbelt) Última atualização em 17/09/2019 - 16:25
Mensagem de blog

The JITter Conundrum - Just in Time for Your Traffic Jam

In interpreted languages, it just takes longer to get stuff done - I earlier gave the example where the Python source code a = b + c would result in a BINARY_ADD byte code which takes 78 machine instructions to do the add, but it's a single native ADD instruction if run in compiled language like C or C++. How can we speed this up? Or as the performance expert would say, how do I decrease...
Criado por David S. (Blackbelt) Última atualização em 15/10/2019 - 19:42
Mensagem de blog

Core Challenge In Speeding Up Python, PHP, HHVM, Node.js...

A traditional compiler translates a high-level computer program into machine code for the CPU you want to run it on. An interpreted language translates a high-level language into the machine code for some imaginary CPU. For historical reasons, this imaginary CPU is called a "virtual machine" and its instructions are called "byte code." One advantage of this approach is development speed: creating...
Criado por David S. (Blackbelt) Última atualização em 15/10/2019 - 19:43
Mensagem de blog

The Last Line Effect

Criado por Andrey Karpov (Blackbelt) Última atualização em 20/11/2019 - 08:19