博客

The switch() statement isn't really evil, right?

In my current position, I work to optimize and parallelize codes that deal with genomic data, e.g., DNA, RNA, proteins, etc.

作者: Clay B. (Blackbelt) 最后更新时间: 2019/07/04 - 10:46
博客

Building a Local Developer Community: A Conversation with Intel East Africa SSG's Fredrick Odhiambo

The Intel Software and Services Group (SSG) opened its first office in East Africa in April 2013. This was a big move for Intel, which had previously only had offices in South Africa and Egypt.

作者: 最后更新时间: 2018/01/24 - 12:12
Article

Intel(R) Metrics Framework Getting Started Guide

Click "Download Now" below to obtain a copy of Intel(R) Metrics Framework Getting Started Guide.

作者: 最后更新时间: 2019/07/12 - 14:52
博客

Null Pointer Dereferencing Causes Undefined Behavior

I have unintentionally raised a large debate recently concerning the question if it is legal in C/C++ to use the &P->m_foo expression with P being a null pointer.

作者: Andrey Karpov (Blackbelt) 最后更新时间: 2018/05/30 - 07:08
博客

The Last Line Effect

作者: Andrey Karpov (Blackbelt) 最后更新时间: 2018/05/30 - 07:00
Article

Using Intel® MKL and Intel® TBB in the same application

Intel MKL 11.3 has introduced Intel TBB support.

作者: Gennady F. (Blackbelt) 最后更新时间: 2019/08/01 - 09:22
Article

Implementing a Masked SVML-like Function Explicitly in User-Defined Way

The Intel® Compiler provides SIMD intrinsics APIs for short vector math library (SVML) and starting with Intel® Advanced Vector Extensions

作者: 最后更新时间: 2019/07/16 - 08:37
Article

Intel® Math Kernel Library - Introducing Vectorized Compact Routines

Introduction     
作者: Gennady F. (Blackbelt) 最后更新时间: 2019/07/04 - 21:35
Article

Intel® Data Analytics Acceleration Library - Decision Trees

Decision trees method is one of most popular approaches in machine learning. They can easily be used to solve different classification and regression tasks.
作者: Gennady F. (Blackbelt) 最后更新时间: 2019/09/17 - 16:25
博客

The JITter Conundrum - Just in Time for Your Traffic Jam

In interpreted languages, it just takes longer to get stuff done - I earlier gave the example where the Python source code a = b + c would result in a BINARY_ADD byte code which takes 78 machine instructions to do the add, but it's a single native ADD instruction if run in compiled language like C or C++. How can we speed this up? Or as the performance expert would say, how do I decrease...
作者: David S. (Blackbelt) 最后更新时间: 2019/10/15 - 19:42