In my last blog, I introduced the concept of vectorization, which is parallelism across data elements in a regi
This blog contains additional content for the article "Advanced Vectorization" from Parallel Universe #12:
Performance tuning of an existing application is truly a challenge and it depends on a lot of factors like the nature of algorithm the application works on, if the implementation is scalable
Intel® Cilk™ Plus is an extension to the C and C++ languages to support data and task parallelism. It provides three new keywords to i
This article explains the sparse ruler problem, two parallel codes for computing sparse rulers, and some new results that reveal a surprising "gap" behavior for solutions to the sparse ruler proble
Trusted Tools in the New Android* World: Optimization Techniques - from Intel® SSE Intrinsics to Intel® Cilk™ Plus
Author: Zvi Danovich, Senior SW Application Engineer, Intel
Big Data requires processing huge amounts of data. Intel Advanced Vector Extensions 2 (aka AVX2) promoted most Intel AVX 128-bits integer SIMD instruction sets to 256-bits.