In my last blog, I introduced the concept of vectorization, which is parallelism across data elements in a regi
This blog contains additional content for the article "Advanced Vectorization" from Parallel Universe #12:
A couple of years ago, the Bit Manipulation Instruction Set 1 (BMI1) introduced the BLSR instruction, which resets the lowest set bit.
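To make the effect of BLSR concrete, here is a minimal sketch in portable C. It assumes nothing beyond standard C: the expression `x & (x - 1)` computes the same result as BLSR, and compilers can typically emit the instruction for it when BMI1 is enabled (for example with `-mbmi` on GCC or Clang). The function names `reset_lowest_set_bit` and `count_set_bits` are made up for this illustration and do not come from the original posts.

```c
#include <assert.h>
#include <stdint.h>

/* Clear the lowest set bit of x, the same result BLSR produces.
 * Subtracting 1 flips the lowest set bit and every bit below it,
 * so ANDing with the original value removes exactly that bit.
 * For x == 0 the result is 0. */
uint32_t reset_lowest_set_bit(uint32_t x)
{
    return x & (x - 1u);
}

/* Count set bits by clearing one per iteration.
 * The loop runs once per set bit, not once per bit position. */
unsigned count_set_bits(uint32_t x)
{
    unsigned n = 0;
    while (x != 0) {
        x = reset_lowest_set_bit(x);
        ++n;
    }
    return n;
}

/* Hypothetical self-check showing a few input/output pairs. */
void blsr_examples(void)
{
    assert(reset_lowest_set_bit(0xCu) == 0x8u);  /* 1100 -> 1000 */
    assert(reset_lowest_set_bit(0x1u) == 0x0u);  /* 0001 -> 0000 */
    assert(count_set_bits(0xF0F0u) == 8u);
}
```

Clearing one set bit per step is useful when walking sparse bit masks, because the work scales with the number of set bits rather than the width of the word.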
One of my performance focus areas for this year is vectorization.
Big Data requires processing huge amounts of data. Intel Advanced Vector Extensions 2 (aka AVX2) promoted most Intel AVX 128-bit integer SIMD instructions to 256 bits.
Intel® Math Kernel Library includes powerful and versatile random number generators that have been optimized to take full advantage of Intel
We received a request at one of the various "Birds of a Feather" meetings Intel® holds at venues such as the Super Computing* (SC) and International Super Computing* (ISC) conferences.