This will be the final post in my planned short vectorization series. Although I reserve the right to post more on vectorization in the future!
As part of my focus on software performance, I also support and consult on implementing scalable parallelism in applications.
It is time to make Parallelism a full First Class Citizen in C and C++. Hardware is once again ahead of software, and we need to close the gap so that application development is better able to uti
The N-Body problem is a classic example used frequently to demonstrate parallelization and how it improves performance.
This blog contains additional content for the article "Advanced Vectorization" from Parallel Universe #12:
Big Data requires processing huge amounts of data. Intel Advanced Vector Extensions 2 (aka AVX2) promoted most Intel AVX 128-bits integer SIMD instruction sets to 256-bits.