This blog contains additional content for the article "Advanced Vectorization" from Parallel Universe #12:
Matrix multiplication (MM) is one of the most fundamental operations in linear algebra. The algorithm itself is very simple and can be implemented easily in any programming language. This paper shows that performance improves significantly when different optimization techniques are applied.
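To make the "very simple" algorithm concrete, here is a minimal sketch in C. It is an illustration, not the code from the article. It assumes square `n` x `n` matrices of `double` stored row-major in flat arrays, and the function names `matmul_naive` and `matmul_ikj` are hypothetical. The second function shows one common optimization, loop interchange, chosen here only as an example of the kind of change that tends to help a compiler vectorize the inner loop.

```c
#include <assert.h>
#include <stddef.h>

/* Textbook triple loop: c = a * b for n x n row-major matrices.
 * The inner loop reads b with a stride of n elements, which is
 * unfriendly to caches and to automatic vectorization. */
void matmul_naive(size_t n, const double *a, const double *b, double *c)
{
    assert(a && b && c);
    for (size_t i = 0; i < n; ++i) {
        for (size_t j = 0; j < n; ++j) {
            double sum = 0.0;
            for (size_t k = 0; k < n; ++k)
                sum += a[i * n + k] * b[k * n + j];
            c[i * n + j] = sum;
        }
    }
}

/* Same product with the loops reordered to i-k-j (an assumed example
 * optimization). The inner loop now walks a row of b and a row of c
 * contiguously, so consecutive iterations touch adjacent memory. */
void matmul_ikj(size_t n, const double *a, const double *b, double *c)
{
    assert(a && b && c);
    for (size_t idx = 0; idx < n * n; ++idx)
        c[idx] = 0.0;                      /* c accumulates, so clear it first */
    for (size_t i = 0; i < n; ++i) {
        for (size_t k = 0; k < n; ++k) {
            const double aik = a[i * n + k];   /* invariant in the inner loop */
            for (size_t j = 0; j < n; ++j)
                c[i * n + j] += aik * b[k * n + j];
        }
    }
}
```

Both functions compute the same product; only the memory access pattern differs. Whether the reordered version is actually faster depends on the compiler, its flags, and the matrix size, so it should be measured rather than assumed.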
The purpose of this demo is to show the advantage of the Westmere Crypto Acceleration Engine.