Use these parallel programming resources and books with your Intel® Xeon® processor and Intel® Xeon Phi™ processor family
Intel MKL 11.3 has introduced Intel TBB support.
As I mentioned in my previous post about writing a vectorized reduction code from Intel vector intrinsics, that part of the code was just the finishing touch on a loop computing squared difference of complex values.
Intel® Parallel Studio XE is a very popular product from Intel that includes the Intel® Compilers, Intel® Performance Libraries, tools for analysis, debugging and tuning, tools for MPI and the Intel® MPI Library. Did you know that some of these are available for free? Here is a guide to “what is available free” from the Intel Parallel Studio XE suites.
In the past couple of years I've noticed a trend to "re-invent" technology or re-brand old ideas and concepts from previous computing generations.
Connecting the Dots
Matrix multiplication (MM) of two matrices is one of the most fundamental operations in linear algebra. The algorithm for MM is very simple, it could be easily implemented in any programming language. This paper shows that performance significantly improves when different optimization techniques are applied.
The unfairness of good syntax - bad syntax is a problem; good syntax is not a solution.